Detection of Rare Earth Elements in Minerals and Soils by Laser-Induced Breakdown Spectroscopy (LIBS) Using Interval PLS

: The numerous applications of rare earth elements (REE) has lead to a growing global demand and to the search for new REE deposits. One promising technique for exploration of these deposits is laser-induced breakdown spectroscopy (LIBS). Among a number of advantages of the technique is the possibility to perform on-site measurements without sample preparation. Since the exploration of a deposit is based on the analysis of various geological compartments of the surrounding area, REE-bearing rock and soil samples were analyzed in this work. The ﬁeld samples are from three European REE deposits in Sweden and Norway. The focus is on the REE cerium, lanthanum, neodymium and yttrium. Two different approaches of data analysis were used for the evaluation. The ﬁrst approach is univariate regression (UVR). While this approach was successful for the analysis of synthetic REE samples, the quantitative analysis of ﬁeld samples from different sites was inﬂuenced by matrix effects. Principal component analysis (PCA) can be used to determine the origin of the samples from the three deposits. The second approach is based on multivariate regression methods, in particular interval PLS ( i PLS) regression. In comparison to UVR, this method is better suited for the determination of REE contents in heterogeneous ﬁeld samples.


Introduction
Rare earth elements (REE) are used in a wide range of modern technologies, which include, for instance, renewable energy technologies, communication technologies and petrochemistry [1][2][3]. Lanthanum and neodymium are relevant for electric mobility, for example, as well as for optimizing the material properties of steel alloys in conjunction with cerium. In the future, zirconium dioxide stabilized by yttrium will be applied as electrolyte material in fuel cells [2]. These diverse applications, especially in high-tech-industries, drive the global demand for REE. While in 2013 the need for REE in important future technologies accounted for about 30,900 t, studies predict an increasing demand of up to 70,900 t by 2035 [4]. Due to this increasing demand, REE are important resources [5]. Contrary to their name, they are widely distributed in the earth's crust as trace elements. Due to the chemical similarity, elements of neighboring atomic numbers can substitute each other in the crystal structures of various minerals [1,6]. The most economically important REE-containing minerals are carbonates and phosphates, such as monazite, xenotime, bastnaesite or parisite. Within the minerals, the proportions of the various REE vary. Light REE (LREE), all elements from lanthanum to europium, predominate mainly in bastnaesite and monazite, whereas xenotime is a source of heavy REE (HREE), from gadolinium to lutetium and yttrium. This distribution depends on the deposit, its surrounding area of a potential REE deposit using LIBS and univariate and multivariate iPLS regression. In this study, mineral, rocks and soil samples from the Scandinavian deposits Norra Kärr and Olserum in Sweden and Fen-Complex in Norway are investigated. Their elemental contents were determined by reference analysis (ICP-OES and XRF). The relatively small number of samples examined is a result of the high costs of sampling, transport and reference analysis. This problem accompanies geologists in their daily work and is used in this study to demonstrate the potential of our methods. In order to characterize the influence of matrix effects, synthetic REE samples produced from REE chloride salts in serpentinite and a model soil as well as the field samples are compared. All samples are analyzed with respect to the elements cerium (Ce), lanthanum (La), neodymium (Nd) and yttrium (Y). The results of the analysis with LIBS are evaluated with UVR and iPLS regression.

Samples and Reference Analysis
A total of 167 samples containing REE were analyzed. Of these, 122 were artificial samples (hereafter referred to as synthetic) and 45 were natural samples (hereafter referred to as field). In both cases, a further distinction was made between rock and soil samples. All samples were available as ground powders. The concentrations of REE in the synthetic samples are summarized in Table S1 in the Supplementary Materials. Table S2 contains the reference data of the composition of the field samples.

Geological Overview of Deposits
The field samples were collected at three different deposits in Sweden and Norway. 19 of these samples were taken in the Olserum-Jupval REE deposit in south east Sweden. Sampled outcrops are located ca. 7 km NW of Gambleby Västervik region (Figure 1a). REEphosphates-rich dikes cross cut metasedimentary wall rocks (cordierite-bearing gneisses). The phosphates comprise monazite (LREE), xenotime (Y, HREE) and fluor-apatite (LREE). In addition, allanite (REE-silicate), and the REE carbonates bastnaesite and synchisite-Ce occur. The origin of the REE-deposit was studied in detail by Andersson et al. [22], Fullerton [23], Russel [24] and Gavelin [25]. REE-phospate-rich dikes of up to 2 m thickness occur near the contact to granitoids of the Transscandinavian Igneous Belt (ca. 1.85-1.65 Ga [26]) in the metasedimentary Västervik Formation. The samples are from two dikes located at a distance of ca. 150 m (from 57.94839N to 57.96364N, from 16.35213E to 16.32522E) [22,23,27].
The Norra-Kärr Complex is a small, magmatic intrusion with an "agpaitic composition", i.e., (Na + K) > Al. The complex has an elliptical shape with a north-south extension of approx. 1300 m and an east-west extension of approx. 500 m (Figure 1a,b). The Norra Kärr Complex (ca. 1.5 Ga old [28]) intruded into igneous rocks of the Trans-Scandinavian Magmatic Belt (ca. 1.8-1.7 Ga [29]). It essentially consists of various types of nepheline syenites [26]. The original plutonic rocks show evidence of strong deformation and metamorphic overprint [30]. The tectonometamorphic overprinting is assigned to the Sveconorwegian orogeny (ca. 1048 Ga [31]). The rock sequence is known as a zirconium deposit due to the high abundance of Zr-rich minerals [26,32]. The major REE-bearing minerals are the silicates eudialyte, catapleiite and cerite. Eudialyte occurs as a REE-poor magmatic type and a HREE-rich secondary (metamorphic) type, which is the major REEbearing phase. The analyzed rock and soil samples are from different parts of the intrusion, with slightly different compositions. The REE concentrations of the rocks are up to 1 wt.%. In addition, samples were taken from outside the intrusion as a reference (subsoil and topsoil Nr. 20). Another sample obtained to the east of the intrusion was contaminated by hydrothermal fluids, a so called fenitized rock. A total of 17 samples form Norra Kärr were examined [26][27][28][29][30][31][32]. The Fen Complex, located in the Telemark province of southern Norway, represents an early Cambrian magmatic intrusion complex. It is dominated by carbonatites, formed from carbonate melts [33][34][35][36][37]. The complex shows a surface outcrop of ca. 10 km 2 . The previous carbonatitic melts intruded into Mesoproterozoic gneisses (Figure 1a). The carbonatites are interlayered by alkaline ultramafic silicate rocks, i.e., containing <40% SiO 2 [26]. All melts generated from a deeper mantle source. The carbonaceous intrusions modified the surrounding gneisses by CO 2 -rich streams leading to alkaline metasomatic reactions. Therefore, the Fen Complex is the type locality of the "fenitisation-process". The Fen Complex is known for its highly enriched thorium and rare earth concentrations [37].  [38]. Nine field samples for analyses of the Fen Complex were provided by the Federal Institute for Geosciences and Natural Resources (BGR) [38].
Reference analysis for the REE cerium, lanthanum and neodymium was carried out by inductively coupled plasma optical emission spectroscopy (ICP-OES) and X-ray fluorescence spectroscopy (XRF spectroscopy) for yttrium. The samples were available as ground powders. For LIBS measurement, all powders were pressed into pellets (TP 40, Herzog Maschinenfabrik, Osnabrück, Germany).

Preparation of the Synthetic Samples
The synthetic samples were prepared by standard addition. REE used as chloride hydrates (CeCl 3 · 7H 2 O, LaCl 3 · 7H 2 O, NdCl 3 · 6H 2 O, YCl 3 · 6H 2 O) were mixed with the respective matrices, soil and rock. For the samples in rock, a milled mixture of serpentinite and weathered crust was used. The matrix used in the soil-based samples is a soil containing loess from central Europe. The salts were obtained from Sigma Aldrich (St. Louis, MO, USA). The weighed rock samples were mixed with 3 wt.% distilled water, homogenized in a mill (MM 400, Retsch, Düsseldorf, Germany) and pressed into pellets (TP 40, Herzog Maschinenfabrik, Osnabrück, Germany). The soil samples were only homogenized and pressed into pellets. For one pellet, 3 g of sample is used. Pellets were prepared separately for each REE and concentration: 16 pellets for both Ce and Nd and 15 pellets for both La and Y were prepared, respectively (see Table S1 in the Supplementary Materials). Also, the pellets containing blank rock and soil were prepared for preliminary measurements but were not involved in iPLS regressions. The REE concentrations in the serpentinite and the soil were much less than 10 ppm.

Instrumental Set up of LIBS and Measurement Parameters
The samples were positioned on a rotating and linearly translating sample desk, resulting in a spiral-like measurement pattern. This ensured a fresh surface on the pellet for each ablation event. The light of the Nd:YAG laser (Bernoulli LIBS, Litron Lasers, Rugby, England, Great Britain, λ = 1064 nm, E = 20 mJ, repetition rate 10 Hz) was focused by a lens (focal length 50 mm) and created the plasma on the sample surface. Emissions were collected by a concave mirror (ME-OPT-0007, Andor Technology, Belfast, Northern Ireland, UK, focal length 52 mm, λ = 200-1100 nm) and focused on an optical fiber to guide the radiation to an echelle spectrometer (Aryelle Butterfly, LTB, Berlin, Germany). The two wavelength ranges (UV range: 190-330 nm, VIS range: 275-750 nm) of the spectrometer were measured separately with a resolution of 20-30 pm (delay time 2 µs). The spectrometer was equipped with an ICCD camera (iStar, AndorTechnology, Belfast, UK) as detector (gate width 10 µs). A total of about 230 spectra per sample were recorded in both the UV and VIS regions. For each spectrum, 10 single shots were accumulated.

Processing and Analysis of Data by Univariate and Multivariate Methods
Origin (OriginLab, Northampton, MA, USA) was used for univariate analysis. Matlab (Version 2019b, MathWorks, Natick, MA, USA) was used for the preprocessing and the multivariate methods PCA and iPLS regression.
Spectra were preprocessed by background correction and standard normal variate (SNV) normalization. A top-hat filter was used for the background correction. The utilized structure element length of 20 data points corresponds to a filter width of approximately 0.26 nm. The SNV normalization followed the background correction. For SNV normalization, the mean value of the entire spectrum was subtracted from the spectrum and the difference was divided by the standard deviation. After processing the data using background correction and SNV normalization, the 230 spectra were averaged so that one spectrum was finally obtained for each sample examined.
For the univariate regression, one element-specific peak was identified for each REE examined, and the peak area (A) was determined by integration of each of the 230 spectra and subsequent calculation mean and standard deviation. The peak area was plotted against the concentration. Principal component analysis was performed on the raw spectra of the field and synthetic samples.
For the iPLS regression, we used the iToolbox from Nørgaard [20]. The 230 spectra were averaged prior to iPLS and a separate iPLS was performed for each element, with the only input variables being the concentrations of the element in question. Cross validation was performed internally in the iPLS algorithm, as well as validation externally with test data excluded from the algorithm. The iPLS regression was based on variable selection in spectral intervals. The number of intervals was varied, with a maximum of 5000 intervals. Interval separation was performed on raw wavelength data. Due to the way echelle data is transformed from the 2D to the (standard) 1D representation, this resulted in intervals with different spectral widths (but constant numbers of sample points). The exact number of intervals and the number of components were determined individually for each sample (Tables 4 and 5) by systematic variation of the interval length. The interval number that resulted in an interval with the smallest RMSE was selected. The PLS regression was performed at this interval. Validation of the iPLS regression results was performed by 5-fold cross-validation with systematic exclusion.

Structure of REE Spectra
LIB spectra of REE-bearing minerals are highly complex due to the line-rich REE [12,13]. The NIST LIBS database displays, e.g., 7723 lines for cerium. Furthermore, REE often occur associated with other REE. In addition, strong emission lines of other elements of the surrounding matrix contribute to the complexity of the spectra. A typical LIB spectrum of a REE-bearing mineral and an enlarged partial spectrum are displayed in Figure 2. As a consequence of the large number, emission lines from different elements can be partially or even completely superimposed. This complicates the assignment of the lines to the numerous elements, especially in spectrometers with a lower resolution. The use of high-resolution spectrometers can reduce these problems in line evaluation ( Figure 3). Compared to spectra with a resolution of ≥100 pm, spectra with a higher resolution can be obtained by spectrometers based on echelle gratings, allowing better resolved peaks to be observed. The echelle spectrometer used here has a spectral resolution of about 25-30 pm in the VIS range. The improved resolution of the spectra enables the separation of partially superimposed lines in lower-resolution instruments and yields a more exact line position. This allows the identification of numerous lines of REE using the NIST database (Figure 2b) [39]. The identified emission lines can then be used in quantitative elemental analysis with LIBS. Two examples of unresolved features in the simulated low-resolution spectrum (resolution of 100 pm) of Nd in Figure 3 are the line triplet at 381.05 nm, which can be completely resolved in the simulated high-resolution spectrum which mimics the spectrum of an echelle spectrometer (resolution of 25 pm), and the duplet at 380.9 nm which can only be partially resolved even in the echelle spectrometer. REE lines that overlap lines of the matrix elements are unsuitable for the investigation. These lines, marked with an asterisk in Table 1 below, are not included in the analyses performed in this work. The other spectral lines listed were used in the univariate and multivariate regression. Table 1. Experimental spectral lines of analyzed REE (shown in Figure 2b) used in this work collected with the echelle spectrometer. Emission wavelengths marked with an asterisk (*) are not used in this work.

Species
Spectral Lines

Univariate Regression
REE do not occur as native metals in nature. Rather, they occur in various rock-forming minerals [6]. This results in field samples being complex mixtures of a wide variety of elements [13]. To evaluate the influence of the matrix elements on LIB spectra of REE, univariate regression analyses were performed on the data sets of the synthetic and field samples experimentally obtained. While the matrix of the synthetic REE samples (one rock or soil) is constant, REE in field samples are embedded in various minerals, rocks or soils. The samples in this work contain Ce, La, Nd and Y at different concentrations. For each REE investigated, a regression analysis was performed at up to 3 emission lines. The corresponding coefficients of determination (R 2 ) and limits of detection (LOD) are reported. The LOD is the quotient of three times the standard deviation of the baseline and the slope of the obtained regression line. The resulting values of LOD are shown in Table 2.
Only spectral lines which occur in the entire examined concentration range were used. Since unsuitable lines also occur for each element (Table 1), different emission lines were used for the analyses in the different matrices. In the investigated field samples, the REE content varies over several orders of magnitude (from 1 ppm up to 3%). Therefore, the presentation of the regression lines is best done by a double logarithmic plot to cover the whole concentration range. All lines investigated are summarized in Table 2 with the corresponding coefficients of determination and LOD.

Univariate Regression of Synthetic Samples
For synthetic REE samples of rocks, the UVR functions of the Ce line at 413.76 nm and the Nd line at 386.34 nm are shown in Figure 4 as examples. Both elements show the best regression results at these emission lines in terms of linearity, which is R 2 = 0.91 for Ce and R 2 = 0.98 for Nd. The other studied REE, La and Y, also have good coefficients of determination at the selected wavelengths: R 2 (La at 379.08 nm) = 0.99 and R 2 (Y at 412.82 nm) = 0.97.  The results of the UVR obtained for the synthetic samples in soils were similar to the synthetic minerals although the emission lines varied. Examples are the results for the REE Ce and Y in soils ( Figure 5). For Ce the emission line at 413.76 nm, and for Y the line at 488.36 nm were used, respectively. Both have high coefficients of determination (R 2 (Ce) = 0.99, R 2 (Y) = 0.98). Coefficients of R 2 > 0.96 were also obtained for the UVR of the elements La (404.29 nm) and Nd (386.34 nm). The other reported lines of the REE in rock and soil also yield good results, but they show reduced linearity as indicated by smaller R 2 (R 2 = 0.90-0.99, Table 2). The detection limits of the REE in rock and soil are similar for most elements. Their average LOD is 200 ppm (except for La in rock, which exhibited an unusually high LOD as a result of the large spread of the data points, very likely due to matrix effects) with a standard deviation of 110 ppm. Nevertheless, the values of the LOD vary between elements and even between the matrices for one element, sometimes significantly.

Univariate Regression of Field REE Samples
In comparison to the synthetic samples, the UVR of the field samples of REE-bearing rocks yielded worse results. This is due to two reasons: first, the matrix, which is significantly more complex than the matrix of the synthetic samples, which consists of only one rock (serpentinite). Second, there are great matrix variations between the different sample locations. The UVR results were again evaluated on the basis of the coefficients of determination. These range from 0.43 to 0.85 for field rock samples (Table 3). An example is the best La line at 408.67 nm (R 2 = 0.82, Figure 6a). Except for Nd, the best emission lines of the other REE investigated yield coefficients of determination around 0.80 (Table 3).  Stronger matrix effects can be observed in the UVR of field soil samples. These samples have an even higher variance in composition, which affects the LIB spectra. UVR of the best lines yields low coefficients of determination between 0.16 and 0.52 (Table 3). This is shown in Figure 6b for the regression lines of La (404.29 nm).
The average LOD for the investigated REE in field samples (soils and rocks) is 300 ppm (except for Nd in rock) with a standard deviation of 140 ppm. Thus, the LOD values are slightly higher than the LOD of the synthetic samples. Better results of the UVR analysis were obtained for those samples that have the same and a less complex matrix composition, i.e., for the synthetic samples of rock or soil. In natural field samples with strong matrix variations, multivariate methods have the potential to compensate the variations in spectral interference caused by the differences in matrix compositions and can be an alternative to the UVR.

Characterization of Matrix Effects by Means of PCA
One way to investigate these influences of the matrices is to characterize the differences in the chemical composition of soil and rock. This is possible by means of unsupervised methods such as principal component analysis (PCA). For this purpose, the LIB spectra of the soils and rocks collected near the three REE deposits are investigated by PCA. The first two principal components are shown in the score plot (Figure 7). For the reference rocks, 82.8% of the variance can be explained by the first two components (Figure 7a). In the case of the reference soils, 96.7% of the variance is explained by the two components (Figure 7b). In the score plot of the rock samples (Figure 7a), the samples of the three deposits form well separated clusters, which are displayed in different colors. The samples of the individual deposit clusters also show a considerable scattering, especially in the case of the Norra Kärr samples. This allows the determination of the provenance of the samples and also indicates the strong differences in the chemical composition of the matrices within and between the deposits, which is caused by the types of rocks present in the individual deposits. All three deposits are composed of siliceous or carbonaceous rocks. However, the included minerals influence the composition and thus allow a differentiation of the deposits by PCA. The igneous rock of the Swedish Norra Kärr deposit (red) is a nepheline-bearing syenite with agpaitic composition. These samples are particularly rich in sodium and potassium. In contrast, the Olserum deposit (green) has a mainly cordierite-bearing gneiss. As a result, the siliceous rock is magnesium-and aluminum-bearing. The rocks of the Fen complex (blue) are carbonatites. Due to chemical and physical weathering processes and accompanying degradation, the data points are partly scattered over large areas even within the clusters. In addition, the widely spaced sampling locations (>50 m) contribute  For the soils, the interpretation of the PCA (Figure 7b) is more complex, since the soils consist not only of physically weathered rock, but also of organic matter from the biosphere and atmospheric dust [40]. Nevertheless, as in the principal component analysis of the reference rocks, a clustering of the data points according to the deposits and their separation from each other can be observed. This probably results from the rocks present. Although the sampling locations in the deposits are far apart, the data points within the PCA are not as split up as in the case of the reference rocks. This could be due to incoming organic matter and atmospheric dust during formation. Thus, the samples are possibly more homogeneous. In contrast, the strong splitting of the data points from the Fen complex (blue) is particularly striking. This is probably due to the different weathering grade of the soils.
In comparison, both PCA show similar results: data points are grouped by deposit and separated from each other. All clusters show an internal separation of the data points. The explained variance of both PCA analyses is already >82% for two principal components. Thus, it is possible to assign the present samples to a single deposit. Surprisingly, the explained variance of the reference soils is larger than that of the rocks. The reason for this could be the nature of the samples. The soil samples are in powder form and thus show a greater macroscopic homogeneity than the investigated rocks. As a result, the properties are more evenly distributed, so that the components of the PCA capture more properties. Thus, the reference soils can be explained with fewer principal components.
However, the differences in the chemical composition of the samples are also evident in both analyses. This heterogeneity of the field samples is striking in the PCA of field and synthetic samples (Figure 8a,b). In addition to the LIB spectra of the reference rocks, the spectra of the synthetic samples (Ce) were included in the analysis. From the principal component analysis (Figure 8a) it can be seen that the synthetic samples are more homogeneous. These data points (orange) are concentrated in a small area of the score plot. Nevertheless, they are not located at exclusively one point in the score plot, as the magnification in Figure 8b shows. In comparison, the data points of the field samples are distributed over a wide area in the plot. This supports our observation that the separation between the samples from different sites is caused by the differences in the chemical composition of the matrix and not by the variations in REEs concentration. After performing PCA, it is clear that UVR is well suited for the synthetic samples because they have a more homogeneous composition. For field samples, on the other hand, univariate regression is unsuitable due to the heterogeneous matrix. For heterogeneous samples, the use of multivariate regression models is therefore examined.

Interval Partial Least-Squares (iPLS) Regression
The multivariate method used was interval partial least-squares (iPLS) regression based on variable selection. In this method, the spectra are divided into intervals of equal spectral width. The number of intervals and the number of components were determined based on the lowest RMSE individually for each sample and are reported in Tables 4 and 5.  The method was first tested on the more homogeneous synthetic samples. As shown by the results in Table 4, the iPLS regression provides similarly good and, in some cases, even better results than the UVR for these samples. Thus, for the synthetic samples of cerium in rock, a better coefficient of determination of 0.98, compared to the UVR (R 2 = 0.91, Cf. Table 2), could be obtained. An improvement of the regression could also be accomplished for yttrium (R 2 (iPLS) = 0.99, R 2 (UVR) = 0.97). For lanthanum and neodymium ( Figure 9a) the coefficients of determination of iPLS regression and UVR are in the same good range. Similar results were obtained for the synthetic soil samples (see univariate data in Table 2 and data of iPLS regression in Table 4). Figure 9b shows the iPLS results of yttrium in soils. For the investigated REE cerium, lanthanum, neodymium and yttrium, coefficients of determination between R 2 = 0.95-0.99 were achieved. These are similar to the R 2 of the UVR (R 2 = 0.96-0.99). The fact that the multivariate method performs as well or even better for the synthetic samples suggest that their application to heterogeneous field samples could be beneficial.
The application of iPLS regression to the heterogeneous field samples yields significantly better regression results than the UVR in all cases investigated. For samples in rock, R 2 > 0.91 can be obtained (Table 5). For lanthanum (Figure 10a), in contrast to the UVR (R 2 = 0.82), a coefficient of determination of R 2 = 0.92 can be achieved by iPLS regression. For neodymium, a R 2 of 0.98 is found by using the multivariate regression method (Figure 10b), which is also significantly better than for UVR (R 2 (UVR) = 0.56). Somewhat less pronounced, but equally good, are the improvements in the regressions of cerium and yttrium in rock. The improvement of the regression is most obvious for the field samples in soils. The best coefficient of determination achieved for a UVR was R 2 = 0.52. All coefficients of determination obtained for iPLS regression are better than R 2 > 0.83 (Table 5). For lanthanum, the univariate regression value R 2 = 0.52 can be increased to R 2 = 0.93 in the multivariate regression. The improvement is similarly strong in the case of neodymium. Comparable to this are the changes in regression for yttrium and cerium.
In the iPLS regression, only the intervals containing relevant information, such as the element lines, are considered (Tables 4 and 5). To that end, very small interval widths had to be used in most cases. The subsequent PLS regression could thus also be considered an automated UVR. Nevertheless, the iPLS regression has higher coefficients of determination than the UVR. In addition to iPLS, PLS regression of the entire spectra was also examined. However, this method was not able to provide a prediction better than guessing. This demonstrates the advantage of variable selection in iPLS. Due to the sometimes extremely narrow intervals, only parts of the relevant element lines are included in the regression. Negative effects, such as self-absorption and partial contamination of the flank of lines, are excluded, resulting in improved regressions with higher coefficients of determination [41].
In particular, the quality of the regression of the field soil samples shows the strength of the multivariate method iPLS regression chosen here. Whereas, in the univariate regression, coefficients of determination of R 2 = 0.47 are achieved on average, the regression can be improved to a mean R 2 = 0.88. In addition to selecting appropriate spectral regions that contain relevant information, variable selection also allows elimination of intervals that contain noise or other irrelevant data. This can consider the influence of matrix effects of the studied rocks and soils.

Summary and Conclusions
LIBS is a promising method for the detection of rare earth elements in rocks and soils, as this technique can be used to detect REE in various matrices with no or minimal sample preparation. However, the spectral influence due to the matrices is problematic. In order to consider this influence and increase the power of the analysis, multivariate methods are required, especially for heterogeneous samples. In the present work, two approaches were used to analyze the LIBS data. The first approach is the UVR based on the peak area of the relevant LIBS signals. Here, the matrix effects of the heterogeneous field samples were identified as problematic and further investigated using PCA. The influence of these effects is considered in the second approach, a multivariate method. An iPLS regression was used in which only relevant regions of the spectrum were included in the regression. This improved the predictive accuracy of the regression. The investigations show that the detection and evaluation of REE in rocks and soils is possible using LIBS in combination with multivariate iPLS regression. The use of mobile LIB spectrometers can thus potentially enable the exploration of new REE deposits.
To increase the predictive power of the method, further investigations should be extended by samples from other deposits. Of particular interest here are REE-bearing samples, which have different accompanying matrices. For future studies, larger intervals and combinations of individual intervals could be used with a smaller number of components. For possible applications in the field, the use of a mobile spectrometer should be further investigated. Should these investigations prove successful, the transfer of the methods developed here into the field could be a promising possibility.
Supplementary Materials: The following are available online at https://www.mdpi.com/article/ 10.3390/min11121379/s1, Table S1: Concentrations of the synthetic samples in rock and soil, Table S2. Reference data of the field samples.

Conflicts of Interest:
The authors declare no conflict of interest.