1. Introduction
Validation of assay methods is a fundamental requirement for laboratories performing quantitative analyses. It is defined as “the process of defining an analytical requirement, and confirming that the method under consideration has capabilities consistent with what the application requires” [
1]. Validation involves demonstrating that the method can produce results that are both consistent and reproducible. Several parameters must be assessed during method validation, including accuracy (i.e., trueness and precision [
2]), specificity, linearity, regression model, working range, method detection limit (MDL), method quantification limit (MQL), and method uncertainty.
Among these, the selection of the regression model and the calculation of method limits often exhibit the greatest discrepancies and variability in how they are determined. Furthermore, the measurement of uncertainty is frequently overlooked by research laboratories, despite being a mandatory requirement for laboratories seeking ISO/IEC 17025 accreditation [
3].
In the case of the regression model, linear functions are the most commonly used, as many instrumental analytical techniques exhibit a linear response across a wide range. However, validation requires assessing whether the response is genuinely linear within the working range defined for the method. Once linearity is demonstrated, a linear regression function must be selected. Most laboratories apply ordinary least squares (OLS) regression without evaluating whether the response of the dependent variable is homoscedastic (i.e., constant variability across the calibration range) or heteroscedastic (i.e., non-equivalent variability).
It is important to note that OLS regression assumes that the absolute errors of the dependent variable (i.e., standard deviation, SD) are constant across the calibration range, an assumption of homoscedasticity. However, in analytical and bioanalytical calibrations, this assumption is rarely valid. More commonly, the absolute errors vary while the relative standard deviation (RSD) remains constant [
4,
5], indicating that these calibrations are typically heteroscedastic rather than homoscedastic.
Studies have shown that the use of OLS regression under heteroscedastic conditions leads to problems, particularly at low calibration levels. Under such conditions, OLS produces biased estimates of the intercept [
6] and tends to overestimate variability at low concentrations by factors greater than 10 [
7]. As a result, OLS fails to provide unbiased estimates at the lower end of the concentration range [
5,
8]. Therefore, when heteroscedasticity is confirmed, OLS is not appropriate for estimating low-level concentrations. Instead, weighted least squares (WLS) regression should be applied [
8], as it does not assume equal variability of the dependent variable.
WLS regression has been shown to produce slope estimates equivalent to those obtained with OLS, but with more accurate and reliable estimates of the intercept and its associated uncertainty [
4,
9]. Consequently, while OLS may be suitable for obtaining unbiased estimates at intermediate and high concentration levels, it is inadequate at low levels. WLS regression, by contrast, yields unbiased estimates across the entire calibration range. This extends the usable working range of the validated method, particularly when trueness and precision are considered, and results in a lower MQL. Indeed, several studies have demonstrated that when both trueness and precision are taken into account, the choice of regression model may significantly affect the accurate estimation of MQL values [
8,
10,
11,
12].
Various approaches have been proposed for the determination of the MDL and MQL of analytical methods. For chromatographic methods, many laboratories rely on the signal-to-noise (S/N) ratio, as it offers a fast and simple procedure that does not require replicate analyses of blank samples. However, the S/N ratio is only appropriate for determining instrumental limits, not method limits, as it does not account for the variability of blank samples analyzed using the full method.
A more robust approach involves performing replicate measurements of blank samples using the validated method or fortified blank samples at levels near the expected limits when true blank samples do not produce a measurable signal. The SD of these measurements is then used as an estimate of the variability at blank level, denoted as
SDblank [
13,
14,
15]. The MDL and MQL are then defined as the concentration that produces signals
k times the
SDblank (typically,
k = 3.3 for MDL and
k = 10 for MQL). While this approach appropriately accounts for precision at limit levels, it has a critical limitation for estimating the MQL: it does not account for bias (i.e., trueness) [
16]. As a result, there is a risk of either accepting a method with insufficient accuracy or rejecting a suitable one [
12,
17].
To address this issue, new approaches have been proposed that incorporate both precision and trueness into the estimation of the MQL, aligning with its definition as the lowest amount of an analyte that can be determined with acceptable accuracy [
16,
18,
19]. Furthermore, the IUPAC recommends that variability at the quantification limit should not be based solely on blank-level variability. Instead, uncertainty propagation through the parameters of the regression function must also be considered [
16,
19,
20].
In this study, we developed a new method using liquid chromatography for the quantification of caffeine and four of the most commonly used artificial sweeteners (acesulfame K, saccharin, aspartame, and cyclamate) in non-alcoholic beverages. There is significant interest in the determination of artificial sweeteners in foods and beverages due to their increasing use as low-calorie additives. However, their safety remains controversial. Some laboratory studies on animals suggest that artificial sweeteners may disrupt the balance of the gut microbiome, although findings in humans remain inconsistent [
21,
22].
Moreover, aspartame, one of the most widely used sweeteners, was classified in July 2023 as possible carcinogenic to humans (Group 2B) by the World Health Organization, the International Agency for Research on Cancer, and the Food and Agriculture Organization [
23]. Although maximum permitted levels of these additives are regulated [
24], manufacturers are currently only required to disclose the type of sweeteners used on product labels, not their actual concentrations.
The method developed in this study was validated, and the influence of the selected regression function was assessed in terms of (i) its impact on the trueness and precision of quantitative estimates in samples and (ii) its effect on the determination of the MQL. Concentrations of analytes in selected beverage samples were determined using different regression models, and the results were compared to demonstrate the significant effect of the regression function on the analytical outcomes.
2. Materials and Methods
2.1. Chemicals and Reagents
Acesulfame K (potassium 6-methyl-2,2-dioxo-2H-1,2,3-oxathiazin-4-olate), aspartame (methyl L-α-aspartyl-L-phenylalaninate, Asp-Phe methyl ester), saccharin (1J-1λ,2-benzothiazole-1,1,3(2H)-trione), cyclamate (sodium N-cyclohexylsulfamate), and caffeine (1,3,7-trimethylxanthine) were obtained from Sigma-Aldrich (Steinheim, Germany), each with a purity of ≥99%. Phosphoric acid (85%), potassium dihydrogen phosphate, and HPLC-grade acetonitrile (gradient grade, ≥99.9%) were also purchased from Sigma-Aldrich.
2.2. Standard Solutions
Individual stock standard solutions (c.a. 1000 mg·L−1, with a measured standard uncertainty u = 0.6%) were prepared in ultrapure milli·Q water (Millipore Ibérica, Barcelona, Spain; resistivity: 18.2 MΩ·cm−1) and stored at 4 °C. Working and intermediate solutions were prepared by appropriate dilution and mixing of stock solutions with Milli·Q water. The dilution procedures resulted in uncertainties (u) of <0.5%. Considering the uncertainties of the stock solutions and the subsequent dilutions, the measured uncertainty for all calibration standards was <0.8%, ensuring that the variability in the independent variable (concentration) remained smaller than that of the dependent variable (signal), which was consistently ≥1%.
2.3. HPLC Mobile Phase
The HPLC mobile phase consisted of a binary mixture of acetonitrile and a 10 mM phosphate buffer (pH = 2.4). The mobile phase was degassed and filtered with 0.45 µm filters before use.
2.4. Sample Preparation
Beverage samples were degassed using an ultrasonic bath to remove carbon dioxide, followed by dilution with Milli·Q water (1:2 dilution, u = 0.25%). All solutions, including samples and standards, were filtered through 0.45 µm cellulose acetate syringe filters prior to HPLC analysis.
2.5. Instrumentation
Chromatographic determinations were carried out using an Agilent 1260 Infinity II HPLC system (Agilent Technologies, Santa Clara, CA, USA) equipped with a UV diode array detector (1260 DAD HS, Agilent Technologies, Santa Clara, CA, USA) and a fluorescence detector (1260 FLD, Agilent Technologies, Santa Clara, CA, USA). A 20 µL injection volume was delivered via an autosampler (1260 Vialsampler, Agilent Technologies, Santa Clara, CA, USA).
Separations were performed using a binary gradient elution (
Table 1) on a 20 cm × 0.46 cm i.d. column packed with 5 µm Kromasil 100-5-C18 silica (Teknokroma, Barcelona, Spain). This column has good recommended stability and performance within the pH range of 1.5 to 10. The mobile phase flow rate was set at 1.0 mL·min
−1, and all chromatographic runs were conducted at 25 ± 0.5 °C.
Additionally, saccharin was also detected using fluorescence (FL) detection with an excitation wavelength of 250 nm and an emission wavelength of 440 nm.
2.6. Regresion and Statistical Analyses
Statistical analyses were performed using SPSS for Windows, version 29.0.1.0 (SPSS Inc., Chicago, IL, USA). A significance level of p < 0.05 was considered statistically significant. Excel software was not used for regression analysis as it does not provide weighted regression calculations.
4. Conclusions
An analytical method for the quantification of artificial sweeteners and caffeine has been developed and validated. The results confirm that analytical and bioanalytical methods often exhibit heteroscedasticity in their response, meaning that the variance of the response increases with concentration. Consequently, weighted regression must be applied to obtain accurate estimates, particularly at low concentration levels.
As demonstrated with the assessed method, the choice of regression model significantly affects the determination of the MQL when both trueness and precision are considered. For heteroscedastic calibrations, conventional unweighted regression functions, such as OLS for linear regression, are capable of providing accurate results (i.e., unbiased and precise) only at intermediate or high concentration levels. In contrast, only weighted regression models provide reliable estimates at low concentrations with heteroscedastic conditions.
The most commonly used procedure for determining method limits involves estimating the standard deviation at blank level (SDblank) by analyzing blank or fortified blank samples. While this approach accounts for variability at blank level, it fails to consider other significant sources of uncertainty that influence the combined uncertainty, particularly at the MQL level. These additional sources include uncertainties associated with the regression parameters.
In the case of weighted regressions, the simplified approach remains valid because the uncertainty of the intercept is generally comparable to SDblank. As a result, the combined uncertainty can be approximated by . Consequently, only minor differences are observed between MQL values obtained using the conventional method and those calculated with full uncertainty propagation.
However, for unweighted regressions such as OLS, the uncertainty of the intercept is typically much larger and becomes the dominant contributor to combined uncertainty. In heteroscedastic contexts, uc calculated using unweighted regression is often more than 10 times greater than that obtained with weighted regression, resulting in similarly inflated MQL values, sometimes more than an order of magnitude higher.
Measurement uncertainty was also evaluated for the proposed method. Because bias (systematic error) must be included in uncertainty assessments, it became evident that the lower limit of the working range is significantly improved when weighted regression is used in the presence of heteroscedasticity.
The necessity of using weighted regression functions is especially relevant for accurate estimation of the MQL, and therefore for measurements conducted near the lower end of the calibration range, a common requirement in environmental trace analysis. For measurements at intermediate or high concentration levels, however, the choice between weighted and unweighted regression has a negligible impact on result accuracy.