Microbial Parameters as Predictors of Heterotrophic Prokaryotic Production in the Ross Sea Epipelagic Waters (Antarctica) during the Austral Summer

: A regression-based approach was used to test the suitability of a range of parameters, including total prokaryotic cell abundance and biomass, as lipopolysaccharide (LPS) content, and ex-oenzymatic activities (leucine aminopeptidase, LAP, beta-glucosidase, ß-G, and alkaline phosphatase, AP) as predictors of heterotrophic prokaryotic production (HPP) in the Ross Sea epipelagic waters. A close association between HPP and protein hydrolysis mediated by enzymatic activity (LAP), and to a lower signiﬁcance level with the other variables, was recorded. Three multiple regression equations were developed from two microbial datasets collected during middle austral summer periods. All showed a good predictive ability for HPP, and this was further validated through a comparison between the predicted and the observed HPP values. The obtained regression equations proved to represent a promising example of empirical models for further predictive studies in the Ross Sea where—through the incorporation of additional microbiological and environmental parameters—the developed models could ﬁnd a practical application to cover the entire austral summer period.


Introduction
In aquatic environments, the heterotrophic activity expressed by the microbial assemblage is a dominant energy pathway in the planktonic food web. Prokaryotes are the major players in the production of new living biomass, as well as in the transformation of particulate organic matter (POM) into dissolved organic matter (DOM) and in the mineralization of DOM to CO 2 and inorganic nutrients [1]. By means of the bacterial carbon production (now renamed as heterotrophic prokaryotic production (HPP), DOC was assimilated by the prokaryotic assemblage with production of new biomass. The HPP process has a primary importance in understanding the fluxes of carbon and energy in aquatic ecosystems [2]. On the other hand, organic matter hydrolysis by exoenzymatic activities is recognized as an important bottleneck for substrate utilization by microbial assemblage and an important factor controlling the prokaryotic metabolism [3]. Consequently, estimates of potential activity rates may provide important insights on the organic matter flux along the microbial loop, as well as on the quality and quantity of organic polymers available to heterotrophs [4]. The hydrolysis and utilization of complex organic substrates are crucial steps in the use of macromolecules to sustain microbial growth rates [5]. Microorganisms are recognized as major enzyme synthesizers [6][7][8]; the enzymes leucine aminopeptidase (LAP), beta-glucosidase (ß-G), and alkaline phosphatase (AP) play a critical role in the

Materials and Methods
The field datasets used in this study were obtained from two Italian surveys (2000 and 2004) carried out in coastal areas of the Ross Sea during the austral summer period.

Study Area and Samplings
Samplings were planned as a part of two projects, carried out in the frame of the XV and XVIII Expeditions of the Italian National Program for Antarctic Research (PNRA). The first dataset analyzed was from the Terra Nova Bay 2000 survey (TNB), performed from 22 January to 10 Figure 1. Detailed information about the sampling station locations, water column characteristics, and microbiological analyses is reported by Monticelli et al. [18] and Povero et al. [32]. Here, the microbiological topics are briefly discussed. model for epipelagic net HPP. For that purpose, the relationships among prokaryotic parameters, the validity of each one, regression models, and HPP prediction across the water columns of coastal stations in the Ross Sea were considered.

Materials and Methods
The field datasets used in this study were obtained from two Italian surveys (2000 and 2004) carried out in coastal areas of the Ross Sea during the austral summer period.

Study Area and Samplings
Samplings were planned as a part of two projects, carried out in the frame of the XV and XVIII Expeditions of the Italian National Program for Antarctic Research (PNRA). The first dataset analyzed was from the Terra Nova Bay 2000 survey (TNB), performed from 22 January to 10 Figure 1. Detailed information about the sampling station locations, water column characteristics, and microbiological analyses is reported by Monticelli et al. [18] and Povero et al. [32]. Here, the microbiological topics are briefly discussed.

Heterotrophic Prokaryotic Production (HPP) Measurements
HPP was estimated using the 3 H-leucine incorporation method [33] by the centrifugation micro-method of Smith and Azam [34] using 2 mL Eppendorf tubes, triplicate samples, and two killed blanks. Net Prokaryotic carbon biomass production was calculated according to Kirchman [35], assuming no intracellular isotope dilution, applying a theoretical factor of 1.55 Kg C mol leucine −1 [36]. Values were expressed as ng C L −1 h −1 . Cell-specific activity (csa) was calculated dividing the HPP activity rates by the prokaryotic abundance (cells L −1 ) and expressed as ag C cell −1 h −1 .

Extracellular Enzymatic Activity (EEA) Measurements
The potential extracellular enzymatic activity (EEA) rates were measured immediately after sample collection, according to the Hoppe's multi-concentration method [3], by using methylumbelliferyl (MUF) substrates (MUF-phosphate and MUF-ß-Glucopyranoside, Sigma Aldrich, St. Louis, MO, USA) for alkaline phosphatase (AP) and β-glucosidase (ß-G) activities, respectively. L-leucine-4 Methyl-7-coumarinylamide (Leu-MCA) (Sigma Aldrich) was the substrate used to measure leucine aminopeptidase activity (LAP). Details about the used protocols are reported in Monticelli et al. [18]. Briefly, increasing concentrations of fluorogenic substrates (standing stocks 5 mM) were added to 10 mL seawater samples to get final concentrations of 10, 25, 50, 100, and 200 nM. Fluorometric readings were performed with a Turner TD 700 fluorometer (Turner Designs, San Jose, CA, USA) using the filters 380-440 nm (excitation-emission wavelengths) for LAP and 350-445 nm for ß-G and AP. After immediate measurements of fluorescence after substrate addition (time 0), readings were repeated 3 h after incubation at in situ temperature. Known MUF and MCA concentrations (from 10 to 200 nM) were used to build a standard curve and convert the increase in fluorescence into the hydrolysis velocity, reported as V max . Results were expressed as nM of hydrolyzed substrate h −1 , LAPcsa, ß-Gcsa, and APcsa as amol cell −1 h −1 .

Lipopolysaccharides (LPS) Quantitative Determinations
The determination of total (dissolved plus particulate) LPS as an indicator of Gramnegative biomass was estimated according to Watson and Hobbie [38]. Subsamples (1 mL) for LPS quantification were stored in pyrogen-free vials at −20 • C until analysis, performed in Italy. The Quantitative Chromogenic LAL test (QCL-1000, Bio Whittaker, Inc., Walkersville, MD, USA) was used for LPS determinations, following the producer instructions. Each sample was gently mixed with the Limulus amebocyte lysate (LAL) and incubated at 37 • C for 10 min. The subsequent addition of the chromogenic reagent was followed by a further incubation for 6 min; the reaction was stopped by adding 100 µL acetic acid (final concentration 25%). The color change from white to yellow indicated the concentration of LPS in the sample. The absorbance of the test tube was measured at 405 nm using a microtiter ELX−808 spectrophotometer (Bio Whittaker, Inc.) equipped with an Automatic Microplate Reader (for 96 microtiters) controlled by the specific software WIN-KQCL (Bio Whittaker, Inc.). The LPS concentration in each sample was expressed as ng LPS L −1 and LPScsa as ag cell −1 .

Statistical Analysis
Descriptive statistics, normality tests, correlations, and regressions were performed using the Minitab Statistical Software (Release 17 for Windows, State College, Pennsylvania) [39]. In particular, in the regression study, the microbiological parameters TPP, LPS, LAP, AP, and ß-G were considered as free predictors versus HPP (response). The adopted principle was to achieve a predictive equation (multiple regression model) using the lowest number of predictors associated with the highest coefficient of determination-CD (R-sq). CD was adjusted according to the number of predictors in the model (R-sq adj) and R square was predicted (R-sq pred) to indicate how well the model predicts responses for new observations.
Other considered parameters were the optimal goodness of prediction according to the Mallows' Cp coefficient, where a small Cp value indicates a small variance, namely that the model is relatively precise in estimating the true regression coefficients and predicting future responses; the standard error of coefficients (SE) and of the regression (S); the variance inflation factor (VIF) indicates the extent to which multi-collinearity is present in the regression analysis (VIF = 1: whether predictors are not correlated, 1 < VIF < 5: whether they are moderately correlated, VIF > 5 to 10: whether predictors are highly correlated, VIF > 10: indicate that multi-collinearity is inappropriately influencing the regression results), and the prediction sum of squares (PRESS) to assess the predictive ability of the model (in general, a smaller PRESS value supports the best model's predictive ability). To choose the best subsets of predictors, a level of significance p = 0.05 was used. Post-regression tests using Mann-Whitney pairwise comparisons between the observed and the predicted HPP values were carried out.

Microbiological and Environmental Characteristics
The mean values and the ranges of microbiological parameters recorded during each survey are summarized in Table 1. During the TNB survey in the post-bloom period sampled in an area adjacent to the MZ Station, all hydrolytic activities, prokaryotic standing stock, and biomass were relatively high, with mean values 1-2 orders of magnitude higher than those measured in the VLT survey. The mean and extreme values of HPP were significantly higher also. The maximum values of HPP (282.79 ng C L −1 h −1 ) and LAP (733.05 nM h −1 ), and their mean values, were exceptionally high in Antarctic waters. Many ice-covered stations were sampled during the VLT survey; under these conditions, the photosynthetic production of biopolymers, as well as the hydrolytic activities and cell standing stock, were low. Table 1. Microbiological parameters observed in Terra Nova Bay (TNB) and Victoria Land Transect (VLT) surveys, together with temperature values. T (temperature), HPP (heterotrophic prokaryotic production), LAP (leucine aminopeptidase), AP (alkaline phosphatase), ß-G (ß-glucosidase), LPS (lipopolysaccharide), and TPP (total picoplankton). Prior the regression analysis the multiple correlations between the logarithmic-transformed values of the microbiological parameters, particularly between HPP and other parameters, Spearman's rho correlation coefficients were calculated ( Table 2). The highest correlation was observed between Log 10 HPP and Log 10 LAP activity (0.848 and 0.853 in TNB and VLT surveys, respectively, as reported in the matrices A and B), which was reflected in the total dataset (matrix C, rho = 0.826) and in the csa values (matrix D, rho = 0.828). Table 2. Spearman's rho correlation coefficients calculated between the log 10 -transformed values of heterotrophic prokaryotic production (HPP, as ngC L −1 h −1 ) and exoenzymatic activities (leucine aminopeptidase-LAP, alkaline phosphatase-AP, ß-glucosidase-ß-G) (as nM h −1 ), total picoplankton-TPP (as cells L −1 ), and lipopolysaccharide-LPS (as ng L −1 ). For the matrix D, the correlations were calculated using the cell-specific activity (csa) values, reported as nmol cell −1 h −1 for enzyme activities and as ag cell −1 for LPS.
From the total dataset ( Table 2, matrix C), the level of relative correlation of HPP with the other microbiological parameters was in the order LAP (0.826) > AP(0.499) > LPS (0.450) > TPP(0.329)~ß-G (0.331). When considering cell-specific activity (matrix D) it was in the order LAP csa (0.828) > LPS csa (0.588) > AP csa (0.542) > ß-G csa (0.441), again showing the highest correlation of HPP csa with LAP csa and increased correlations with LPS csa and AP csa . LAPcsa was shown to be the best predictor parameter, followed by LPS csa and AP csa . Cell-specific rates of hydrolytic activities revealed higher LAP values in the TNB area (mean LAP csa = 11.84 amolcell −1 h −1 ) than in the VLT area (mean LAP csa = 0.209 amol cell −1 h −1 ); LPS csa was also six times higher in TNB area than in VLT (mean = 9.26 ag cell −1 ).

Regression Analysis
A first hypothesis to be verified was if the predictive model, generated from the dataset of the TNB survey 2000, was suitable for the dataset obtained from the VLT survey 2004. Unfortunately, this test was unsuccessful due to very high values of residuals (i.e., the differences between the observed and the predicted data). Actually, although both surveys were performed in late summer periods, and the range of water depths was the same (0-100 m), the sampled areas were different, as shown by the different ranges of the values measured for the predictor parameters (Table 1). Furthermore, a possible influence of autotrophic organisms, such as phytoplankton (as a potential producer of AP) and cyanobacteria (as potential producers of LAP), both actively involved in the enzyme synthesis within the photic zone, could partially explain this failure. Consequently, the model produced from one dataset did not fit well within the range of the predictors' values of the other dataset and therefore it did not meet the general condition of being suitable for the total dataset; for these reasons, a total dataset of predictors from both surveys was necessarily built.
A second screening was carried out using all the predictors, which were significantly correlated with HPP ( Table 2, C, D). According to the highest correlation values observed among all the microbiological parameters, the predictive equation was built considering only the heterotrophic activity in terms of csa.
From the total dataset (TDS), the best subsets of regression HPP (Y) versus LPS, ß-G, AP, LPS, and TPP (variables-vars) were calculated. For confirmation, an analysis was carried out with their original values (see Table 1) and after with csa values and Log 10 (L) transformed values. The accuracy and goodness of each single variable (var) and of a group of vars to yield a regression equation were evaluated analyzing the coefficients of determination (R-sq, R-sq adj, and R-sq-pred), the prediction sum of squares (PRESS), the goodness of prediction (Mallows Cp), and the error standard of the regression (S) (see Materials and Methods).
The best results were obtained expressing the vars as Logarithmic-transformed, cellspecific activities (L-csa). At the top of Table 3, the statistics of each single var and the best combination of vars (from 2 to 5) are reported. When single vars were considered, LAP showed to be the best var (R-sq pred = 59.9%), followed by ß-G (R-sq pred = 29.2%), AP, LPS and TPP (R-sq pred = 1.5%). According to Mallows' Cp, the regression models using a single predictor resulted in very imprecise predictive responses; with two or more vars as predictors, the precision of the models was significantly increased, giving higher CD and lower PRESS, Mallows Cp and S. Table 3. Exploratory best subsets log-log regression analysis of specific heterotrophic prokaryotic production (HPP) versus specific leucine aminopeptidase (LAP), specific alkaline phosphatase (AP), specific ß-glucosidase (ß-G) and specific lipopolysaccharide (LPS). Reported are: HPP csa , LAP csa , AP csa and ß-G csa as Log 10 cell-specific activity h −1 , LPS as Log 10 ag cell −1 , TPP as Log 10 cells L −1 . Abbreviations: PRESS-prediction sum of squares; S-standard error of the regression; R squared (sq)-coefficients of determination. The meaning of the statistical parameters is explained in Materials and Methods. For the Model Building Sequence and Optimize Response steps, three vars (LAP, ß-G, and LPS) were selected as potential predictors of HPP. ß-G was not considered due to its negligible contribution to the regression model. To reduce the multi-collinearity, logarithmic-transformed-LAP and logarithmic-transformed-LPS were used as predictors, and the statistic of the regression model from total dataset (TDS-RE) is shown at the top of Table 4. Their main features were R-sq = 97.23%, R-sq pred = 97.16%, low S, and moderate multi-collinearity (VIF = 5.2) values, which were compatible with a good predictive equation. Table 4. Heterotrophic prokaryotic production (HPP) prediction from logarithmic-transformed LAP and LPS variables: selected multiple regression analysis with optimized response from total, ≤50 nM h −1 and >50 nM h −1 LAP datasets. Abbreviations: SE (standard error of coefficient), VIF (variance inflation factor), TDS (total dataset), RE (regression equation). For the selection of predictor parameters see Table 1, and for the meaning of the regression parameters see Materials and Methods. For the matrix D, the correlations were calculated using the cell-specific activity (csa) values. Leucine AminoPeptidase-LAP and alkaline phosphatase-AP are reported as nM h −1 , lipopolysaccharide-LPS as ng L −1 ; csa are reported as amol cell −1 h −1 for LAP and AP and as ag cell −1 for LPS. Log HPP csa = 0.1230 + 0.3950 log (LAP csa **) + 0.4540 log (LPS csa **) * As Log 10 of total activity, ** as Log 10 of cell-specific activity.

Predictor
In the equation building process, seven predictor values (four responsible of large residuals and three of unusual values) were removed from the total dataset (plots not shown).
From the highest correlations observed between HPP and LAP (Table 2), a simple scatter plot between these two parameters was produced. The general trend was that the dataset points were arranged following a linear distribution. In detail, in correspondence with low LAP values (<50 nM h −1 ), the respective "cloud" of points were fitted on an axis different from the axis of the main regression line. This is also shown in Figure 2. From the LAP ≤ 50 nM h −1 dataset (≤50 DS), the respective exploratory best subsets regression analysis is detailed in the middle part of Table 3. As observed from the total dataset analysis, the best single var was LAP (R-sq = 67.8%), followed by LPS (R-sq = 29.0%), ß-G, and AP (R-sq = 25.3%). Single var regression models were imprecise, as shown by high Mallows Cp and S; better results were obtained simultaneously using two or more vars as predictors. The three vars selected to optimize the regression analysis were the three enzymatic activities expressed as Log-transformed cell-specific activities (L-csa) (R-sq = 71.2%, PRESS 9.2, Mallows Cp = 7.9, and S = 0.3155). The regression equation ( ≤ 50 DS-RE), that is shown in the middle of Table 4 was characterized by discrete values of SE, high R-sq (80.73%), and R-sq pred (79.17%), and very low levels of multicollinearity (VIF = 1.28). As found from TDS-RE optimization step, β-G gave a negligible contribution to the regression model. Consequently only two predictors (LAPcsa and LPScsa) were considered, and four vars were excluded, as they were responsible for large residuals.
From the LAP > 50 nM h −1 dataset (>50 DS), the respective exploratory best subsets regression analysis, detailed at the bottom of Table 3, was markedly different from the From the LAP ≤ 50 nM h −1 dataset (≤50 DS), the respective exploratory best subsets regression analysis is detailed in the middle part of Table 3. As observed from the total dataset analysis, the best single var was LAP (R-sq = 67.8%), followed by LPS (R-sq = 29.0%), ß-G, and AP (R-sq = 25.3%). Single var regression models were imprecise, as shown by high Mallows Cp and S; better results were obtained simultaneously using two or more vars as predictors. The three vars selected to optimize the regression analysis were the three enzymatic activities expressed as Log-transformed cell-specific activities (L-csa) (R-sq = 71.2%, PRESS 9.2, Mallows Cp = 7.9, and S = 0.3155). The regression equation (≤ 50 DS-RE), that is shown in the middle of Table 4 was characterized by discrete values of SE, high R-sq (80.73%), and R-sq pred (79.17%), and very low levels of multi-collinearity (VIF = 1.28). As found from TDS-RE optimization step, β-G gave a negligible contribution to the regression model. Consequently only two predictors (LAP csa and LPS csa ) were considered, and four vars were excluded, as they were responsible for large residuals.
From the LAP > 50 nM h −1 dataset (>50 DS), the respective exploratory best subsets regression analysis, detailed at the bottom of Table 3, was markedly different from the other two exploratory analyses. The best single var was LPS (R-sq = 81.5%), followed by LAP (R-sq = 73.9%), AP, and β-G (R-sq = 62.8%). In general terms, LPS increased its value as predictor, while two enzymatic activities (AP and ß-G) did not. Better results were obtained using two or more vars as predictors simultaneously, which resulted in increased R-sq, R-sq pred, and precision, as shown by low Mallows Cp and PRESS values. To model the regression analysis, logarithmic-transformed cell-specific LAP csa and LPS csa were selected, and the statistic of this test is shown at the bottom of Table 4. Conversely, TPP was not considered as an X variable of the regression equation; indeed, using cell-specific variables there was an autocorrelation, as HPP csa was calculated by dividing HPP by TPP.

Observed and Predicted HPP Values
Using the regression equation TDS-RE, the predicted HPP (P-HPP) values, as Logarithmictransformed, cell-specific-HPP values (HPP csa ), were calculated and plotted versus the observed HPP (O-HPP) ( Figure 2). A simple regression analysis was performed employing P-HPP as a dependent variable, and yielded a regression equation with slope 1 and intercept close to zero. Values located outside the prediction intervals (PI, indicated with a dashed line) correspond to anomalous values responsible for large residuals.

Distribution of O-HPP and P-HPP along TNB and VLT Epipelagic Water Columns
From the selected parameters, the predicted HPP values along eight water columns (four from TNB, namely from station PF sampled at different times and from station SMN, with four stations from VLT: C2, A3, H1, and R2) were calculated and plotted with the observed HPP (O-HPP) values ( Figure 3). The predicted HPP values were calculated from the predictor values measured in each water sample, both using the regression equation from the total dataset (indicated as P-HPP) and simultaneously using two regression equations (indicated as P2-HPP). All the predicted values showed trends similar to the observed ones, with few exceptions. In TNB, O-HPP and P-HPP values were closely related, except for the O-HPP values detected at stations SMN 02, PF 03 at surface; the same discrepancy was found at R2 station in VTL area.

Discussion
Accounting for about 20% of the atmospheric CO2 uptake, the Southern Ocean plays a significant role as a major carbon sink [40]; in this context, HPP regulates the efficiency of the biologically-mediated carbon sequestration and its quantification is important to obtain a biogeochemical model of the carbon cycle, and also in the light of current climate change scenario [41,42]. This study aimed at finding a set of microbial variables as suitable predictors of HPP in Antarctic epipelagic waters during the austral summer period; to this end, two datasets were analyzed through calculation of regression equations. In a comprehensive view of ocean carbon fluxes, HPP is universally recognized as a key process to evaluate the role played by the prokaryotic assemblage. Nevertheless, criticism has frequently been moved against the method based on tritiated leucine incorporation, regarding the use of standard carbon-to-leucine conversion factors, which vary along environmental gradients and must be determined in situ [43]. These factors could result in underestimation of the actual production rates by at least a factor of 6.1 in open-ocean waters

Discussion
Accounting for about 20% of the atmospheric CO 2 uptake, the Southern Ocean plays a significant role as a major carbon sink [40]; in this context, HPP regulates the efficiency of the biologically-mediated carbon sequestration and its quantification is important to obtain a biogeochemical model of the carbon cycle, and also in the light of current climate change scenario [41,42]. This study aimed at finding a set of microbial variables as suitable predictors of HPP in Antarctic epipelagic waters during the austral summer period; to this end, two datasets were analyzed through calculation of regression equations. In a comprehensive view of ocean carbon fluxes, HPP is universally recognized as a key process to evaluate the role played by the prokaryotic assemblage. Nevertheless, criticism has frequently been moved against the method based on tritiated leucine incorporation, regarding the use of standard carbon-to-leucine conversion factors, which vary along environmental gradients and must be determined in situ [43]. These factors could result in underestimation of the actual production rates by at least a factor of 6.1 in open-ocean waters [44]. Starting from these considerations, the present study aimed at assessing which, among a set of microbial parameters, was the best hypothetic predictor of HPP. In the perspective of proposing an eco-friendly methodological approach alternative to the commonly used radiotracer measurements for HPP estimation, empirical models suitable to be applied to cold marine environments such as the Antarctic waters were developed.

Microbiological Scenario
During a high photosynthetically active radiation (PAR) availability period, in the epipelagic layer, the photoautotrophic processes and the consequent production of biopolymers are particularly active, as well as the synthesis of prokaryotic inducible catabolic enzymes [6]. In the polynya of the Ross Sea, the hydrolytic activities and their relationships with particulate organic substrates were analyzed [17], confirming the close link between organic substrates availability and degradations systems response; particularly high LAP activity was detected in the upper layer (0-50 m depth). During the early summer period in the Arctic Ocean, significantly positive correlations were detected among HPP, amino acids, temperature, and chlorophyll a, as well as between LAP activity and amino acids, in relation to the prokaryotic activity [10]. In such a case, a direct linear relationship was recorded between cell-specific HPP and cell-specific LAP (R2 = 0.74, p < 0.001, n = 16). In the same study a high temperature variability (∆t = 7-8 • C) was observed, positively associated with amino acid concentrations and prokaryotic activity. In the summer season in a South American sub-Antarctic area, high positive correlations among HPP and LAP activity, TPP, POC, and Chla were observed [9]. Conversely, in the present study, narrow temperature variations were observed during both surveys (∆t~1.6 • C, 0.5 • C in TNB surface waters) and temperature showed no or little influence on the microbiological activities (Temperature versus HPP Spearman rho = −0.02). As a consequence, this environmental parameter was not considered as a predictor parameter in our regression study.
The values of the microbial parameters collected from the TNB area were in the same range as those observed by Talbot et al. [9] and Piontek et al. [10], even expressed as cellspecific activity; conversely, at the VLT area, all parameters showed values significantly lower than those measured at the TNB one (Table 1) and similar to those reported under ice close to Cape Hallett (Victoria Land) [19]. The high rates of HPP, enzymatic activities, TPP, and LPS suggested the occurrence of intense biological activities in the TNB area compared to the VLT one.
In both the study areas HPP was significantly correlated with LAP, with a Spearman's rho coefficient similar to that reported by Talbot et al. [9], indicating a coupling of organic matter decomposition and heterotrophic production during summer. At a lower level, significant correlations were also observed between HPP and TPP, as well as between HPP and LPS, and AP and ß-G. Particularly, the prokaryotic biomass (LPS) was positively correlated with the three enzyme activities and HPP (Table 2); a similar result was observed in an Arctic study [10]. The different rho correlation values observed between the TNB and the VLT surveys (Table 2), together with the quantitative differences (Table 1), were probably the result of the different trophic and geographical characteristics of the sampled areas and sites [18,22].
The phytoplanktonic biomass, measured as water fluorescence, was significantly correlated (Spearman's rho) with HPP (0.58), LAP (0.74), ß-G (0.42), TPP (0.50), and LPS (0.64), as calculated on logarithmic-transformed data from the whole dataset. In a previous study, a strong positive empirical relationship between prokaryotic abundance and chlorophyll concentration was observed by Bird and Kalff [45] in fresh and marine waters. A tight coupling between HPP and chlorophyll-a was also recorded in the Amundsen Sea Polynya (Antarctica) [42], suggesting that phytoplankton biomass provides a relevant source of DOC that supports the metabolic activity of prokaryotic organisms.
Concerning the used datasets, during both surveys, samplings did not take into consideration the diel dynamics of microbial parameters which could influence qualitatively some values [19], the rapid changes of microbial biomass detected in the early ice-free areas [15], the horizontal transport of DOC and POC, and other forcings/variables capable of influencing the microbial processes and, consequently, the absolute values of the analyzed parameters and their relationships. On the other hand, all these features underlined the value of the total dataset considered in this study, as it was representative of most microbial processes taking place during two middle summer periods in coastal areas along Victoria Land.
The relative ratio between proteolysis and glycolysis processes gave insights on the qualitative composition and availability of organic matter to planktonic microorganisms; variations in LAP/ß-G ratios depend on the quality and nature (labile or refractory) of the organic matter pool available in the waters [46]. Proteins and organic phosphorus compounds are polymers degraded early during the enzymatic breakdown of the organic matter, while polysaccharides are more recalcitrant [47,48]. During the TNB survey, molar ratios LAP/ß-G in terms of Carbon mobilized from these hydrolytic activities varied from 1199 (on 22 January) to 65 (on 10 February) [18]. The mean LAP/ ß-G ratios (from Table 1) were 395 and 100 for TNB and VLT, respectively. In the Ross Sea Polynya, sampled during the same period of this study, ratios higher than 10 were detected during autotrophic productive processes [17].
In a previous study performed in the Ross Sea [49], a close relationship was found to link the expression of enzymatic activities to the available organic matter; during productive periods, when both auto-and heterotrophic-derived organic substrates are already available in the environment, the relative importance of leucine aminopeptidase in the organic matter recycling decreased, suggesting that under these conditions the synthesis of new enzymes was not stimulated. As bacterial growth in marine environments depends on peptidyl substrates, extracellular aminopeptidase production is regulated through a "on-demand" mechanism by the peptides present in seawater, as shown in batch cultures of selected bacterial genera [50]. Taking into consideration the release of carbon atoms and considering that one molecule of hydrolyzed leucine is equivalent to six carbon atoms, the LAP/HPP ratios calculated in the Ross Sea were 231 (TNB) and 39 (VLT) ( Table 1). In both surveys, the amount of monomers potentially mobilized from protein decomposition supplied those removed by the heterotrophic prokaryotic production, providing a surplus available to other processes.

Regressions and Predictive Models
From the microbial parameters analyzed as potential predictors, the variations of Log-transformed LAP csa alone and together with Log-transformed LPS csa were those that better interpreted the response variable (HPP). Additionally, in the building process of the regression equation from ≤50 nM h −1 LAP dataset, AP csa contributed to decrease the multicollinearity level of predictors in the regression equation. The significant relationships between HPP and LAP (Table 2) suggested that the conceptual trophic model [productive period (polymeric new production) → proteolytic exoenzymes → new prokaryotic biomass (HPP)] was fully active in the studied area, even at variable intensities, as shown by the variable microbial parameters values reported in Table 1. The ecological correlation of such functional processes was reflected in the numerical correlation between HPP and the other variables, allowing the building of empirical regression models for predictive purposes. In addition, the positive correlations between activity rates (HPP, LAP) and biomass (LPS) agreed with previous studies on polar and cold environments. Moreover, like our findings, Sinsabaugh et al. [51] underlined the close relationships between the heterotrophic prokaryotic production and extracellular enzyme activities, as calculated from a ln-ln regression (instead of Log-10 datasets like in the present study). Table 5 summarizes the regression statistics calculated between HPP and enzyme activities. Although the magnitude order of HPP values reported by the above-cited study were not comparable with our data, due to the different environmental conditions (temperate and polar, respectively), the significant links between HPP and enzyme activity rates further confirmed that microbial growth depends on enzyme synthesis and on substrate availability; therefore, the enzymatic activity expression acts as an optimal resource allocation strategy between environmental resource availability and microbial growth requirements, undergoing up-and down-regulation in close relationships with environmental and trophic conditions.
In contrast with the significant relationships between HPP and LAP detected in our study, heterotrophic bacterial production was found to be significantly related to lipase activity, which in turn correlated with temperature, in sinking particles collected by sediment traps from the western North Pacific Ocean [52].

Post-Regression Tests
From the three datasets (TDS, ≤50 DS, and >50 DS) and using the three regression equations (TDS-RE, ≤50 DS-RE, and >50 DS-RE), the predicted values of HPP, together with the respective residuals and squared values, were calculated. All squared residuals varied into the same range with similar mean and standard deviation values (not shown). Another analysis was carried out using the Kruskal-Wallis test and Mann-Whitney (M-W) pairwise comparisons between the O-HPP and the P-HPP values; the results are shown in Table 6. M-W U values suggested that the predictive equation (Regression Equation, RE) built from the Total DataSet (TDS-RE) gave a better predictive interpretation when >50 DS predictors were used (M-W = 0.912) than TDS ones; conversely, the predictive equation was not appropriate when predictors from ≤50 Dataset (DS) (M-W = 0.104) were used. The best interpretative capacity was associated to ≤50 DS and its specific regression (M-W = 0.995), and a good result was obtained also using >50 DS and its specific regression (M-W = 0.741), whereas it was completely unsuitable for ≤50 DS and >50 DS-RE (M-W ≤ 0.001). Table 6. U values of Mann-Whitney pairwise comparisons calculated between the observed (O-HPP) and the predicted (P-HPP) values of heterotrophic prokaryotic production (HPP). The datasets and regression combinations used in the building of the P-HPP along the water columns are reported in bold (reported in Figure 3  In Table 5, the datasets and regressions combinations used to predict HPP distribution in the water columns are reported in bold (Figure 3). Observing the distribution of both P-HPP and O-HPP values, the question if the single predictive equation built from the TDS instead of two predictive equations used simultaneously was more suitable to predict the response variable was not completely answered.

Observed versus Predicted HPP
The dataset of each single survey, summarized in Table 1, was closely related with the two datasets (≤50 and >50 DS) of LAP produced after the regression study. All samples collected from the VLT survey were included in ≤50 DS and 68% of samples from the TNB survey were included in >50 DS. Therefore, practically, the predicted HPP values for the VLT area and for 50-100 m depth of the TNB area were obtained using ≤50 DS regression equation (Figure 3).
The single regression equations, derived from log-transformed, cell-specific activities linking O-HPP versus two LAP datasets [Log (HPP csa ) = 1.372 + 1.230 Log (LAP csa ) ≤ 50 nM h −1 ; Log (HPP csa ) = 27.78 + 0.195 Log (LAP csa ) > 50 nM h −1 ], exhibited two different slopes (=coefficients of the regression equation). This resulted in different ecological implications: indeed, in the VLT area (≤50 DS) characterized by low microbial activities, ice covered an active melting process, and the slope of 1.230 indicated that HPP increased more rapidly than LAP, resulting in a potential consumption of the dissolved monomers released from the proteolytic activity. On the contrary, in the TNB area (>50 DS, particularly in the 0-50 m layer of the water column), characterized by high microbial activities in decreasing trend during a post-bloom period [18], the slope of 0.195 suggested that HPP increased more slowly than LAP, indicating that dissolved monomers exceeded those consumed by the heterotrophic production process.
The substantial differences observed between the two simple regression equations (HPP versus LAP) might contribute to interpreting the recent history, and the close evolution, of the biological processes taking place in each area. When very low microbial activities (close to the limit of detection) are observed, the precision of these estimates is low and their respective numerical data are less reliable, which can contribute to making the predictive capacity of the regression equation less confident. The low values of AP and ß-G detected mainly in the VLT survey and at 50-100 m depth in the TNB waters could be susceptible to this error. According to the analytical protocols used in the study of the variables, it was not possible to assess the possible methodological error included into the regression equation; for HPP, the determinations were made in triplicate and the results gave a mean coefficient of variation (CV) of 15.1%, which was the only CV that could be considered. Moreover, another potential error source could be introduced by the ectoenzymatic activities calculated using the Michaelis-Menten kinetics, since a V max value was obtained.
In some water columns (SMN-02 and R2 stations, Figure 3), the O-HPP values at surface were far from or did not follow the same trend as the P-HPP ones. This behavior led us to suppose that not all the variance in HPP could be explained by the selected predictors and that other processes affected HPP [53]. Variability in temperature, nutrients, and community composition has been reported to influence HPP patterns in oligotrophic Arctic streams and lakes affected by nutrient pulses following storm events [54]. Moreover, besides temperature, other physical factors related to climate change (i.e., ice melting inputs) and biogeochemical forcings (i.e., DOC availability) were reported to affect bacterial dynamics in a coastal ecosystem of the Western Antarctic Peninsula, where a decadal (2002-2014) time series of bacterial production was collected and analyzed [55]. All such factors can lead to erroneous data predictions and misinterpretations.
Although our observations were the results of two surveys referred to the Ross Sea summer conditions, and additional data should be collected to generalize our findings to the entire annual period, the good agreement between the predicted and observed HPP rates confirmed the validity of our predicted models for the Ross Sea environmental scenario.

Conclusions
The results obtained in this study proved that enzymatic activities, particularly exoproteases, together with prokaryotic biomass, are good predictors for HPP in the context of the microbial processes and trophic characteristics of the coastal Ross Sea epipelagic environments.
The application of EEA measurements to get empirical predictive models of HPP values has the important advantage of ensuring laboratory safety, as fluorogenic assays do not involve the production of hazardous (i.e., radioactive) wastes; moreover, the analytical protocols of enzyme assays are cheap and widely used in routine analysis.
The used statistical tools, together with an ecological interpretation of the microbial processes investigated during the surveys, led us to consider the more appropriate simltaneous use of two regression equations, each of them built with different ranges of the main predictor (LAP), for the empirical prediction of HPP.
The total dataset of microbiological parameters, however, referred to two middle summer periods only; this leaves the question of how the models would have predicted HPP after and before the studied period unanswered. Moreover, in this study the datasets considered referred to natural microbial communities; in future experiments it would be interesting to confirm the performance of the experimental and analytical workflow applied here through experiments in microcosms with known concentrations of both autotrophic and heterotrophic microorganisms.
As the models developed in this study are tailored for the austral summer period, future Antarctic surveys should be performed to extend the period of observations and include other seasonal periods, as well as to check these models and analyze other environmental and/or biological parameters capable of improving current model estimates, making them applicable to almost the entire year.