The Current Conﬁguration of the Ostia System for Operational Production of Foundation Sea Surface Temperature and Ice Concentration Analyses

: The Operational Sea Surface Temperature and Sea Ice Analysis (OSTIA) system generates global, daily, gap-ﬁlled foundation sea surface temperature (SST) ﬁelds from satellite data and in situ observations. The SSTs have uncertainty information provided with them and an ice concentration (IC) analysis is also produced. Additionally, a global, hourly diurnal skin SST product is output each day. The system is run in near real time to produce data for use in applications such as numerical weather prediction. Data production is monitored routinely and outputs are available from the Copernicus Marine Environment Monitoring Service (CMEMS; marine.copernicus.eu). As an operational product, the OSTIA system is continuously under development. For example, since the original descriptor paper was published, the underlying data assimilation scheme that is used to generate the foundation SST analyses has been updated. Various publications have described these changes but a full description is not available in a single place. This technical note focuses on the production of the foundation SST and IC analyses by OSTIA and aims to provide a comprehensive description of the current system conﬁguration. Waters); software, S.G., E.F., C.M., (James While); draft All authors


Introduction
The Operational Sea Surface Temperature and Sea Ice Analysis (OSTIA) system was developed at the Met Office (the UK's national meteorological service; https://www.metoffice.gov.uk/) to meet its need for near real time, accurate, global sea surface temperature (SST) data for numerical weather prediction (NWP) [1]. It uses SST observations from in situ observing instruments such as drifting buoys in combination with satellite SST data from a variety of sensors to form daily, globally complete, model-free analyses of foundation SST (the SST from which the diurnal cycle grows) on a 1 20 • grid.
SST uncertainty and ice concentration (IC) fields are also produced. Although designed for use in NWP, OSTIA data have found a variety of applications including ocean (e.g., [2]) and atmospheric (e.g., [3]) reanalyses and have been used in numerous scientific studies (e.g., recently [4,5]). Since the OSTIA system was originally set up and documented in [1], a variety of changes have been made. These include creating configurations for generating long-term reprocessed datasets [6][7][8] and extending the OSTIA system to include the generation of diurnal skin SST products [9]. Changes to the near real time foundation SST and IC part of the OSTIA system have been reported individually (see Section 2 for an overview) but a complete description is not available in one place. The aim of this technical note is to provide a summary of the current configuration.
A description of the current system configuration of the OSTIA system for generating the foundation SST and IC products is given in Section 2. This is followed in Section 3 by a description of the outputs. Section 4 discusses the impact of the various changes to the system and Section 5 contains the conclusions.

Methods
Changes to the foundation SST and IC analysis part of the OSTIA system since its inception have included updates to the input data that are used as the satellite systems have evolved. The processing of the data has also improved. Perhaps the most significant update has been a change to the data assimilation scheme that underpins the system. Data assimilation is commonly associated with combining observations and a forecast field from a numerical model in order to create initial conditions for further predictions. Data assimilation is similarly used to create OSTIA analyses with the exception that the forecast field is provided by damped persistence of the previous day's analysis anomalies rather than taking it from a numerical model. The data assimilation scheme that has previously been used in OSTIA since its inception is the Analysis Correction (AC) scheme [10,11]. This uses an iterative approach to solve the optimal interpolation equation. The new version of the system uses NEMOVAR [12], which is a variational data assimilation scheme that is developed alongside the Nucleus for European Modelling of the Ocean (NEMO) model [13]. This echoes work done to upgrade the Met Office Forecasting Ocean Assimilation Model (FOAM) system to use NEMOVAR [14] and has facilitated various improvements to OSTIA including the introduction of an IC analysis to replace a simple regridding of the input data. Other changes to the OSTIA system have been the introduction of estimates of lake water temperatures and ICs [15], updates to the error covariance estimates that are crucial to the merging of the input data and an increase to the number of iterations of the AC assimilation scheme to converge more closely on the solution [16], and, most recently, an update that improved the resolution of SST features in the analyses [17].
An overview of the current OSTIA processing system is shown in Figure 1. It consists of five main parts, which are each described in detail in the following subsections: observation preprocessing (Section 2.1), IC analysis (Section 2.2), satellite SST bias estimation (Section 2.3), SST analysis (Section 2.4) and postprocessing of the outputs (Section 2.5).

Data Sources
Satellite and in situ SST and satellite IC data are pulled from data disseminators or are received through EUMETCast or World Meteorological Organisation (WMO) Global Telecommunication System (GTS) communication. Input SST data that cover the period from 18:00 UTC on the day prior to the date represented by the analysis to 6:00 UTC on the day after are allowed to be ingested. However, if they were included in the previous analysis they are not reused. The SST data that are currently used are listed in Table 1. The selection of input data used in OSTIA is continuously under review and new sensors are added if they improve the quality of the OSTIA analyses or are deemed useful to increase resilience of the system to data outages or loss of other sensors. Similarly, sensors will be removed from the system if they fail or their data quality degrades.  Table 1. List of input sea surface temperature (SST) data used in OSTIA, including the instrument and the satellite on which it is situated, the data producer, the data level (defined in [18]), spatial resolution (as defined in the file metadata), the level of thinning applied (for example 2 × 2 means that 1 in every 2 data values is kept in both dimensions) and citations and notes associated with the data use. Acronyms/abbreviations are defined and acknowledgments for data use are provided at the end of this technical note. The in situ SST data used in OSTIA are those distributed in near real time through the WMO GTS and hence are from a mixture of sources including drifting and moored buoy platforms and ships. Near surface data from Argo profiles are not used as they are reserved for independent validation of the analyses. Satellite SST data from a variety of instruments are used. These are primarily infra-red (IR) sensors on polar orbiting satellites, which give accurate global SST data approximately two times per day but can suffer from data gaps where cloud occurs. Currently there are two exceptions to this in use in OSTIA: the Spinning Enhanced Visible Infra-Red Imager (SEVIRI) IR sensor on the Meteosat Second Generation (MSG) geostationary orbiting satellite, which provides more views per day of the Atlantic and western Indian Oceans than is possible from a polar orbiter, and the passive microwave instrument Advanced Microwave Scanning Radiometer 2 (AMSR2) on the polar orbiting GCOM-W (Global Change Observation Mission-Water) satellite, which has fewer data gaps than IR sensor data.
Sea ICs from the Ocean and Sea Ice Satellite Application Facility (OSI SAF; product OSI-401-b) based on Special Sensor Microwave Imager/Sounder (SSMI/S) data on DMSP-F18 are used, and lake ICs are taken from the National Center for Environmental Prediction (NCEP) dataset, which is based on multiple sensors. Since the daily input data files from these providers are normally spatially complete, there is no use of IC data from the days surrounding the analysis date.

Satellite SST Data Processing
The file format of the input satellite data follows the Group for High Resolution SST (GHRSST; www.ghrsst.org) data specification [18] and the data are hence supplied with auxiliary information that aids their use. The quality information provided with the data files is used in the OSTIA processing. Generally, data with the two highest quality levels (4 and 5) are used, but for Visible Infrared Imager Radiometer Suite (VIIRS) and SLSTR data only those data points with the highest quality level (5) are retained. Estimates of bias are also provided and are removed from the SSTs, and estimates of error standard deviation are saved for later use in the analysis.
As OSTIA attempts to estimate the foundation SST, it is undesirable to include data containing a significant diurnal signal. Satellite SSTs which may be affected by a diurnal signal are determined using wind information supplied with the data (if available) and are defined as daytime data with wind speed less than 6 m s −1 . For all nighttime data, and daytime data where the wind speed is above this threshold, the SST below the surface is expected to approximate the skin SST, which is the temperature of the layer of the ocean that the IR sensors are sensitive to, except for a cool offset [23,24]. For IR sensor datasets where that offset is not already removed (currently only the Sea and Land Surface Temperature Radiometers (SLSTRs)), 0.17 • C is added to the SSTs [1,24].
Satellite SST data tend to be oversampled compared to the grid used to produce OSTIA and the data assimilation scheme that is used to combine the data is not able to take into account any correlations in the errors in the input data. Therefore, the satellite SST data are thinned before ingestion into the OSTIA system (as shown in Figure 1, observation processing section). The amount of subsampling that is applied for each sensor is shown in Table 1. Typically one SST value in every six data points is retained in both spatial directions. However, this is reduced to one in every two points for the passive microwave AMSR2 instrument, which has coarser resolution than the IR sensors. In order to maximise the amount of data used in the analyses, the algorithm searches for a nearby valid SST to retain if the grid point that would have been kept does not contain a valid SST. A copy of the data is also made with a second thinning applied ( Figure 1, bias estimation section). This retains the median SST where more than one value occurs within 13 km and 1 h. These data with the second thinning applied are used for calculating satellite biases, which are estimated on a coarser grid than the SST analysis. The data without the second thinning are used for the SST analysis.

Quality Control
A quality control check of the data is carried out using a Bayesian scheme [25,26] to evaluate the probability of a gross error in the data based on a comparison to the previous day's analysis field. Erroneous in situ data from instruments in ice covered regions can be missed by this test. Therefore, a check of the observation location against the previous day's IC field is done. If the IC is above 50%, the observation is not used.
A reject list is also maintained and used to exclude in situ SST stations with persistent quality issues. The in situ SST stations that are on the reject list are updated monthly, based on observation minus OSTIA (o-O) statistics derived from the previous month's data. For stations that have reported more than 30 observations during the previous month, the mean, root mean square (RMS) and standard deviation o-O and the percentage of observations that failed quality control are compared to a number of thresholds to determine whether they should be added to, or removed from, the reject list ( Figure 2).
The o-O threshold values depend on station type: for buoy SSTs the mean and standard deviation limits are 1.0 K and 1.5 K respectively and for ship SSTs thresholds of 1.5 K and 2.0 K apply. Stations whose o-O statistics are below both those thresholds and with quality control rejection rates less than 10% in the preceding month are marked as "good enough to remove" and will be removed from the reject list if already present on the list.
Next, each station's SST o-O statistics are checked against an alternative set of thresholds: 25% rejection rate and mean SST o-O of 1.5 K for buoys or 2.0 K for ships. Stations whose o-O statistics exceed either of those thresholds are marked as "bad enough to reject". Stations deemed "not good enough to remove" and "bad enough to reject" will be added to the reject list (or remain there if already present). Stations deemed "not good enough to remove" and "not bad enough to reject" are subject to a further sequence of checks on rejection rates and mean, standard deviation and RMS o-O to determine whether they should be accepted or rejected for the coming month.
If stations have reported 30 or fewer SST observations during the previous month, the procedure is instead applied to the previous two or three months of data if including these allows the number of observations criterion to be met. If not, stations remain on the reject list if they are already on it, otherwise they are accepted.

Sea Ice Analysis
Prior to February 2018, a simple regridding method had been used to process IC data. This reformatted the input data onto the final OSTIA output grid. However, this approach was vulnerable to missing data. Since February 2018 the NEMOVAR data assimilation analysis scheme is used in OSTIA to combine a background (first guess) IC field with daily satellite IC information. As described in [14,17], this performs an iterative minimisation of the cost function given in Equation (1). In this, the cost J associated with a particular increment δx to the background (or first guess) field depends on the background error covariance matrix, B, the difference between the observations and the background field, d and the observation error covariance matrix, R. H is the observation operator, which interpolates the gridded data to the observation locations. Sixty iterations of the scheme are applied in order to ensure that it has converged to a solution.
The sea ice analysis is performed on the extended ORCA12 grid. The ORCA grids are tripolar, with each pole over land. In the ORCA12 case the grid has a nominal resolution of 1

12
• at the equator [27] and finer resolutions at increased latitudes [17,27]. The extended version of the grid is used as the standard version does not reach far enough south to include all the OSTIA ocean grid points. The background IC field is the previous day's analysis. The background error covariance matrix is not specified directly owing to its large size. Instead, the system is provided with estimates of the background error standard deviations and a lengthscale that describes the correlations in the background errors, which are modelled within the scheme using a diffusion operator that approximates a Gaussian function. The background error standard deviations are specified seasonally as shown in As stated previously, OSI SAF and NCEP data are used for sea and lake IC respectively. To filter out spurious low concentration ice in the OSI SAF data, concentrations below 10% are set to zero. At present, uncertainty information provided with the data is not used. Instead, the fields shown in Figure 3b,d,f,h are interpolated to the data locations and used as the estimates of error standard deviations. These fields have similar spatial patterns to the background error standard deviation fields but are smaller in magnitude. Errors in the input data are assumed uncorrelated by the data assimilation scheme. The OSI SAF input data employs a more conservative land-sea mask than OSTIA. This causes an issue in the OSTIA analyses as near-coastal grid points do not get directly influenced by the input data causing spuriously low ICs. This is remedied by modifying the NEMOVAR outputs. In most regions, this is done by infilling the affected grid points from those that are within the realm of the OSI SAF ice data. Figure 4 shows the affected points in the North Sea. The green grid points are infilled from the values in the dark blue grid points. However, in the Baltic Sea the area covered by the input data is too small to do this (grey grid points in Figure 4). Instead the SST analysis is used in combination with a monthly freezing temperature climatology to determine if ice is likely to be present. The ocean freezing temperature climatology was derived from a monthly global 1 degree salinity climatology (Polar science center Hydrographic Climatology (PHC) 3.0, updated from [28]) and Equation (2) (from [29]) as follows: In this equation the freezing temperature T f is dependent on salinity, S, coefficients a 0 = −0.0575, a 1 = 0.001710523e-3, a 2 = −0.0002154996 and b = −0.000753, and pressure, p, which was set to 10.0 dbar. Freezing temperature for lakes was set to 0 • C. The result is shown in Figure 5. A small tolerance is added to these freezing temperatures to accommodate errors in the temperature analyses. The tolerance is 0.01 and 0.1 • C for ocean and lakes respectively. If the temperature is below the calculated freezing point, IC is set to be 100%. It is removed at a rate of 20% per day once SST rises above freezing.
The procedure to add ice if temperature is below freezing is also applied to all lakes (e.g., brown points in Figure 4). . Baltic and North Sea region of the mask used to determine which sea ice analysis grid points require additional processing. Green points are coastal and are not sufficiently covered by the input data. These are infilled from the dark blue regions. In the Baltic Sea, only the grey region is covered by the input data. This and the light blue region are augmented by using the SST to determine if ice is present. This method to infer ice from the temperature is also applied to lakes (brown regions).

Satellite SST Bias Estimation
In this processing step, drifting and moored buoy in situ data and nighttime data from the VIIRS sensor are used together as a reference dataset. Matchups are then found between the reference dataset and each of the other satellite SST data types, including the remainder of the VIIRS dataset. The limits for a matchup to occur are 25 km and 1 day. The two VIIRS sensors currently have a single bias field associated with them, and the same is done for the two SLSTR sensors, since testing indicated that there was no advantage to generating separate bias fields for each. However, this may be reviewed in the future.
For each sensor a spatial analysis of the matchups is performed. This is done using NEMOVAR, as described in Section 2.2 but no land mask is used and it is performed on the extended ORCA025 grid. This grid is a coarser (nominal resolution 1 4 • ) version of the grid used for the sea ice and SST analyses. The background to each bias analysis is the previous day's bias field for that sensor but with the magnitudes of the biases reduced by 10%. This reduction means that in the absence of matchups the bias fields will eventually relax to zero.
The background error standard deviations used in the bias analyses are set to 0.1 • C globally and the length scale that defines the correlations in the background errors is defined as 7 • . The matchup error standard deviations are defined as shown in Figure 6 and vary spatially with higher values in more variable regions. These error variances were tuned to ensure that the results approximate the bias fields generated under the previous version of the system, prior to the use of NEMOVAR, described in [1]. An example of a bias field is shown in Figure 7. This shows the estimate of the bias in SLSTR data for 1 October 2019 (biases were estimated from Sentinel 3A data only since Sentinel 3B SLSTR data were not used operationally at the time).

Temperature Analysis
As with the IC analysis, the SST analysis is generated using NEMOVAR on the extended ORCA12 grid. It uses as input the satellite data after removal of the biases calculated in Section 2.3 and the in situ SSTs. The uncertainty information provided with satellite SSTs is used to define the diagonal elements of R from Equation (1). For in situ SSTs, spatially varying error standard deviations are prescribed ( Figure 8a) and an additional platform-dependent uncertainty is combined with this in quadrature as defined in [16]. In that study, the additional platform-dependent error variance in moored buoys and ships compared to drifters is found to be 0.12 and 0.19 • C 2 respectively. The same platform-dependent uncertainty used for moored buoys is now also added for drifting buoy data owing to concerns over over-fitting to these data in the NEMOVAR implementation. The background for the analysis on day i, x b i , is generated by a damped persistence of anomalies from the SST analysis from the previous day, x a i−1 , as described in Equation (3). The anomalies are the difference between the analysis and a climatological field for that day x c i−1 , which are reduced by a factor 0.1 1 t , where t is a timescale. The climatological field for the analysis day, x c i , is then added to give the background field.
The daily climatology is derived from a reprocessed version of OSTIA [6] and is averaged over the period 1985 to 2007. The timescale, t, for regions that have 50% IC or less (according to the ice analysis) is set to 30 days, which means that the anomalies will decay to 92.6% of their original size after a day and to 10% after 30 days. For regions with greater than 50% concentration the background is relaxed towards −1.8 • C in the ocean and 0.1 • C in lakes with a timescale that depends on the IC. If the IC is 50% the timescale is 17.5 days and if the IC is 100% it is 5 days, with a linear change between these values. The resulting background field is checked for extreme SST values. Any values less than −2 • C or greater than 40 • C are reset to those values. Finally, a weak smoothing is applied to the field using a Gaussian filter with lengthscale 4.7 km.
The background error covariances are specified using a dual-component formulation developed for NEMOVAR by [30] and the OSTIA implementation is described in detail in [17]. As with the single-component configurations described in Sections 2.2 and 2.3, the inputs to NEMOVAR are spatially varying background error standard deviations and a lengthscale, but in this case two are provided instead of one. The background error standard deviations are from [16] and vary seasonally. The error variances have been smoothed to reduce noise while retaining structure using an inverse distance weighting filter (with weights defined as 1 d+1 , where d is the distance from the central point in units of number of grid cells) and a 61 × 61 kernel. These background error standard deviations are shown in Figure 8b-i. The two uncertainty components for each season are designed to approximately represent errors that correlate on atmospheric synoptic scales and ocean mesoscales or shorter. Their associated lengthscales are currently 2.7 • and 0.135 • respectively (the latter was set to 0.36 • in the initial implementation of the NEMOVAR version of OSTIA but was reduced following the work of [17] to improve the representation of features in the OSTIA analyses). The background error standard deviations control the relative weights of the two components and therefore the effective correlation lengthscale of the background errors varies spatially.
The OSTIA system takes advantage of the dual-component setup to reduce the effective background error correlation lengthscale in certain situations by increasing the relative weight given to the shorter lengthscale uncertainty component. The first of these ramps up the relative weight given to the shorter lengthscale for temperatures between 3 • C and 1 • C to prevent the spreading of warm SST observation information under ice. The second modification ramps down the weighting of the longer lengthscale uncertainty component for distances less than 400 km from the coast. The final modification is a flow-dependent adjustment developed by [17]. This ramps up the relative weighting given to the shorter lengthscale uncertainty component where the SST gradients in the background field are between 20 and 50 mK km −1 . This allows the background error correlations to adapt daily to the changing SST conditions, rather than them changing only on a seasonal timescale, and improves the representation of features in the SST analyses.
Finally, the resulting SST analysis is checked for unrealistic low temperature, defined as those less than −2 • C, and these are reset to that limit if they occur.

Observation Influence Analysis
A second analysis is performed using the same inputs as the original SST analysis but with all background values set to zero and all observation values set to one. This "observation influence" analysis has values close to one where the input data strongly influenced the SST analysis and zero in the absence of observations (which means that the analysis would be the background value in that location). Intermediate values imply that the analysis was a balance between the input data and the background. The observation influence analysis values are reset to 0 or 1 if values less than or greater than these occur. As the background to the analysis does not contain the SST values in this case, the observation influence analysis does not include the ramping applied to the background uncertainty components where the SST gradients are high or the ice edge is near but is otherwise equivalent to the SST analysis. This is used to generate an analysis uncertainty estimate as described in Section 2.5. An example of an observation influence analysis is shown in Figure 9.

Postprocessing
The final step in the OSTIA processing chain is to postprocess all the outputs before dissemination to users. All the previous steps produce data on the extended ORCA025 and ORCA12 grids and these are bilinearly interpolated to a regular grid using weights generated by the SCRIP package (https://github.com/SCRIP-Project/SCRIP).
An SST uncertainty estimate is derived from the observation influence analysis described in Section 2.4 using Equation (4) (reproduced from [1]).
The analysis uncertainty a i at each grid point i depends on the total background error variance (the combination of the mesoscale and synoptic scales), B i , and the observation influence analysis, o i , at that grid point and two tuning parameters α and β, which are set to 0.5 and 4 respectively.
The various outputs to the system are combined in a netCDF file that follows the GHRSST data specification version 2 (GDS2) [18] before disseminating the data to users. These files are described in Section 3 below.

Results
The OSTIA system is run daily at 05:50 UTC to create SST and IC analyses for the previous day. Inputs and outputs from the system are monitored routinely to ensure that data are produced on time and with consistent quality. An automated system triggers an alert if there is a lack of data from a sensor for a significant amount of time and, if appropriate, operators contact data providers to ask them to investigate the issue. Monitoring of outputs typically consists of examining plots of observation-minus-background differences, as well as time series of globally and regionally aggregated statistics of these. Plots of the output fields are also available to view by the monitoring operative. Issues are dealt with as appropriate, for example by contacting data providers or by adding an in situ platform to a reject list.
The primary OSTIA output is a GDS2 netCDF file. The variables within the data file are summarised in Table 2. Data are provided on a 1 20 • regular grid and there is one daily analysis per file, which means that the dimensions are 1 in time, 3600 in latitude and 7200 in longitude. An example of each of the data fields is shown in Figure 10. Table 2. Variables contained in the main OSTIA output files. Some of the variables are stored as packed data, which means that they have to be multiplied by their scale_factor attribute and the add_offset attribute added in order to obtain the actual numbers.

Variable Name
Dimensions Notes Nominal time that the SST analysis represents lat 3600 Central latitudes of the grid points lon 7200 Central longitudes of the grid points analysed_sst 1 × 3600 × 7200 The SST analysis (packed data) analysis_error 1 × 3600 × 7200 The uncertainty in the SST analysis (packed data) sea_ice_fraction 1 × 3600 × 7200 The IC analysis (packed data) mask 1 × 3600 × 7200 Water / land / lake / ice mask The regridded versions of the SSTs, SST uncertainties and ICs are packed into a lower precision data type to reduce file size. To obtain the actual SSTs and IC, the data in the file have to be multiplied by their scale_factor attribute and the add_offset attribute then added. This is done automatically by some netCDF data readers.
The mask variable provides the user with information on which grid points are ocean (mask value 1) or not (mask value 2). Information on lakes and ice is also encoded in the variable by adding 4 to the mask values of lake grid points and 8 to the mask values in icy areas (defined as ICs greater than zero).
In addition to the output described above, products are generated on a 1 4 • regular grid in a similar netCDF format: monthly and seasonal averages; the satellite bias estimates and an anomaly file containing differences between the analyses and a climatology of AVHRR Pathfinder data [31]. These and the primary output files are all available from the Copernicus Marine Environment Monitoring Service (CMEMS, marine.copernicus.eu).

Discussion
The OSTIA system has changed in a number of ways since it was described by [1]. The impact of these developments is illustrated in Figure 11. This shows global and regional monthly standard deviations of matchups between the OSTIA analyses and Argo data [32] in the range 3-5 m from the EN4 dataset [33]. Near surface Argo observations have been shown to be representative of foundation SST [34] and, as stated previously, Argo data are not used to generate the analyses, allowing their use as an independent reference for validation. The latest version of OSTIA performs well compared to the older versions of the system and also in comparison to other SST analyses (e.g., see the NOAA SST quality monitor (SQUAM); https://www.star.nesdis.noaa.gov/sod/sst/squam/analysis/l4 [35,36]). Some of the most significant events in the evolution of OSTIA are shown in Figure 11 by vertical lines. The changes in the statistics associated with each OSTIA change gives insight into the relative importance of each update.
The importance of the bias corrections applied to the satellite SST data can be seen by the change in statistics as the satellite reference sensor changes. In early 2012 the Advanced Along-Track Scanning Radiometer (AATSR) data stream was lost. This was in use as the reference sensor, leading to the switch to using MetOp-A AVHRR data instead. The Argo statistics noticeably worsened after this in some regions (for example, the South Atlantic), although the effect is less clear globally and in the North Atlantic there is an indication of improvement in the statistics. In September 2016, the reference sensor changed again to VIIRS. This corresponds to an improvement in the Argo statistics globally and in most regions.
Between the two changes to the reference sensor, in January 2013, the background error covariances were updated and the number of assimilation scheme iterations were increased as described by [16], who found that these changes yielded improved Argo validation statistics. While there is some indication of this improvement in Figure 11 (e.g., in the Southern Ocean), it is not clear owing to the short amount of time between changes.
The data assimilation scheme was changed to NEMOVAR in February 2018 and updates were made to improve the SST feature resolution in March 2019. The recent changes have both corresponded to improvements in the Argo statistics in some regions, demonstrating the positive impact they have had. The latest system also represents sharper SST features than previous versions [17].
The improved performance of the current OSTIA system is therefore attributable to a number of factors. The switch to using VIIRS as the reference sensor, the update of the assimilation scheme to NEMOVAR and the change to the feature resolution all regionally resulted in noticeable improvements in the Argo statistics. However, globally, the reference sensor change had the most impact. The OSTIA system is expected to continue to improve in the future with current efforts focused on improving the reference dataset further by using SLSTR SSTs. Figure 11. Monthly standard deviations of differences between near-surface Argo data and OSTIA foundation SST analyses for different regions. Some of the most notable changes to the OSTIA system are marked by vertical lines. Grey horizontal lines are drawn to aid comparison of the performance of OSTIA at different times.

Conclusions
The OSTIA system produces foundation SST and IC information in near real time for use in applications such as NWP. It is continuously developed and updated, both in terms of the input data it uses and the methods used to generate the products and has changed considerably since [1] originally described it. This technical note has explained the current configuration of the system.
In summary, the system now uses the NEMOVAR data assimilation scheme to generate model-free analyses of foundation SST and IC. The input data to each analysis are a forecast of the SST or IC based on the previous day's analysis, satellite data and (in the case of SST) in situ data. The satellite SST data are bias corrected using a combination of in situ data and VIIRS nighttime satellite data. Background error covariances are prescribed seasonally and, for SST, updated daily using information from the background field. Uncertainty estimates that are provided with the satellite SST data are used, but uncertainties are prescribed for IC and in situ SST data. An SST analysis uncertainty estimate is produced using a special "observation influence" analysis, which reveals where observations have affected the analysis on each day.
Finally, while this technical note has focused on the near real time OSTIA system that describes the foundation SST, users may wish to be aware that in near real time an hourly diurnal skin SST analysis is also generated [9] and an ensemble of the global SST analyses produced around the world is created to support intercomparison activities [37]. The system is also used to create historical datasets of foundation SST (the dataset resulting from an old version of the OSTIA system is described in [6]) and as part of ESA's Climate Change Initiative to generate fields that represent the daily average SST at 20 cm depth [7,8].
Data are disseminated to users by the Copernicus Marine Environment Monitoring Service (CMEMS; marine.copernicus.eu) under the Copernicus Marine Service Licence (http://marine. copernicus.eu/services-portfolio/service-commitments-and-licence/). The OSTIA foundation SST data are also available from the Physical Oceanography Distributed Active Archive Center (PO.DAAC; dataset link: https://doi.org/10.5067/GHOST-4FK02).  Acknowledgments: Thank you to Jemma Davie for reviewing the description of the reject list procedure to ensure that it accurately reflects the "Monitoring and Updating Station Lists" (MUSLi) system and its documentation. We also thank the four reviewers for their useful comments, which improved the manuscript. AMSR data are produced by Remote Sensing Systems and were sponsored by the NASA AMSR-E Science Team and the NASA Earth Science MEaSUREs Program. Data are available at www.remss.com. OSI-204-b and OSI-206-a SST data and OSI-401-b sea IC data are products of the EUMETSAT Ocean and Sea Ice Satellite Application Facility (OSI SAF, www.osi-saf.org) and are copyright EUMETSAT. AVHRR Pathfinder data were provided by GHRSST and the US National Oceanographic Data Center -this project was supported in part by a grant from the NOAA Climate Data Record (CDR) Program for satellites. We thank the providers of in situ data who make their data available in near real time through the WMO GTS system. The Argo data used in this publication were collected and made freely available by the International Argo Program and the national programs that contribute to it. (http://www.argo.ucsd.edu, http://argo.jcommops.org). The Argo Program is part of the Global Ocean Observing System.

Conflicts of Interest:
The authors declare no conflict of interest. The funders had input into decisions on updates to the OSTIA system but no influence on the writing of this manuscript, or in the decision to publish these results.