Atmospheric Simulations of Total Column CO2 Mole Fractions from Global to Mesoscale within the Carbon Monitoring System Flux Inversion Framework

Quantifying the uncertainty of inversion-derived CO2 surface fluxes and attributing the uncertainty to errors in either flux or atmospheric transport simulations continue to be challenges in the characterization of surface sources and sinks of carbon dioxide (CO2). Despite recent studies inferring fluxes while using higher-resolution modeling systems, the utility of regional-scale models remains unclear when compared to existing coarse-resolution global systems. Here, we present an off-line coupling of the mesoscale Weather Research and Forecasting (WRF) model to optimized biogenic CO2 fluxes and mole fractions from the global Carbon Monitoring System inversion system (CMS-Flux). The coupling framework consists of methods to constrain the mass of CO2 introduced into WRF, effectively nesting our regional domain covering most of North America (except the northern half of Canada) within the CMS global model. We test the coupling by simulating Greenhouse gases Observing SATellite (GOSAT) column-averaged dry-air mole fractions (XCO2) over North America for 2010. We find mean model-model differences in summer of ∼0.12 ppm, significantly lower than the original coupling scheme (from 0.5 to 1.5 ppm, depending on the boundary). While 85% of the XCO2 values are due to long-range transport from outside our North American domain, most of the model-model differences appear to be due to transport differences in the fraction of the troposphere below 850 hPa. Satellite data from GOSAT and tower and aircraft data are used to show that vertical transport above the Planetary Boundary Layer is responsible for significant model-model differences in the horizontal distribution of column XCO2 across North America.


Introduction
One of the persistent challenges in the study of the global carbon cycle is the quantification of the uncertainty in inferred biogenic carbon sources and sinks [1]. Contemporary solution methods include atmospheric inversions while using general circulation models and in situ or satellite observations of carbon dioxide (CO 2 ) to correct vegetation model or flux-derived estimates of these biogenic surface CO 2 fluxes e.g., [2][3][4]. However, in spite of increasing sophistication in the optimization methods and observation systems, annual inverse fluxes vary widely at continental scales, e.g., from 0 to −1.5 PgC over North America [5]. Recent studies e.g., [6] assimilating in situ and satellite retrievals In our study, we assume the optimized mole fractions from the global model can be used without further adjustment. We also choose the regional domain boundaries to be remote from the main area of interest in our study, and aim to conserve the mass of CO 2 introduced at the boundaries of the simulation domain to be consistent between the global model and the regional model. This enables us to simulate column-averaged CO 2 (XCO 2 ) in both global and regional models and compare to satellite-derived observations. This permits us to explore the impact in the regional model of surface CO 2 fluxes optimized in the global model. To better understand the role of transport errors in our simulations, we also perform an ensemble of Weather Research and Forecasting-Chem (WRF-Chem) simulations while using identical fluxes from the NASA Carbon Monitoring System-Flux (CMS-Flux) inversion system but different transport realizations using the Stochastic Kinetic Backscatter Scheme (SKEBS; Berner et al. [35]). Do these fluxes produce equivalent results in the regional domain? How large are the differences in simulated XCO 2 due to the use of different atmospheric transport models? Are the transport errors impairing our ability to infer surface sources and sinks? These are long-standing problems in CO 2 inversion studies.
In our experiment, we introduce the optimized biogenic surface fluxes and posterior 4-D atmospheric mole fractions of CO 2 from the Carbon Monitoring System (CMS-Flux; Liu et al. [36]) as initial and boundary conditions into the WRF-Chem regional model [37][38][39]. Here, we describe this framework for achieving our goal of conserving the mass of CO 2 introduced in the regional model from the global model. In Section 2, we describe the global and regional models used in this experiment, the model coupling framework, and introduction to the methods for comparison to observation data.
We present between-model consistency and comparisons to observations in Section 3. A discussion of the results follows in Section 4. Finally conclusions are presented in Section 5, including how the framework can be used to couple other global models to WRF.

Models and Methods
In this section, we provide an overview of the CMS-Flux inversion system, the customized WRF-Chem model, the coupling of the two modeling environments, the ensemble of WRF transport realizations, and the observations used to compare model results.

CMS-Flux Inversion System
The NASA CMS-Flux inversion system (http://cmsflux.jpl.nasa.gov) is described in detail in Liu et al. [36], an observing system simulation experiment (OSSE) to optimize biogenic fluxes using simulated Greenhouse gases Observing SATellite (GOSAT) XCO 2 soundings [40]. The GOSAT Project (Japan Aerospace Exploration Agency, National Institute of Environmental Studies, and Ministry of the Environment) measures densities of CO 2 and methane (CH 4 ) from space. The CO 2 column abundances detected by the Thermal and Near Infrared Sensor for Carbon Observation-Fourier Transform Spectrometer (TANSO-FTS; Yakota et al. [40]) that we use in our experiment are processed using the NASA Atmospheric CO 2 Observations from Space (ACOS) algorithm [41,42] for use as column-averaged XCO 2 observations. The CMS-Flux inversion system uses the forward GEOS-Chem global chemical transport model [43,44] that is driven by meteorological fields from the NASA Goddard Earth Observing System, Version 5 (GEOS-5) data assimilation system [45]. CO 2 is simulated as a passive tracer forced by emissions from fossil fuel, biomass burning, shipping and aviation, biogenic land, and ocean surface fluxes. The GEOS-Chem adjoint model [46] optimizes the biogenic surface fluxes and atmospheric CO 2 mole fractions to be consistent with XCO 2 from a satellite observing system [36,47]. The GEOS-Chem adjoint has been used to estimate carbon monoxide emissions [48,49] and to attribute direct ozone radiative forcing [50]. In our study, we couple all of the components of the CMS-Flux inversion system including the imposed surface fluxes and the optimized biogenic fluxes at the surface of the WRF model domain, but excluding two minor components of the fluxes (aircraft source and chemical sources). Atmospheric CO 2 mole fractions from a coarse resolution global CMS-Flux inversion (horizontal resolution: 4 • latitude × 5 • longitude; vertical resolution: 47 levels to 0.01 hPa) that assimilated GOSAT XCO 2 for 2010 are coupled to the boundaries of the WRF simulation domain. Details of the imposed and prior surface CO 2 fluxes can be found in Liu et al. [36] and they are summarized for North America in Table 1. Figure A1 shows a schematic of the CMS-Flux system and the WRF-CMS interface for our experimental setup.

Weather Research and Forecasting with Chemistry Model (WRF-Chem)
The mesoscale model is WRF-Chem v3.6.1 [37][38][39]51] with the modification described in Lauvaux et al. [28] to transport greenhouse gases as passive tracers. Trace gas boundary conditions are provided from a global model at six-hourly intervals and surface fluxes introduced hourly. This work will describe (Section 2.4 below) the framework for introducing the boundary conditions from the global CMS-Flux model (using the GEOS-Chem atmospheric transport model to simulate CO 2 mole fractions in the atmosphere) into the WRF domain in a manner designed to preserve the vertical distribution of CO 2 at the WRF domain edges. Lauvaux et al. [28] used WRF-Chem for the forward transport model in an inversion for CO 2 fluxes in the USA Upper Midwest region as part of the Mid Continent Intensive (MCI) campaign. This WRF-Chem implementation was also used in Lauvaux and Davis [13], who investigated the impact of introducing column-averaged XCO 2 into regional inversions that typically use only CO 2 observations measured in the planetary boundary layer.
In the experiment that is described here, the regional domain contains most of North America in a Lambert Conformal projection at 27 km horizontal resolution. The model has 50 levels up to 50 hPa with 20 levels in the lowest 1 km. The model meteorology is initialized every five days with six-hourly ERA-Interim [52] reanalysis at T255 horizontal resolution (∼80 km resolution over North America). Each meteorological re-initialization is started at a 12-h setback from the end of the previous five-day run. The first twelve hours of the new run are then discarded implying a discontinuity in CO 2 transport necessary to re-initalize the meteorological conditions. We minimize the impact of the discontinuity by re-initalizing at night and by using the three-dimensional (3-D) CO 2 mole fractions from the previous run to start the following one. We also employ the alternative lake surface temperature initialization as described in the WRF User Guide (Weather Research & Forecasting ARW Version 3 Modeling System User's Guide, January 2015, pp. . Choices of the model physics parameterizations used in this experiment are documented in Table 2. Carbon dioxide from surface fluxes from the CMS-Flux inversion (optimized biogenic CO 2 fluxes and imposed CO 2 surface fluxes from the ocean and emissions of fossil fuel, biomass burning, and ship bunker fuel; Table 1) are carried as individual tracers in WRF-Chem. Background CO 2 mole fractions, supplied as boundary conditions from the CMS-Flux optimized mole fractions, are in a separate tracer. For analyses requiring total CO 2 or XCO 2 , the surface flux tracers are summed after correcting for the presence of water vapor in GEOS-Chem to obtain dry air mole fractions, and added to the boundary condition tracer. This multiple-tracer strategy allows for the inspection of the separate tracers throughout the model integration. If total column values are required, then the region above the 50 hPa top of the WRF model is populated with the appropriate value from 50 hPa to 0.01 hPa from the global model. In this experiment, we do not include the non-surface fluxes of CO 2 from the CMS-Flux inversion (aircraft source and chemical source; Table 1), but these are included in the boundary condition mole fractions from the global domain. A test, reducing these non-surface sources to surface emissions in the WRF domain, made, at most, a 0.03 ppm difference in the simulations of GOSAT XCO 2 .
The final preparation step for our experiment is the population of the WRF atmosphere with CO 2 mole fractions from the CMS-Flux global model. It takes approximately a month's integration in model time in order to distribute the contributions from the boundaries throughout the WRF atmosphere and completely remove the initial conditions from the simulation domain. For this experiment, we started the model integration in early December 2009, but begin all analyses in January 2010.

Ensemble Simulations (SKEBS)
An ensemble of ten perturbed transport realizations is performed to provide an assessment of transport model uncertainties. Perturbations of the meteorological driver data used to create initial and boundary conditions for WRF are generated by the Stochastic Kinetic Energy Backscattering scheme (SKEBS) [35]. Berner et al. [53] showed that a combination of a multiphysics scheme and perturbations from SKEBS represent more accurately transport model errors near the surface than perturbations alone. We selected a combination of model physics parameterizations to increase the spread of our ensemble. We used different combinations of Land Surface Models (LSMs) and planetary boundary layer (PBL) schemes in WRF, the main sources of variability in transport following Díaz-Isaac et al. [54]. We selected (1) Mellor-Yamada Nakanishi and Niino Level 2.5 (MYNN 2.5) PBL scheme with Noah LSM; (2) Mellor-Yamada-Janjic (MYJ) PBL scheme with RUC LSM; and, (3) Yonsei University (YSU) PBL scheme with five-layer thermal diffusion LSM. The SKEBS algorithm generates random perturbations to the meteorological initial and lateral boundary conditions. The SKEBS has been evaluated widely in the atmospheric ensemble modelling community while using similar parameter values to define the amplitude, sizes, and frequencies of the perturbations.

Mass Conserving Coupling Framework
In this experiment, we wish to compare simulated column-averaged CO 2 from both the WRF execution and the CMS-Flux products with the ACOS GOSATv3.5 soundings [42] for North America. This requires that we ensure that the mass of CO 2 introduced into the WRF domain from the CMS-Flux GEOS-Chem global model be conserved, both at the surface (fluxes) and the boundaries (atmospheric mole fractions) of the WRF model domain. The challenges for mass conservation include differences in model horizontal grid resolution, implied grid surface elevation, and vertical grid discretization. The strategy described here can also be used to couple the WRF regional model to other global models with a minimal amount of customization specific to the global model.

Mass Conservation of Surface Fluxes
We apply domain-wide scaling factors for each surface flux to conserve the mass of CO 2 introduced into the domain by the surface fluxes. First, we create an exact map projection for translation from the low-resolution CMS-Flux GEOS-Chem 4 • × 5 • grid to the WRF 27 km grid ( Figure A2 illustrates the domain extent of the WRF Lambert Conformal projection). Thanks to the higher resolution in WRF, the spatial mapping requires no interpolation or smoothing of the original CMS surface fluxes. We directly assign each WRF grid cell to the corresponding CMS pixel and compute monthly scaling factors for each surface flux component, as follows:

1.
Calculate the sum of the mass exchange in the global model grid cells assigned to the WRF domain.

2.
Calculate an initial domain-wide sum of mass exchange for the WRF grid cells using the assigned global model grid cells.

3.
Compute a domain-wide scaling factor as the ratio of the results of step 1 and step 2.

4.
Multiply the mass exchange assigned to each WRF grid cell by the domain-wide scaling factor.
These steps are repeated for each of the surface fluxes, including all the components of the CMS model, so that nearly-identical fluxes are used in WRF and CMS. The modification of the CMS total flux is due to minor misattribution along the coast lines, near water bodies, and due to non-perfect match between the two grids. The goal is to achieve equal total surface mass exchanges at the hourly resolution of the flux input into the WRF model domain. The component fluxes from CMS-Flux have temporal resolutions varying from hourly to monthly. The optimized biogenic flux is at monthly resolution, but with a 3-h diurnal cycle overlay with monthly net zero emission/uptake. We do not scale the diurnal cycle overlay. However, the surface fluxes in WRF have the same diurnal cycle.
Only the scaling factors are being kept constant over a month. Most monthly scaling factors are in the range [1.0, 1.3] with the exception of some of the minor fluxes (biofuel and ship bunker fuel) with slightly larger scaling factors. The scaled surface fluxes are introduced into the WRF modeling system using the WRFCHEMI function of WRF-Chem. Realistic flux scaling results may also be achieved using a more sophisticated approach, such as the mass-conserving utilities within the NCAR Command Language (NCL) Earth System Modeling Framework (ESMF)), which would also retain the global model flux patterns, as well as preserving the domain mean.

Mass Conservation at the Domain Boundaries
The challenge in the case of the domain boundaries is not only the difference in horizontal resolution and grid type, but also the difference in vertical discretization schemes. WRF uses a terrain-following, hydrostatic-pressure vertical eta coordinate with a fixed model top. Because the top of the atmosphere in WRF is at 50 hPa, we used CMS mole fractions to complete the column values above 50 hPa in our study. The CO 2 boundary mole fractions for our experiment are from a GEOS-Chem model with 47 hybrid-sigma layers corresponding to the GEOS-5 (MERRA) reduced vertical grid [45]. This grid is terrain-following from the surface up to ∼170 hPa, with fixed pressure levels from 170 hPa to the top of the model. Depending on the surface pressure and terrain, there may be nine layers below 1 km, compared to 20 layers below 1 km for our WRF domain. To approximate the vertical distribution of CO 2 in the global model source in the WRF grid, we follow these steps for each WRF boundary grid cell:

1.
Compute an equivalent global model surface pressure for each WRF boundary grid cell using standard bi-linear interpolation (interpolating in two or more dimensions) using the four global model surface grid cells whose centers are closest in latitude and longitude to the WRF grid cell center.

2.
Compute the equivalent global model pressure column using this derived surface pressure and the standard vertical grid discretization for the global model, in this case the GEOS-5 hybrid sigma-pressure ap and bp parameters and algorithm for the 47-layer reduced vertical grid.

3.
Compute an equivalent WRF pressure column using this derived surface pressure and the znu WRF vertical resolution discretization vector.

4.
Assign a source global model level for each receiving WRF model level in the WRF boundary grid ( Figure 1). This is determined by the global model level edge pressures between which the derived WRF midpoint layer pressure falls. This computation is done in log space with the respective pressure columns in units of Pa.

5.
Use simple linear interpolation (if necessary) between global model levels to smooth out a poorly mixed flux signal. We used this technique for CMS-Flux GEOS-Chem where the first four or five model levels in WRF are sourced from the first GEOS-Chem level; this source layer often shows the immediate result of the surface flux as distinct from several well-mixed layers above the surface layer. 6.
The result of this transfer of vertical CO 2 columns from the global model to the WRF boundary grid cells is introduced into WRF via the WRFCHEMBC functionality that is used for meteorological boundary conditions, similar to the approach of [18,19].
We use the GEOS-Chem surface pressure for two reasons. First, it is the mass in the GEOS-Chem column that we want to introduce into the WRF model domain and, second, it is not possible to match the surface pressures between the two models due to different horizontal grid resolutions, model grid surface elevations, and driver meteorology. To test the adequacy of this method, we compute pressure-weighted column-averaged XCO 2 along the WRF boundaries, independently compute the same quantity from the global model up to 50 hPa, and compare the results. Figure 2 shows an example of this comparison for a day in early June 2010. The surface layer of the western and eastern boundaries of the WRF domain is predominantly ocean, where we do not expect significant model grid elevation or surface pressure differences. This is evident for this example date in Figure 2b. On the other hand, we do expect some differences in the southern and northern boundaries, particularly in mountainous areas ( Figure 2a). Our algorithm produces model-model differences of column-averaged CO 2 at the boundaries of less than 0.1 ppm on most of the boundaries and less than 0.3 ppm in the high-terrain regions of the northern boundary. We present a comparison of the mass-conserved coupling scheme to previous versions e.g., [28] in Section 3.1 to demonstrate the importance of mass conservation at the boundaries of the simulation for the use of column-integrated measurements.

Model Comparison to Observations
While a primary goal of this model coupling experiment is to compare WRF and CMS-Flux model-simulated XCO 2 GOSAT soundings, it is comparisons to other observations that provide verification and insight into model behavior. In addition to the GOSAT XCO 2 , we compare to XCO 2 from a Total Carbon Column Observing Network (TCCON) site for times of GOSAT overpasses. In order to understand the vertical distribution of CO 2 and its relationship to column XCO 2 values, we compared our simulated mole fractions to aircraft measurements and continuous tower measurements. We also compare model meteorological winds to observations from selected North American rawinsonde sites.

GOSAT XCO2
We evaluated the WRF and CMS-Flux model-simulated XCO 2 by comparison to to ACOSv3.5 LITE GOSAT soundings (Crisp et al. [41]) with more than 13,000 good quality soundings within the WRF North American domain during 2010. Reflected sunlight in three near-infrared bands are measured over a surface footprint of approximately 10 km in diameter at the nadir [55]. Nadir land and ocean glint observations were both used in our analysis.
The distribution of these soundings varies in space and time, with no coverage in ocean areas or north of 60 • N in winter. We follow the same sampling scheme in both WRF and CMS-Flux model CO 2 mole fractions:

1.
Locate the nearest model grid cell in time and space to the GOSAT sounding.

2.
Isolate the column of CO 2 at that grid cell, calculate the dry air mole fractions.

3.
Interpolate the simulated CO 2 to the 20 pressure levels specified in the sounding's ACOSv3.5 averaging kernel algorithm and a priori profile [42,56], as shown in the ACOS 3.5 User Guide.

4.
Apply the averaging kernel to the interpolated profile for the full column XCO 2 or use the a priori profile pressure-level weights for partial column analysis.
Using the pressure-level weights versus the complete averaging kernel for the whole column yields results within a few tenths of a ppm for the full columns in our model atmospheres. For a priori profile pressure levels above 50 hPa, we use the CMS-Flux interpolated optimized mole fractions for both the WRF and CMS XCO 2 computations. For ACOS GOSAT soundings with profile pressure levels below the model surface, we use the CO 2 at the midpoint of the surface model layer.

Lamont TCCON XCO2
The Total Carbon Column Observing Network (TCCON; Wunch et al. [57]) provides measurements of XCO 2 from the earth's surface at a global network of sites. Two TCCON sites within our North American domain were operating in 2010. The Park Falls site has significant drop-outs, especially in summer, so we choose the Lamont TCCON site (36.6 • N, 97.49 • W) for comparison [58]. We average GOSAT soundings in a box of 6 • latitude and 12 • longitude centered at the Lamont site and match them to the TCCON data averaged for the hour of the GOSAT overpass of this regional box to include enough GOSAT soundings over each day in our analysis. This region corresponds to the US Upper Midwest with similar ecosystems and synoptic conditions, despite occasional frontal systems crossing the region. More advanced filtering methods have been developed to co-locate GOSAT and TCCON soundings [59], but we avoided using XCO 2 fields from one of the two models to avoid biasing the selection process. We also computed hourly-averaged simulated XCO 2 from the models for these GOSAT soundings in order to compare to both GOSAT and TCCON XCO 2 . During our period of comparison, summer of 2010, this region covers a large gradient of surface CO 2 fluxes. Our goal is not to evaluate the GOSAT XCO 2 relative to TCCON observations, but rather to illustrate how well the model-simulated XCO 2 follow the general tendencies of the observations. More rigorous evaluation of the GOSAT XCO 2 would require the use of coincidence criteria, as shown in Wunch et al. [60] or Basu et al. [61].

Horizontal Wind at Selected Rawinsonde Sites
We also make use of the NOAA archive of rawinsonde data (Schwartz and Govett [62]) for model comparisons to observed wind speed and direction at mandatory reporting levels. The goal in this case is to identify the possible causes of transport differences between models and observations. Comparisons of 00 UTC soundings (16)(17)(18)(19) LST, depending on the site) are summarized to show annual and seasonal biases.

In Situ Tower and Aircraft CO 2 Observations
In order to evaluate the impact of transport differences between WRF and GEOS-Chem, and among the WRF ensemble members, we collected in situ ground-based data from the NOAA Tall Tower network [63] and from multiple aircraft profiles to diagnose differences in vertical CO 2 distribution [64]. Continuous gas analyzers measure CO 2 mole fractions using non-dispersive infrared (NDIR) absorption sensors (e.g., Licor Li-6200 series and Li-7000 analyzers) and Picarro Cavity Ring Down Spectrometers (CRDS) [65]. Calibration with high-accuracy gas cylinders and comparison to co-located discrete flask samples indicate measurement errors below 0.2 ppm. Discrete samples from the NOAA Aircraft Program are collected by the programmable 12-flask sampling systems at various altitudes (between 300 and 8000 m above ground level). CO 2 mole fractions are measured by one of two nearly identical Measurement of Atmospheric Gases Influencing Climate Change (MAGICC) automated analytical systems. Measurement errors fall below 0.1 ppm, including possible sampling errors and storage biases.

Evaluation of Transport Differences in Model-Simulated GOSAT XCO2 Soundings
We compare the CMS-Flux and WRF XCO 2 model simulations with GOSAT soundings without and with mass-conservation in WRF in Table 3. The results from the mass-conserved coupling scheme in WRF reproduce more closely the original mismatches between CMS-Flux and GOSAT measurements. Mean mismatches in the original coupling scheme reach large values (up to 1.4 ppm) in Spring of 2010, and between 0.5 and 1 ppm during the other seasons. The histograms of the residuals are shown in Figure 3 revealing a similar range, positively-biased for all seasons. This first result demonstrates the decrease in the bias introduced in WRF by using a mass-conserved coupling scheme, whereas the original coupling artificially increases the amount of CO 2 molecules in the atmosphere. We further compare XCO 2 model simulations with each other in Figure 4 and with the GOSAT soundings in Table 4. If we have reached our target of conserving CO 2 mass entering the WRF domain from the CMS-Flux optimized CO 2 surface fluxes and boundary conditions, then we expect the model-simulated XCO 2 to be similar. Model-model differences can be related to transport from the boundaries into the WRF simulation domain (horizontal transit times from different driver meteorology and vertical mixing from boundary layer processes) or model resolution (heterogeneous surface characteristics). The seasonal mean model-model differences in XCO 2 simulations are largest in winter (mean, −0.30 ppm; RMSD, 0.51 ppm) and smallest in summer (mean, 0.12 ppm; RMSD, 0.72 ppm). These seasonal distributions of differences are all sharply peaked around zero with a few large outliers (>5 ppm) in summer. Table 4 summarizes comparisons of model simulations to the GOSAT soundings. In spite of a few extreme outliers in winter and spring, the inner quartile ranges (IQRs) for both models relative to GOSAT are approximately 2 ppm, with nearly the same seasonal mean differences (CMS-Flux range [0.00, 0.31] ppm; WRF range [−0.04, 0.38] ppm). The median differences are slightly larger than the mean differences, except for the CMS-Flux simulations in summer and the WRF simulations in summer and fall. Although the model results are centered well with the GOSAT soundings, neither model produces simulations with the full range of values of the GOSAT XCO 2 . Because the WRF model is able to simulate the meteorological conditions at higher resolution, we might have expected more variability in the WRF simulations, as multiple GOSAT soundings assigned to a single grid cell in GEOS-Chem are represented by many grid cells in the higher resolution WRF grid. However, because surface fluxes remain at coarse resolution in both systems, we do not see any differences at this summary level. When considering the tails of the model-data residuals, only six individual GOSAT XCO 2 soundings with model-GOSAT differences greater than 10 ppm were found; these soundings are all located over steep terrain in the western United States, hence most likely noise in the GOSAT measurements rather than due to the coarse resolution of the CMS-Flux system.     However, these general comparisons do not show differences in spatial coverage by season of the GOSAT XCO 2 soundings or any spatio-temporal variations in model-model residuals. To address this, we aggregate the model-model differences to the CMS-Flux GEOS-Chem 4 • × 5 • grid to map the mean seasonal differences in each grid cell ( Figure 5). We report in Figure 5 only those grid cells with more than 10 GOSAT soundings during each season shown. There are consistent small negative residuals in the WRF XCO 2 simulations relative to CMS-Flux in most of the continent in winter and in the south and east in fall. The WRF-CMS-Flux differences are slightly positive in the northwest in fall. The pattern in spring is mixed. There are strong positive differences in the Pacific Northwest in summer. The underlying CMS-Flux optimized biogenic flux shows a very strong source of CO 2 in the upper Pacific Northwest, and a correspondingly strong sink in the MidWest and East in July. These sources and sinks are evident in the WRF model layers closest to the surface, but not always in the CMS-Flux optimized mole fractions in the surface layers in the same locations. On some days, the surface layer flux contributions in the CMS-Flux XCO 2 mole fractions are in grid cells adjacent to the emitting grid cells, which suggests some model-model differences in mixing and transport within the boundary layer. There are also sharp gradients in terrain height in this region that may contribute to the model differences, although this does not appear to be an issue in the Rocky Mountain region of the US West. In general, there is more variability in the spatial differences during the growing season. These maps also show how the spatial distribution of the GOSAT soundings changes by season. Note that there are no ocean soundings and no soundings in the high latitudes in winter. The examination of variances of the model-simulated soundings on this spatial scale show no clear differences between models, other than that there is less spatial variance in the models than in the GOSAT soundings.
Having documented the model-model differences in column-averaged XCO 2 at seasonal scales, we next compare the distribution within the columns of the CMS-Flux and WRF XCO 2 simulations. We expect to find the most differences in the active growing season, hence we focused on the comparison of summertime column estimates. We divide the columns into upper and lower portions, with the lower column corresponding to the three pressure levels closest to the surface in the ACOS algorithm, and with the upper column consisting of the remaining 17 levels. The ACOS averaging kernel pressure level weights specify ∼12.5% of the column-averaged value from the three pressure levels that are closest to the surface. The elevation above the surface of this split varies with terrain, but roughly corresponds to 850 hPa. As justification for this division, we highlight an example CO 2 profile at the location of the LEF tall tower in Park Falls, Wisconsin, USA (45. Figure 6a) and the contributions of the flux tracers (Figure 6b). This is a location far from the boundaries of the WRF regional domain, so the boundary conditions profile is the result of long-range transport, not recent inflow. At this location and time, the flux contributions to the total CO 2 profile are predominantly below 850 hPa. The minor CO 2 fluxes (ocean, and combined biomass burning, biofuel, and ship bunker fuel emissions) contribute very little to the overall column value. The biogenic flux and the imposed fossil fuel emissions constitute almost all the flux portion of the column value. The transported boundary conditions account for more than 85% of the column-averaged CO 2 . For applications using total column CO 2 , it is important that this coupling of regional to global model be done correctly.  We illustrate the model-model differences for the simulated GOSAT XCO 2 in summer using this ∼850 hPa division in Figure 7. Differences shown are for sub-column-averaged CO 2 within the lower and upper portions of the column, computed as the sum of the interpolated mole fractions at each ACOS pressure level times the ACOS-provided weight at that pressure level. The lower portion of the column accounts for ∼50 ppm of the total column-averaged CO 2 . When comparing spatial patterns of differences of the sub-columns versus the total column in Figure 5c, we see that the WRF positive residual in the Pacific Northwest is the result of larger mole fractions in both upper and lower sub-columns. This is an area with a relatively low count of GOSAT soundings ( Figure A3) and very large biogenic source flux. In the eastern half of North America, there is a dipole effect, with WRF simulations having lower mole fractions in the lower sub-column and higher values in the upper sub-column when compared to the CMS-Flux simulations. There are non-homogeneous patches of strong uptake in the biogenic flux in eastern North America in July, generally matching the spatial pattern in the lower sub-column. As a result, once re-scaled over the total column mole fractions (cf. Figure 7, lower panel), an additional 1 ppm is attributed to the lower part of the column over the West coast in WRF as compared to CMS-Flux, while the Great Lakes region and the Central part of North America shows a −0.5 to −1 ppm difference. The differences will reflect the attribution of the observed gradients to the north American surface fluxes as low levels interact with the surface while the upper part of the column is attributed to long-distance air masses. We note here that 1 ppm over the column and over the entire domain would correspond to an aggregated annual flux of 1 PgC over North America, or twice the natural sink over the continent. The upper sub-column pattern more closely resembles the total column pattern in Figure 5c. Model-model differences in other seasons are minimal in both parts of the column. The summer patterns illustrate that agreement of full column XCO 2 simulations does not necessarily imply that the distribution of the CO 2 is the same in the two models. This further suggests model-model differences in transport within the columns, possibly due to more vertical mixing in the CMS-Flux GEOS-Chem model. The CMS-Flux vertical profile in Figure 6a shows an example of this behavior.

Evaluation of Transport Differences in GOSAT XCO Soundings
The spatial differences between WRF and CMS-Flux XCO 2 column mole fractions and ACOS GOSAT retrievals are shown in Figure 8 Figure 8). Differences are noisy, but, despite using different transport models, the overall model-data mismatches remain similar. Both models tend to over-estimate XCO 2 column mole fractions in the North West of the US (region of Seattle, WA). To better understand the differences between the two transport models, we show in Figure 8 (lower panel) the differences between the two model-data mismatches (differences of the differences). The positive anomaly over the northwestern US dominates other regional differences, advected by the westerly flow following the Mid-latitudinal Jet Stream. The Jet advects this anomaly toward the central part of North America, generating a large mismatch of about 0.2 to 2 ppm between the two models. This anomaly is highly correlated with a large positive biogenic flux area in the CMS inverse fluxes, hence in both models but more pronounced in WRF, located over Seattle, WA. Looking at vertical profiles, this anomaly seems to be related to vertical mixing differences. GEOS-Chem, used in the CMS-Flux inversion system, tends to mix faster in the vertical, lifting CO 2 molecules to higher altitudes where the wind speed is higher. We investigate the general behaviors of transport models in Section 3.4 using NOAA aircraft profiles across North America. WRF shows higher XCO 2 values across the domain (0.57 ppm over summer) as compared to CMS (0.015 ppm), as shown in Table 3. This positive bias seems to be driven by the anomaly in the northwestern US. Mean summer (June, July, August) spatial differences in WRF and CMS-Flux pressure-weighted CO 2 for partial columns of simulated GOSAT XCO 2 aggregated to the CMS-Flux grid (light gray lines). (a) Lower portion of column (model surface to 850 hPa) (b) Upper portion of column (850 hPa to top of the ACOS profile) and (c) the differences in the lower portion of the column scaled to the total column mole fractions (lower panel).

Temporal Evaluation of Model-Simulated XCO2 at the Lamont TCCON Site
A semi-independent comparison of column-averaged XCO 2 for GOSAT and the model simulations is performed while using the XCO 2 observations at the Lamont TCCON site. The calibration of ACOS GOSAT retrievals by comparison to TCCON measurements only accounts for long-term systematic errors, while our comparison focuses on short-term model-data differences. GOSAT passes near the Lamont TCCON site around 19 UTC. We select the GOSAT soundings near Lamont, as described in Section 2.5.2, average the XCO 2 values by day and report them along with the 19 UTC hourly average of the TCCON observations. We present the weighted averages and standard deviations of both sets of observations in Figure 9a. The error bars in Figure 9a represent the root mean square uncertainties of the selected TCCON and GOSAT soundings included in the hourly averages. In the example shown for days in July and August 2010 when both observations are available, there are 1-21 good GOSAT soundings and 3-34 TCCON XCO 2 observations during the 19 UTC overpass. Despite the presence of outliers in the early summer (around DOY 180-190), the daily average GOSAT and TCCON XCO 2 soundings agree within 1-2 ppm, with discrepancies likely due to sampling and representation errors. The mean residuals of ACOS GOSAT, CMS-Flux, and WRF XCO 2 relative to TCCON for this period are 0.088 ± 1.856 ppm, 0.868 ± 1.042 ppm, and 0.965 ± 0.981 ppm, respectively (Figure 9b). The residuals from the model simulations are generally, but not exclusively, positive and they are more similar to each other than to either of the observations. This implies that model transport differences are small when compared to other model-data differences at this time and location. The synoptic-scale variability in the summer atmospheric CO 2 mole fractions is well represented during short-term events (e.g., near DOY 218). Despite the coarser resolution of the CMS-Flux mole fractions as compared to WRF, both models are able to capture the trend and variability observed by the Lamont TCCON site.

Vertical Transport Evaluation of Model-Simulated CO2 at Several NOAA Aircraft Profile Sites
In order to understand the differences in total columns, we present an analysis of vertical profiles of CO 2 mole fractions in Figure 10 focusing on two NOAA aircraft profile sites, one near West Branch, Iowa (WBI) in the central part of the US Corn Belt, and a second on the East Coast of the US near Charleston, South Carolina (SCA). Profiles were selected to represent both well-mixed conditions, during which the PBL top is well-defined and vertical CO 2 gradients are close to zero (upper left and lower right panels), and near-neutral conditions when vertical mixing is driven by wind shear and thermals from the surface, responsible for non-zero CO 2 gradients in the Lower Troposphere. Differences between WRF and CMS-Flux are observed in both the Free Troposphere and the PBL on 28 April 2010 (cf. Figure 10, upper left panel), with an under-estimation of the PBL height in WRF.
The ensemble of WRF simulations is insufficient to explain the differences between the two transport models. On 29 November 2010 (cf. Figure 10, lower right panel), modeled CO 2 mole fractions are consistent between the two models, but WRF captures correctly the near-surface gradient and values, independent of transport perturbations. For neutral cases ( e.g., Figure 10, upper middle panel), vertical gradients in CO 2 mole fractions observed by the aircraft show a constant decrease until 3000 m above sea level. CMS-Flux shows a shallow and mixed PBL until 1000 m asl, while the ensemble of WRF simulations shows a similar gradient when compared to the observations but a wide range of values among transport ensemble members. A similar case is observed on 28 June 2010 (cf. Figure 10, lower left panel). The last two cases show an agreement between the two models, not matching the observed gradients nor the observed values in CO 2 mole fractions. When considering the different cases shown here, we conclude here that models tend to agree in the Free Troposphere, but vary widely near the surface. The ensemble of transport realizations indicate a large spread among members in neutral conditions and a narrower range of vertical profiles during well-mixed conditions, despite the use of different physics schemes. In the Free Troposphere, the ensemble shows some variability due to variations in the origin of air masses. This result shows the importance of the transport of large-scale boundary conditions within the WRF domain, driven by uncertainties in the meteorological conditions over North America. These differences might also be due to differences in boundary conditions from the northern boundary, not fully reconcilable due to the steep terrain. Overall, we conclude that vertical mixing in the Lower Troposphere seems to drive the model-model CO 2 differences, while the transport over long distances of large-scale CO 2 air masses impacts CO 2  We further compare CO 2 mole fractions from NOAA aircraft profiles to the two models (WRF and CMS) and to the contribution from the CMS boundary conditions alone within the WRF model in Figure 11. Total column XCO 2 mole fractions are indicated for each vertical profile in order to understand the relationship between integrated mole fractions over the column and the vertical distribution of in situ mole fractions. For all four cases, WRF and CMS column XCO 2 are within 1 ppm or less from each other, despite significant differences in the vertical distribution. Typically, mole fractions in the Free Troposphere remain similar. The corresponding GOSAT soundings are indicated with black stars, soundings collected nearby and within the hour. Only few profiles matched with GOSAT passing times. The accumulation of CO 2 near the surface is a typical feature in CMS which might be problematic if tower data are being inverted, but not visible in satellite data. The surface fluxes in WRF impact the profiles up to 600 hPa (about 4000 m asl) as noted in the WBI profile on 1 July 2010 (cf. Figure 11, lower right panel). We also note that boundary conditions in WRF display a small vertical gradient near the surface, despite the long-range transport of air masses from the boundaries. This gradients implies that vertical gradients in CMS coupled to the boundaries can impact the vertical gradients in WRF, hence propagating vertical structures far into the WRF simulation domain. The largest differences are typically observed when surface signals are transported higher in the column, above the PBL (cf. Figure 11, lower right panel), advected by widely different horizontal mean wind. As shown in Lauvaux and Davis [13], vertical distribution matters, as it impacts the horizontal advection of surface signals, responsible for large spatial differences in the horizontal, as shown in Figure 8, lower panel).

Temporal Evaluation of Model-Simulated CO2 at Several NOAA Tower Sites
We evaluated the modeled CO 2 mole fractions at two NOAA towers: WBI sampling at 240 m agl and SCT sampling at 115 m agl, over a year, by showing the 31-day running mean Daily Daytime Averages (DDA) in Figure 12. These two locations correspond to the positive bias located in the US Midwest (WBI) and the negative bias (East Coast) diagnosed in WRF model residuals compared to CMS in Figure 8. At the surface, model-data differences agree with total XCO 2 mole fraction differences during wintertime with lower mole fractions in WRF (cf. Figure 6), but tend to disagree in the summertime. The model-data differences are negative when compared to tower data in the summer whereas the total column values are positive. Positive model-data differences are observed over the PBL, between 500 and 600 hPa, as shown in Figure 11. We conclude here that model-data differences in the PBL do not reflect the overall differences in the column XCO 2 mole fractions. When considering transport differences in the WRF ensemble, summertime model differences can be interpreted as transport differences at SCT, but not at WBI, with a larger drawdown in WRF. Spring and Fall seasons show little variability due to transport, while summer and winter show a larger spread in the ensemble. This result suggest that WRF CO 2 mole fractions are primarily driven by large-scale conditions in shoulder seasons, reinforcing the importance of the coupling scheme when considering the annual carbon cycle. At SCT during the Fall season, WRF and CMS show large differences (about 2 ppm), despite the absence of fluxes from North America. We conclude here that long-range transport in the WRF domain differs from the CMS CO 2 mole fractions. This analysis demonstrates the role of large-scale conditions and its transport within the regional simulation domain, esp. in Fall and Spring, and highlights the indirect relationship in model-data differences when comparing model-data residuals near the surface and in the XCO 2 total column mole fractions. Figure 11. Comparison of modeled CO 2 mole fractions (in ppm) sampled at the location of WBI (West Branch, Iowa) and Park Falls, Wisconsin (LEF) from CMS-Flux (in red), the contribution in CO 2 mole fractions by the CMS boundary conditions in WRF (in light blue), and the total CO 2 mole fractions from WRF (in dark blue). The corresponding total column XCO 2 mole fractions for each profile are indicated by the star symbols.

Horizontal Mean Wind
We selected a group of rawinsonde sites (Table 5; Figure 13), including west coast, east coast, Gulf Coast, and mid-continent North American sites for comparison of model and observed wind speed and wind direction at mandatory rawinsonde reporting levels. Our intent is to identify any regional distinctions in biases or variability that might help to explain the CO 2 model-data differences. Each of these sites had more than 11 months of data for 00 UTC soundings in 2010 at the mandatory reporting levels of 925 hPa, 850 hPa, 500 hPa, and 250 hPa, which we used in our analysis. Continental mountain and high plains sites were not included, because they lack 925 hPa data, and cannot easily be compared with mandatory levels at lower elevation sites. Model locations were assigned as the nearest grid cell with similar surface height. For some coastal sites, the representative model sites were moved one grid cell toward the ocean to achieve this. In Figure 14, we show the annual mean wind speed bias at these sites from the WRF forward run (initialized with the ERA-Interim reanalysis) and the GEOS-5 reanalysis used by the CMS-Flux system. Recall that, although the WRF run is initialized with reanalysis data, it is free-running during each five-day run, and so may not conform to the reanalysis. Wind speed bias, RMSE, variance ratio, and correlation skill score ( ); von Storch and Zwiers [86]) are summarized in Table 6 for mandatory reporting level 925 hPa. (See Tables A1-A3 for results at other reporting levels). In general, coastal sites are difficult to simulate in both models, with seasonal differences that tend to cancel each other in WRF, but not in the GEOS-5 meteorology. With the exception of the coastal sites, WRF wind speed error is positively biased at the lower levels, but this bias largely disappears with height. Most notable is that GEOS-5 underestimates the wind speed for nearly all of this selection of sites at all levels. The WRF model overestimates wind speed variability by more than 10% at 925 hPa and 850 hPa, but slightly underestimates variability at higher mandatory levels. The GEOS-5 analysis consistently underestimates variability at all levels. Both models' correlation skill scores improve with height, as expected (Table 6 and Tables A1-A3). WRF overestimates day-to-day variability, as shown by the variance ratio, in winds at lower levels, and it has a larger RMSE than GEOS-5 at lower, but not higher, levels. GEOS-5 underestimates day-to-day variability at all levels. Annual wind direction bias and RMSE, for both models at all sites and mandatory levels, are smaller than the number of degrees separating reportable wind directions (not shown). However, at the 925 hPa mandatory level, there are seasonal directional biases at many sites in summer and at west coast sites in all seasons ( Figure 15). This directional bias may contribute to the summer boundary layer differences in XCO 2 simulations between the two models. Based on these results, the potential improvement from finer resolution WRF simulations in the resolution of mesoscale features and vertical structure near the surface is not evident from our continental-scale evaluations. We discuss further in Section 4 the impact of potential biases in WRF near the surface and in GEOS-5 at upper levels.

Discussion
Based on the spatiotemporal distribution of GOSAT XCO 2 soundings, there is very good agreement between the two modeling systems and GOSAT in the gross seasonal comparisons, well within the uncertainty of the individual GOSAT retrievals (0.5-2.00 ppm, reported with each sounding). Our primary goal of this framework is to control the CO 2 mole fractions introduced into the WRF domain to be as close as possible to the mole fractions from the global model. By scaling the surface fluxes and constraining the mole fractions in the flow at the boundary walls, we have approached this goal of mass conservation. The largest spatiotemporal differences in XCO 2 shown in this experiment appear to be due to the model-model differences in transport of the anomalous July biogenic source in the continental northwest and offsetting sinks in the midwest and east in the optimized CMS-Flux biogenic fluxes. However, the very small differences in simulated XCO 2 values mask larger differences within the columns. For example, we can see model-model differences in the transport of the surface flux anomaly in the Pacific Northwest in summer, both within and above the boundary layer ( Figure 7). Apparent model differences in vertical mixing within the boundary layer in summer in the eastern part of North America result in lower values within the boundary layer (<850 hPa, −0.5 to +0.1 ppm) and higher values above the boundary layer (>850 hPa, by +0.1 to +0.5 ppm) in the mesoscale model compared to the global model. We know that this configuration of WRF-Chem lacks convective mass transport of the CO 2 tracers. This will affect the vertical transport of CO 2 into and out of the boundary layer. We also acknowledge here that diurnal variations in CMS surface fluxes are prescribed and they might not exactly match the meteorological conditions in WRF. A comparison of fluxes and PBL variations at shorter timescales is needed in future studies, and might be optimized in the inversion systems e.g., [23]. The CMS-Flux optimized CO 2 mole fractions also show effects of recent fluxes in the layer closest to the surface, and then nearly homogeneous mixing in the next several model layers, as seen in the example profile in Figures 6a and 11.
Comparison to in situ CO 2 mole fractions collected during aircraft flights (NOAA aircraft program) showed that vertical air motion above the PBL top changes significantly the horizontal distribution of column XCO 2 . When comparing CO 2 mole fraction biases at various tower locations (near the surface) to biases in column XCO 2 , seasonal and regional biases appeared to contradict each other, i.e., negative biases in Summer in CO 2 while column XCO 2 was positively biased. Detrainment of air out of the PBL, convection, and day-to-day PBL height variations lift up CO2 molecules above the PBL, which are advected horizontally by fast-moving air masses in the Free Troposphere. A recent study showed the role of deep convection at Mid-latitudes by comparing two global models coupled to the same surface fluxes [87]. Their results suggest that the transport of continental surface fluxes by latitudinal atmospheric transport can greatly impact the distribution of CO 2 mole fractions across the northern hemisphere. Similar to our results, they conclude that additional evaluation of vertical mixing is needed to reduce transport errors above the PBL, esp. by deep convection and other detrainment processes. The apparent agreement between models over the entire simulation domain was in fact hiding large differences in the vertical distribution, hence affecting the horizontal distribution of XCO 2 at finer scales. The utility of aircraft data in our analysis explained some of the observed differences in column XCO 2 , not visible when comparing CO 2 mole fractions at tower levels (within the PBL).
The ensemble of WRF realizations generated by perturbing the initial and boundary conditions and the model physics provided additional information on the impact of transport errors in the observed model-model and model-data differences. Seasonally, model-model differences are driven alternatively by the large-scale boundary inflow (Spring and Fall) or by the near-surface vertical mixing (Summer and Winter). Despite improving the coupling scheme with mass-conservation, differences remain between models, which are mostly due to differences in the transport of the CMS boundary conditions within the WRF simulation domain. Hence, model coupling is critical during off-seasons when signals from the surface are small, but when boundary conditions are affected by large-scale seasonal and synoptic variations. At the annual scale, transition seasons might cause the mis-attribution of large-scale inflow of CO 2 into surface flux signals within the inverse system. Future studies will focus on the attribution of CO 2 and XCO 2 model-data residuals into surface and boundary contributions.
In this experiment, the meteorological evaluation of the WRF results was approximately comparable to the GEOS-Chem GEOS-5 results, and neither compared well in all respects with the rawinsonde observations. At the selected sites, the GEOS-5 winds were slow by up to 3 ms −1 at nearly all of the sites and mandatory levels that we used for comparison, while the WRF winds were closer to observations at higher levels. We had hoped to see better performance from the increased resolution, both horizontal and vertical, in WRF. Perhaps the horizontal resolution in WRF is still too coarse to take advantage of any truly mesoscale effects. We also did not assimilate meteorological observations into WRF; this was by design, as the primary focus of the experiment was to identify differences in transport of the CO 2 originating from the CMS-Flux optimized biogenic fluxes. The WRF resolution, the assimilation of meteorological data in WRF, and changes to the boundary layer parameterizations within WRF could be tested to review the conclusions we see here. The two different re-analysis driver data used here might also cause differences in simulated XCO 2 and CO 2 mole fractions. No reconciliation was performed, because both models re-interpret driver data to a certain extent (through advection and diffusion schemes), acting as an additional layer of complexity. However, a comparison of ERA-Interim and GEOS-5 driver data would potentially bring additional information about transport differences and explain some of the differences noted here.
One of the main objectives of satellite XCO 2 programs is to provide observations for assimilation into atmospheric inversions to improve the quality of inferred surface fluxes. GOSAT and other satellite missions ( e.g., OCO-2) provide good coverage of the North American domain in the summer, the season with the most biogenic activity. However, coverage in other seasons is limited, and must be supplemented with other CO 2 observations, forcing the challenge of assimilating both column and surface observations in the same inversion. It is an interesting thought experiment to see what the model-model differences would be if we did have complete satellite sampling coverage over the course of a year. We sampled each model at its own horizontal grid resolution at the hour of the day with the most GOSAT XCO 2 soundings (20 UTC), created simulated XCO 2 from CO 2 columns interpolated to the pressure levels commonly used in the ACOS algorithm, and examined model-model differences in the same way as we did with the simulated GOSAT soundings. We compared the differences for summer, the season with the best GOSAT spatial coverage (See Figures A4-A6). WRF values below 850 hPa are lower than the CMS-Flux values in the east, and WRF values are higher over most of the continent above 850 hPa. These results are reasonably consistent with the summer results that are shown in Figures 5 and 7, which are conditioned on the spatiotemporal coverage of the GOSAT soundings. This is encouraging. While we will never have perfect coverage in every season with a satellite XCO 2 product, the results presented here show that model-model differences do not appear to be overly dependent on the GOSAT sampling coverage. Future studies should also consider temporal correlations in the residuals when denser sampling is available ( e.g., OCO-2 or OCO-3) to examine the signatures of surface fluxes as compared to transport differences.
We have created a viable framework for comparing the transport of surface fluxes optimized in one model in another model. Theoretically, this allows for comparisons using different grid resolutions, meteorological drivers, and model parameterization options. In this experiment, we do see some differences between models in horizontal winds and boundary layer mixing, but see no clear advantage of the mesoscale model in the simulation of the satellite-derived XCO 2 . The framework does provide a computationally feasible laboratory for investigating other possible model configurations. Repeating the experiment in WRF with perturbations and using other configurations (boundary layer physics parameterizations) yield an ensemble of results that better establish an envelope of transport uncertainty for the CMS-Flux optimized biogenic flux solution. Additional sources of uncertainties, such as driver meteorological fields, assimilation of observed meteorology, or finer model resolution, might provide a more accurate range of transport simulations to establish an optimal configuration of the regional modeling system.

Conclusions
We have established a framework for effectively nesting a mesoscale model within a global model achieving approximate mass conservation of the trace gas CO 2 . The CMS-Flux and WRF simulated column-averaged GOSAT XCO 2 samples are comparable within a few parts per million with some spatial differences, across all seasons and all geographic locations in the WRF North American domain. Our mass-conserved coupling scheme shows a significant improvement when evaluated against GOSAT data (0.1 to 0.5 ppm) as compared to traditional coupling schemes (0.5 to 1.5 ppm), more consistent with the model-data differences of the CMS-Flux inversion system. The models used here have very different horizontal and vertical resolutions and computational grids. The framework presented here makes it possible to follow the CO 2 from surface fluxes optimized in a global model into a regional domain, allowing for the testing of various transport options.
Although simulations of entire columns agree when averaged over the domain between the models, noticeable vertical differences in the distribution of the CO 2 within the column are shown by comparison to discrete flask samples from the NOAA aircraft program. These differences originate from vertical mixing in the PBL and in the transport of air masses from the boundaries into the WRF simulation domain. These differences, small when considering total column XCO 2 from the models (less than 1 ppm on average) will affect the inverse sources and sinks across North America, attributing signals to the surface fluxes and the CMS-Flux boundaries differently for each model. Comparison to NOAA tower data revealed biases in Spring and Fall due to large-scale transport while Summer and Winter were affected by vertical mixing differences. A large fraction of the model differences in column XCO 2 arose from the northwestern US and propagated into the simulation domain following the Mid-latitudinal Jet Stream. The differences are primarily explained by the vertical transport of CO 2 molecules above the PBL top, where wind speed increases significantly, modifying the distribution of CO 2 in the horizontal dimension. Unfortunately, both of the models show significant biases with respect to meteorological observations that vary with season. At the sites we examined, wind speeds are low in GEOS-5 in all seasons, especially at higher altitudes. WRF shows positive biases near the surface, while GEOS-Chem shows negative wind speed biases from the surface up to 250 hPa. There is the potential for significant transport bias with either modeling system, and in this experiment, WRF is not obviously better. Assimilating meteorological observations in WRF should reduce the wind bias in WRF, but the vertical mixing requires the calibration of the physical parameterizations to reduce vertical mixing differences across models.