Correlated observation error models for assimilating all-sky infrared radiances

Geer, Alan J.

doi:https://doi.org/10.5194/amt-12-3629-2019

Articles | Volume 12, issue 7

Research article

04 Jul 2019

Abstract

The benefit of hyperspectral infrared sounders to weather forecasting has been improved with the representation of inter-channel correlations in the observation error model. A further step would be to assimilate these observations in all-sky conditions. However, in cloudy skies, observation errors exhibit much stronger inter-channel correlations, as well as much larger variances, compared to clear-sky conditions. An observation error model is developed to represent these effects, building from the symmetric error models developed for all-sky microwave assimilation. The combination of variational quality control with correlated errors is also introduced. The new error model is tested in all-sky assimilation of seven water vapour sounding channels from the Infrared Atmospheric Sounding Interferometer (IASI). However, its initial formulation degrades both tropospheric and stratospheric analyses. To explain this, the eigendeparture and eigenjacobian are introduced as a way of understanding the effect of correlated observation errors in data assimilation. The trailing eigenvalues can be problematic because they strongly amplify high-order harmonic combinations of the water vapour channels, which could have at least three consequences. First, if there are small inter-channel biases, these can be greatly amplified. Second, the trailing eigenjacobians map onto features resembling gravity waves that the data assimilation may not be able to handle. Finally, these harmonic combinations can amplify trace sensitivities, for example, revealing a strong upper stratospheric sensitivity over high cloud in what are usually mid- to upper-tropospheric water vapour channels. A likely explanation is the sensitivity to gravity wave features that are present in the observations but hard for the data assimilation to handle. After reducing the sensitivity to the trailing eigenjacobians, the new error covariance matrix gives good results in all-sky infrared assimilation.

Dates

Received: 26 Oct 2018 – Discussion started: 16 Nov 2018 – Revised: 31 May 2019 – Accepted: 12 Jun 2019 – Published: 04 Jul 2019

The author's copyright for this publication is transferred to ECMWF.

1 Introduction

Geophysical quantities are inferred from indirect observations (such as satellite radiances) using techniques ultimately derived from Bayes' theorem. This requires a representation of the error in the prior state and in the observations. Especially in meteorological data assimilation, accurate modelling of the prior or “background” error is critical (e.g. Bannister, 2008 a, b), and the need to improve this has led to major algorithmic developments, such as the move to hybrid and ensemble data assimilation (e.g. Bonavita et al., 2012; Houtekamer and Zhang, 2016). But in contrast to the sophistication of modern background error models, observation errors have usually been represented by a single, globally constant standard deviation. This is increasingly recognized as inadequate since observation errors do not just account for the instrument noise but also for errors in the observation operator (e.g. the radiative transfer model that links the state and the observed radiance) and representation errors (e.g. Janjić et al., 2018). All these error sources can be correlated in time and space and between satellite channels, and their error variances and correlations can vary greatly depending on the meteorological situation.

Recently many numerical weather prediction (NWP) centres have started to represent observation error with more sophistication. For the assimilation of hyperspectral infrared (IR) sounder radiances in clear-sky conditions, the representation of inter-channel error correlations has improved the skill of operational weather forecasts (Weston et al., 2014; Bormann et al., 2016; Eresmaa et al., 2017; Campbell et al., 2017). For the assimilation of microwave radiances in all-sky conditions, observation error models have needed situation dependence, representing errors that are smaller in clear-sky conditions and larger in the presence of cloud and precipitation (Geer and Bauer, 2011; Geer et al., 2018). Along with other developments, this has allowed all-sky microwave assimilation to provide significant gains in forecast skill (Geer et al., 2017). To develop the assimilation of hyperspectral IR radiances in all-sky conditions, it is likely that both inter-channel error correlations and situation dependence will be required. Indeed, even for all-sky microwave observations, inter-channel error correlations are present and become much stronger in the presence of cloud (Bormann et al., 2011), although this has been ignored so far. Hence, this study aims to find an observation error model that can include both inter-channel error correlations and situation dependence as a function of cloud amount.

The first problem of observation error modelling is to estimate the observation error covariances R. These can be inferred from the covariance of background departures, which is, on average, equal to the sum of the background and observation error covariances in observation space, $E (d d^{T}) = {HBH}^{T} + R$ . Here, d is the background departure, B is the background error covariance matrix, H is the linearized observation operator, E() is the expectation operator, ^T indicates a transpose, and it has been assumed there are no correlations between background and observation errors. A range of techniques have been proposed to isolate the observation errors. If the observation errors have no spatial correlations, then the Hollingsworth and Lönnberg (1986) spatial separation technique is appropriate. If an estimate of the background error is available, it can be subtracted (e.g. Bormann and Bauer, 2010). Alternatively, the covariance of background and analysis departures is equal to the observation error in a data assimilation system where the errors are already correctly specified (Desroziers et al., 2005). This latter technique is widely used (e.g. Stewart et al., 2014; Waller et al., 2016a), and it has been the starting point for the observational error covariance matrices used for hyperspectral IR data assimilation at operational centres.
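As an illustration of the relation $E (d d^{T}) = {HBH}^{T} + R$ , the following minimal Python sketch draws synthetic, mutually uncorrelated observation and background errors, forms the departures, and checks that their sample covariance approaches the sum of the two covariances. It is not code from any operational system: the function names (`departure_covariance`, `subtract_background`) and all numerical values are hypothetical, and the background error is sampled directly in observation space for simplicity.

```python
import numpy as np

def departure_covariance(d):
    """Sample estimate of E(d d^T) from departures d of shape (n_obs, n_chan)."""
    d = np.asarray(d, dtype=float)
    d = d - d.mean(axis=0)              # remove the mean departure per channel
    return d.T @ d / (d.shape[0] - 1)

def subtract_background(cov_d, hbht):
    """Estimate R by subtracting an available estimate of H B H^T."""
    return cov_d - hbht

rng = np.random.default_rng(0)

# Hypothetical "true" covariances for two channels (illustrative values only).
R_true = np.array([[1.0, 0.6], [0.6, 1.5]])
HBHt_true = np.array([[0.2, 0.05], [0.05, 0.1]])

n = 100_000
eps_o = rng.multivariate_normal(np.zeros(2), R_true, size=n)     # observation error
eps_b = rng.multivariate_normal(np.zeros(2), HBHt_true, size=n)  # background error in obs space

# Background departure: observation error minus background error in obs space.
d = eps_o - eps_b

cov_d = departure_covariance(d)
R_est = subtract_background(cov_d, HBHt_true)
```

In this toy setting the subtraction recovers R only because the estimate of HBH^T is exact; with a real, imperfect background error estimate the result would inherit its errors.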

The estimation of situation-dependent error variances for all-sky microwave assimilation has followed a different approach (Geer and Bauer, 2011; Geer et al., 2018). A piecewise linear error model is fitted, as a function of a “symmetric” cloud proxy variable, to the standard deviation of the background departures d. Hence, the error model is fitted to the sum of observation and background errors (the total error). Originally, the error model allowed for a scaling factor, estimated by trial and error, that was supposed to remove the error variance due to the background errors. In practice, the best scaling factor was 1, i.e. no scaling at all, and current practice is to provide observation errors that are equal in size to the total errors (e.g. Zhu et al., 2016; Kazumori et al., 2016). Because of the limited predictability of cloud and precipitation at small scales (e.g. Fabry and Sun, 2010), total errors in cloudy situations are dominated by large displacement and intensity errors in forecasted cloud and precipitation, often imprecisely known as “mislocation error”. While this might be expected to be part of the background error, most weather centres use some variant of four-dimensional assimilation, in which a short model forecast is used to map from the assimilation control variables to the atmospheric state at the observation location. The error in this forecast thus belongs in the observation error, unless otherwise represented as model error. This error is also often seen as an error of representation, even if it does not come from the mismatch in scales between the observation and the model, which is more usually called representation error (see Janjić et al., 2018; Geer et al., 2018).
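The shape of such a piecewise linear error model can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the proxy is taken to be the mean of an observed and a simulated cloud indicator (which is what makes it “symmetric”), and the thresholds `c_clr`, `c_cld` and the error levels `sigma_clr`, `sigma_cld` are hypothetical parameters that would in practice be fitted to the standard deviation of background departures.

```python
import numpy as np

def symmetric_cloud_proxy(c_obs, c_bg):
    """Mean of the observed and simulated cloud indicators (assumed definition)."""
    return 0.5 * (np.asarray(c_obs, dtype=float) + np.asarray(c_bg, dtype=float))

def piecewise_linear_sigma(c_sym, c_clr, c_cld, sigma_clr, sigma_cld):
    """Error standard deviation as a piecewise linear function of the proxy.

    Constant at sigma_clr below c_clr, constant at sigma_cld above c_cld,
    and linear in between (np.interp holds the end values outside the range).
    """
    return np.interp(c_sym, [c_clr, c_cld], [sigma_clr, sigma_cld])

# Three hypothetical scenes: clear in both, cloud mismatch, cloudy in both.
c_sym = symmetric_cloud_proxy([0.0, 0.2, 1.0], [0.0, 0.6, 1.0])
sigma = piecewise_linear_sigma(c_sym, c_clr=0.1, c_cld=0.7,
                               sigma_clr=1.0, sigma_cld=10.0)
```

Because the proxy averages observation and background, a scene where only one of them is cloudy still receives an inflated error, which is the point of using a symmetric predictor.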

Since all-sky microwave assimilation has been most successful where $\tilde{R} ≃ {HBH}^{T} + R$ (using a tilde to distinguish an assumed error model from the true errors), this suggests that in cloud and precipitation, $∥ {HBH}^{T} ∥ ≪ ∥ R ∥$ , i.e. that observation errors dominate. An alternative hypothesis would be that observation errors needed to be inflated because of other suboptimalities, most likely relating to the observation error correlations that are still not modelled in all-sky assimilation. However, Harnisch et al. (2016) used the spread of an ensemble of forecasts to estimate the background errors HBH^T in cloudy conditions, finding that in many cases they are around a third the size of the total error standard deviation (or a ninth of the error variance). This result could have come from a lack of spread in the ensemble, but if not it supports the dominance of observation error in cloudy and precipitating situations.

The current work will follow common practice in all-sky assimilation by fitting an observation error model directly to the covariances of the background departures, assuming that background error is relatively small. This is justified both by the previous success of this approach and by the lack of suitable alternatives. First the Hollingsworth and Lönnberg (1986) approach is ruled out because the mislocation error is spatially correlated. Second, a good estimate of the background error in cloudy situations is not available at ECMWF. Although an ensemble of data assimilations (EDA, Bonavita et al., 2012) from which to compute a spread of all-sky background departures is available, there are some strange features in its estimates of HBH^T for any observation with a situation-dependent error model. This unresolved issue has been present for many years and it has prevented the use of the EDA ensemble statistics to support the development of all-sky assimilation at ECMWF.

One other option for estimating observation error is the Desroziers et al. (2005) approach, but it also has not been used in this work. First, idealized theoretical studies have shown limitations in its ability to identify the true observation error (e.g. Waller et al., 2016 b; Ménard, 2016). Second, the diagnosed error covariance matrices have never been used in operational assimilation without additional error inflation. Bormann et al. (2016) inflated error standard deviations by a factor 1.75, determined by trial and error, Weston et al. (2014) inflated all eigenvalues of the error covariance matrix, and Campbell et al. (2017) used only the diagnosed error correlations, sticking with pre-existing error variance estimates that were much larger than those diagnosed. Bormann et al. (2015) argue that this does not necessarily invalidate the observation error estimates if inflation is necessary to address other remaining suboptimalities, such as the lack of treatment for temporal and spatial correlations. However, at a minimum these estimates are not immediately applicable in real systems without time-consuming trial-and-error adjustment. Third, when applied to humidity sounding channels the Desroziers et al. (2005) technique produces results that are not yet fully understood. Different to the temperature channels it diagnoses observation error standard deviations that are much smaller than the standard deviation of background departures (e.g. Bormann and Bauer, 2010; Waller et al., 2016 a) but much larger than the instrument noise (Bormann et al., 2016). Weston et al. (2014) explained this as representation error from scale mismatches, observing that it gets worse with coarser model resolution. 
Nevertheless, in the current work, the best justification to use the covariance of the background departures as an estimate of the observation error is that in clear skies this covariance matrix is nearly identical to the 1.75 times inflated diagnosed error covariance matrix of Bormann et al. (2016), at least for humidity sounding channels. Hence, just starting from the covariances of the background departures saves work.

This study has been performed as part of wider developments towards the assimilation of IR radiances in all-sky conditions at ECMWF, to be reported by Geer et al. (2019). Understanding how best to implement a correlated, situation-dependent error model for all-sky assimilation was the final step in getting this working. Section 2 will provide more details of the all-sky IR developments, alongside a general description of the ECMWF forecasting system in which this work has been performed. As a first step, the technique has been applied to the mid- and upper-tropospheric water vapour channels of the Infrared Atmospheric Sounding Interferometer (IASI), which have smaller errors and more linear sensitivity to the model state, making them more amenable to data assimilation than temperature-sounding or window channels (Chevallier et al., 2004). Section 3 gives a mathematical overview of the place of the error covariance matrices in data assimilation and the importance of the eigenvector representation of these matrices. In particular, it introduces the concepts of eigendepartures and eigenjacobians, which mirror their parent concepts but involve projection onto the uncorrelated eigenbasis of the error covariance matrix. These diagnostics give great insight into what is going on when an inter-channel error correlation matrix is used in data assimilation. This section also shows how error correlations and situation dependence can be combined, by using a symmetric all-sky error inflation model to inflate the leading eigenvector of the error covariance matrix. This provides an error covariance matrix that resembles the existing clear-sky error model in clear-sky situations (Bormann et al., 2016) but gives larger error variances and larger inter-channel error correlations in cloudy situations. The proposed error covariance model is tested in the ECMWF system in Sect. 4, initially with poor results, but a key realization was the role of the trailing eigenvalues in amplifying information that the assimilation system may not handle properly, such as biases, gravity waves and trace sensitivities to the stratosphere. Hence, the most successful error model reduces the weight given to the trailing eigenvectors, following earlier studies (Weston et al., 2014; Bormann et al., 2016; Campbell et al., 2017) but with a broader understanding of why the trailing eigenvectors can be so problematic.

2 Methods

2.1 Data assimilation framework

ECMWF operates an NWP system with the aim of predicting weather globally for the medium range and beyond (day 3 onwards). Initial conditions for the forecast are produced by 4-D variational data assimilation (4D-Var, Rabier et al., 2000), which combines a 12 h background forecast with the observations available within either a 12 h or 6 h assimilation window. The “delayed cutoff” 12 h window is used to create the background forecasts for the next assimilation windows; the “early delivery” 6 h window is used to initialize the main forecasts using the most recently available observations but does not contribute to the next background. An ensemble of data assimilations and an ensemble of forecasts are also run to provide dynamically varying background errors and estimates of forecast error.

The forecast model is run at TCo1279 horizontal resolution (around 8–9 km) and with 137 terrain-following vertical levels. Cloud water, cloud ice, and rain and snow precipitation are prognostic variables. Both the large-scale and convective moist processes are parameterized. The convective precipitation is not included in the prognostic precipitation variables, which is not an issue for the IR but requires special treatment in the all-sky microwave, which is strongly sensitive to convective precipitation (Geer et al., 2018). However, detrained convective cirrus cloud is represented in the prognostic cloud variables, allowing a good representation of the clouds seen by infrared sounders.

The data assimilation uses an incremental formulation (Courtier et al., 1994), with inner loops run at reduced but increasing resolution, up to TL399 (approx. 50 km). Tangent-linear and adjoint models of cloud and precipitation physics (e.g. Lopez and Moreau, 2005) and of other parts of the forecast model link changes in the control variables (transforms of surface pressure, horizontal wind vector, temperature, specific humidity and ozone) to changes in dry and moist variables at the observation location. Most available components of the global observing system are assimilated, including surface-based platforms (e.g. buoys, surface stations, aircraft and radiosondes) and satellites (e.g. radiances from infrared and microwave instruments on polar and geostationary platforms, radio occultation, atmospheric motion vectors, and scatterometers for ocean surface wind vectors). This includes all-sky assimilation of microwave humidity sounders and microwave imagers (Geer et al., 2017). The latter are used to improve the dynamical initial conditions through “generalized 4D-Var tracing”, where the initial conditions are adjusted so that the updated model forecast provides a better fit to the observed patterns of water vapour, cloud and precipitation. It is expected that all-sky IR water vapour sounding channels will improve analyses and forecasts in a similar manner, as they already do in clear-sky conditions (Peubey and McNally, 2009). Full documentation of the ECMWF system is available (ECMWF, 2018).

The experiments in this study are run at reduced horizontal resolution: forecasts and data assimilation outer loops are at TCo399, around 25 km, and inner loops reach a maximum of TL255, around 80 km. The early delivery assimilation window is dropped, so that forecasts out to 240 h are run from the main 12 h assimilation window. This is the standard configuration for testing new developments at ECMWF and in most cases its results have been representative of those in the full operational configuration. Experimentation is carried out for two periods of 3 months: from 1 June to 31 August 2017 and from 1 December 2017 to 28 February 2018; results from the two periods are combined so as to give up to around 360 forecast samples. A control experiment has been run that includes the full observing system but without the seven IASI water vapour channels, and then a series of experiments (to be introduced later) add these channels with various configurations of observation error and variational quality control (VarQC). Cycle 45r1 of the ECMWF system has been used in most of the work presented here: this is a version that went operational in June 2018.

2.2 All-sky infrared assimilation

Full details of the all-sky IR configuration are given by Geer et al. (2019) and only an overview is provided here. As mentioned in the Introduction, all-sky IR assimilation is first being tested on channels sensitive to upper-tropospheric water vapour and cloud, due to their better linearity and smaller errors (Chevallier et al., 2004). The combination of a forecast model and a cloud-capable observation operator has long been able to make simulations that resemble the real observations (Chevallier and Kelly, 2002) but substantial development was needed to create an observation operator that has small enough systematic errors for data assimilation. In particular the representation of cirrus cloud has required improvement. In this work, RTTOV v12.2 (Saunders et al., 2018) is used to simulate the IASI observations, using the Chou-scaling cloud scheme with a multiple independent column representation of cloud overlap, the “OPAC” water clouds of Matricardi (2005) and the “Baran” ice cloud of Vidot et al. (2015). Using the ECMWF background as input, this produces monthly mean biases of at worst around 2–4 K in cloudy areas. Compared to the size of observation errors in cloudy areas this is a negligible bias (see Sect. 3.4). Therefore, the observation operator and forecast model are sufficiently accurate for reproducing the observations that all-sky assimilation is a viable possibility.

Table 1. Details of the seven mid- and upper-tropospheric water vapour channels to be assimilated in all-sky conditions.

ECMWF assimilates (see, e.g. Collard and McNally, 2009; Bormann et al., 2016) a subset of 191 out of the 8461 channels available from IASI, currently from two polar orbiting satellites, Metop-A and Metop-B (Klaes et al., 2007). These channels all have a spectral width of 0.25 cm⁻¹ and provide information on the atmospheric temperature profile and the surface (165 channels at wavenumbers between approximately 649 and 875 cm⁻¹), ozone and the surface (16 channels between 1014 and 1062 cm⁻¹), mid- and upper-tropospheric water vapour (7 channels between 1367 and 1422 cm⁻¹), and lower tropospheric moisture (3 channels between 1990 and 2015 cm⁻¹). Table 1 gives the details of the seven mid- and upper-tropospheric water vapour channels investigated in this work. Note that, although the global mean peak of the weighting function is given, in practice the weighting functions move up and down in the atmosphere by many hundreds of hPa depending on the relative humidity profile of the free troposphere. Figure 10 later gives evidence of sensitivity down to the surface over the Weddell Sea during Antarctic winter. Figures 5a and 6a later illustrate more typical temperature and humidity Jacobians for these channels.

For the operational clear-sky assimilation, a globally constant observation error covariance matrix is used, which includes correlations between all the different channels of one observation (Bormann et al., 2016). Observations are thinned to around 100 km spacings to avoid, as much as possible, the spatial correlations of observation error, which are not modelled. Cloudy scenes are detected and removed using a combination of the McNally and Watts (2003) approach, imager cloud detection (Eresmaa, 2014) and by using a thinning algorithm that selects the observation with the smallest background departure in a window channel. Aerosol-affected scenes are also detected and removed. A small number of scenes detected as being completely overcast are assimilated using this diagnosed cloud top as a lower boundary (McNally, 2009), but this does not apply to the water vapour (WV) channels. All selected channels are assimilated over ocean and land, with two main exclusions: channels over 875 cm⁻¹ over sea ice, which includes the WV channels, and any channel that might be sensitive to the surface over land. Other quality control techniques include the rejection of channels with normalized departures greater than 2.5, but although many other observation types use variational quality control (VarQC, Andersson and Järvinen, 1999) this is not applied to the IASI observations. Variational bias correction (VarBC) is applied, in common with most other satellite observation types (Auligné et al., 2007) with a globally constant predictor, four air mass predictors based on layer thicknesses across four different ranges and a third-order polynomial in the instrument scan position. The surface skin temperature is treated as a sink variable in observation space, allowing the window channels to update the potentially erroneous first-guess skin temperature.

Hyperspectral infrared observations are also assimilated from the Atmospheric Infrared Sounder (AIRS) and Cross-track Infrared Sounder (CrIS) using similar configurations to IASI. Further, the mid- and upper-tropospheric water vapour channels are assimilated from five geostationary imagers around the Equator. The presence of all of these data (plus its equivalent from many microwave sounders) means that changes in forecast quality coming from different usage of IASI water vapour data will not be large, but, as will be explained below, there is still enough sensitivity in the fits of the short-range forecast to other observations that it is possible to clearly measure the impact of the work described here.

In the all-sky IR experiments, it is only the usage of the seven mid- and upper-tropospheric water vapour sounding channels that has been changed. The error correlations between these channels and the others are set to zero, so that the seven channels are treated in many respects as an independent instrument. The remaining coupling to the other channels is through the skin-temperature sink variable and through the thinning that includes a selection of the smallest window channel background departure. However, this is not expected to have much effect due to the removal, through a screening test, of situations where the seven chosen channels have sensitivity to the surface. The principal changes for all-sky assimilation are (i) to stop rejecting cloud-affected observations (but to retain rejection of aerosol-affected scenes); (ii) to use the cloud-capable version of the RTTOV observation operator described above and (iii) to use the situation-dependent all-sky error covariance matrix developed in the current study.

Some of the more detailed processing is retained from the clear-sky assimilation, such as the 100 km box thinning and the same bias correction model. But most other data selection and quality control processes have been changed to implement all-sky assimilation. Background quality control is now applied to the whole block of seven channels, so that either all channels are kept, or all are rejected. This means that the eigenvectors of the observation error covariance matrix remain fixed, allowing the eigenvalues to be scaled in a controlled way as described later. Instead of checking the size of the normalized background departures, it is the size of the normalized eigendepartures (Sect. 3.1) that is checked: if any of the seven eigendepartures has a magnitude larger than 3, the whole block of WV channels is rejected. Further, the block is rejected if the lowest-peaking channel (channel 2889) has a surface to space transmittance greater than 0.1. This protects against situations where the WV channels actually do have significant surface sensitivity, such as on dry days over the Andes. Finally, variational quality control has been activated for the seven WV channels because it has proved essential to getting good results from all-sky microwave assimilation (Geer and Bauer, 2011). This is a novel development in the context of correlated observation errors because of the complexities of representing the prior probability of gross error when it is correlated across channels or levels (Ingleby and Lorenc, 1993). The solution has been to follow the proposal of Andersson and Järvinen (1999) to apply VarQC to the eigenprojected departures, assuming that the prior probability of gross error is independent for each eigenvalue. Further details are given in the Appendix. 
The assumption is that in all-sky assimilation, the gross error modelled by VarQC does not primarily come from radiance–space issues (for example, the failure of an individual channel) but rather from scenes where the analysis struggles to match the observed cloud or precipitation.
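The block-wise screening described above can be sketched as follows. This is a simplified illustration, not the ECMWF implementation: the function name `screen_wv_block` and its arguments are hypothetical, the thresholds are the ones quoted in the text (3 for the normalized eigendepartures, 0.1 for the surface-to-space transmittance), and a two-channel matrix with illustrative values stands in for the seven-channel one.

```python
import numpy as np

def screen_wv_block(d, R, tau_surface, max_norm_eigdep=3.0, max_tau=0.1):
    """Return True if the whole block of channels is kept, False if rejected.

    d           : background departures for the block, shape (n_chan,)
    R           : assumed observation error covariance, shape (n_chan, n_chan)
    tau_surface : surface-to-space transmittance of the lowest-peaking channel
    """
    d = np.asarray(d, dtype=float)
    lam, E = np.linalg.eigh(R)                 # eigenvalues (ascending), eigenvectors
    norm_eigdep = (E.T @ d) / np.sqrt(lam)     # normalized eigendepartures
    if np.any(np.abs(norm_eigdep) > max_norm_eigdep):
        return False                           # reject all channels together
    if tau_surface > max_tau:
        return False                           # too much surface sensitivity
    return True

# Two strongly correlated channels with unit error variance (illustrative values).
R = np.array([[1.0, 0.9], [0.9, 1.0]])

keep_same_sign = screen_wv_block([1.0, 1.0], R, tau_surface=0.0)
keep_opposite_sign = screen_wv_block([1.0, -1.0], R, tau_surface=0.0)
keep_surface = screen_wv_block([1.0, 1.0], R, tau_surface=0.5)
```

In this example both departures are only one error standard deviation in size, yet the opposite-signed pair is rejected because it projects entirely onto the trailing eigenvector, whose eigenvalue is small. A check on the per-channel normalized departures would not distinguish the two cases.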

3 Error covariance matrices

3.1 Definitions and concepts

To find the best estimate of the state, x, variational assimilation minimizes a cost function, presented here in its most simplified form:

\begin{matrix} (1) & J (x) = \frac{1}{2} {(x - x^{b})}^{T} {\tilde{B}}^{- 1} (x - x^{b}) + \frac{1}{2} d^{T} {\tilde{R}}^{- 1} d . \end{matrix}

Here, x^b is the background (prior) state and $\tilde{B}$ its error covariance matrix; this background error determines how far the analysis can go from the background state. As in the Introduction, the tilde is used to signify the error covariance matrices as applied practically and to distinguish them from the unknown true matrices. $\tilde{R}$ is the observation error covariance and d is the departure between the state and the observations y:

\begin{matrix} (2) & d = y - b - H (x), \end{matrix}

where H() is the non-linear observation operator that maps from state space to observation space and b is a bias correction (here estimated by VarBC). In 4D-Var, the observation operator H() is further extended to include a forecast model that propagates the state from the beginning of the time window (where the analysis is being made) to the time of the observation. For clarity, the more complex aspects of the real cost function used at ECMWF have been ignored: for example, modifications are required to estimate VarBC and VarQC parameters as part of the minimization. Further, the notation has been simplified compared to that introduced by Ide et al. (1997).

Some of the key concepts of observation error covariance matrices can be understood through the second term on the right-hand side of Eq. (1), the observation term $J^{o} (x) = \frac{1}{2} d^{T} {\tilde{R}}^{- 1} d$ . If the observation errors are uncorrelated then the observation error matrix is diagonal and contains the square of the error standard deviations, ${(σ_{i}^{o})}^{2}$ , for each observation, i, so that for N observations the J^o part of the cost function can be computed as follows:

\begin{matrix} (3) & J^{o} (x) = \frac{1}{2} d^{T} {\tilde{R}}^{- 1} d = \frac{1}{2} \sum_{i = 1}^{N} {(\frac{d_{i}}{σ_{i}^{o}})}^{2}, \end{matrix}

where d_i is the departure computed for each observation. The error of each observation is clearly independent of the others, although the terms in the summation are not independent due to the use of the state to compute H(x) in d_i. As described in the Introduction, the expected covariance of the background departures is HBH^T+R, but in the derivation of all-sky observation error models a significant approximation is often made here: if B were zero (or, as discussed earlier, much smaller than R) then the background departures normalized by the observation errors, $\frac{d_{i}^{b}}{σ_{i}^{o}}$ , should be distributed according to a Gaussian with a standard deviation of 1. If an observation error model is based directly on the expectation of background departures, as is mostly the case for all-sky assimilation, that model can be validated by showing that it produces a Gaussian probability density function (PDF) of the normalized background departures (Geer and Bauer, 2011).
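A short numerical check of Eq. (3) may help. The sketch below, with hypothetical values throughout, evaluates the observation term both as a matrix product with a diagonal $\tilde{R}$ and as the sum of squared normalized departures, and then illustrates the validation idea: synthetic departures drawn with the assumed error standard deviations give normalized departures with a standard deviation close to 1.

```python
import numpy as np

def jo_uncorrelated(d, sigma_o):
    """Observation term of the cost function for uncorrelated errors, Eq. (3)."""
    return 0.5 * np.sum((d / sigma_o) ** 2)

# Hypothetical departures and error standard deviations for three observations.
d = np.array([0.5, -1.0, 2.0])
sigma_o = np.array([1.0, 2.0, 4.0])

R = np.diag(sigma_o ** 2)                        # diagonal error covariance
jo_matrix = 0.5 * d @ np.linalg.solve(R, d)      # 0.5 * d^T R^-1 d
jo_sum = jo_uncorrelated(d, sigma_o)             # summation form

# Validation idea: if the error model is right (and background error is
# neglected), normalized departures have a standard deviation of about 1.
rng = np.random.default_rng(1)
d_b = rng.normal(0.0, sigma_o, size=(50_000, 3))  # synthetic departures
norm_std = (d_b / sigma_o).std(axis=0)
```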

The gradient of the observation term with respect to the state can also be written as a summation including independent observation errors:

\begin{matrix} (4) & J^{o} (x)^{'} = - H^{T} {\tilde{R}}^{- 1} d = - \sum_{i = 1}^{N} h_{i} \frac{d_{i}}{(σ_{i}^{o})^{2}} . \end{matrix}

Here H^T is the transpose of the matrix of partial derivatives of the observation operator H() with respect to the state (the Jacobian) and h_i are its columns (one per observation). Kalnay (2003) further explains the construction of this derivative. In any case, the Jacobians of the observation operator are a familiar way to represent the sensitivity of the observations to the state.
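Equation (4) can be verified on a toy problem with a linear observation operator, for which H(x) = Hx. In the sketch below all matrices and vectors are hypothetical; the summation form of the gradient is compared with a central finite-difference estimate and with the matrix form.

```python
import numpy as np

# Hypothetical linear observation operator: 3 observations, 2 state variables.
H = np.array([[1.0, 0.5],
              [0.0, 2.0],
              [1.0, -1.0]])
y = np.array([1.0, 2.0, 0.5])        # observations
b = np.zeros(3)                      # bias correction (zero here)
sigma_o = np.array([1.0, 0.5, 2.0])  # observation error standard deviations

def departure(x):
    """Eq. (2) with a linear observation operator."""
    return y - b - H @ x

def jo(x):
    """Observation term for uncorrelated errors, Eq. (3)."""
    return 0.5 * np.sum((departure(x) / sigma_o) ** 2)

def grad_jo(x):
    """Eq. (4): minus the sum over observations of h_i d_i / sigma_i^2,
    where h_i is the i-th column of H^T."""
    d = departure(x)
    return -sum(H.T[:, i] * d[i] / sigma_o[i] ** 2 for i in range(len(d)))

x0 = np.array([0.3, -0.2])
eps = 1e-6
grad_fd = np.array([(jo(x0 + eps * e) - jo(x0 - eps * e)) / (2 * eps)
                    for e in np.eye(2)])
grad_matrix = -H.T @ (departure(x0) / sigma_o ** 2)
```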

If observation errors are correlated, none of the above simplifications can be made because of the off-diagonal terms in R. However, it is possible to project the observations onto an uncorrelated basis using an eigenvector decomposition of the observation error covariance matrix:

\begin{matrix} (5) & \tilde{R} = E Λ E^{T}, \end{matrix}

where E is a matrix with its columns being the eigenvectors e_j and Λ being a diagonal matrix containing the eigenvalues λ_j. The observation cost function can again be written as a summation in which the errors – now described by the eigenvalues – are independent:

\begin{matrix} (6) & J^{o} (x) = \frac{1}{2} d^{T} E Λ^{- 1} E^{T} d = \frac{1}{2} \sum_{j = 1}^{N} {(\frac{e_{j}^{T} d}{(λ_{j})^{0.5}})}^{2} . \end{matrix}

This is a fundamental change in the way we have to think about observations: when their errors are correlated, we can think about a sum over what we might term the eigendepartures $e_{j}^{T} d$ , with implied observation error standard deviations given by the square root of the eigenvalues (λ_j)^0.5. If the variation in the background error is relatively small, each of the terms in the summation will on average have roughly the same weight in the cost function. In other words, normalized eigendepartures are (excluding the effect of the background errors) all equally important in the cost function and hence have equal weight in the data assimilation. Other practical consequences are as follows. First, the eigenvector decomposition can be a useful way of computing the inverse of the R matrix when computing the cost function. Second, it gives a practical way to combine a correlated observation error matrix with variational quality control, which is more easily applied to the now independent eigendepartures (Andersson and Järvinen, 1999, see the Appendix).
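Equations (5) and (6) can be illustrated with a small numerical sketch. The covariance matrix below is a randomly generated symmetric positive definite matrix for seven hypothetical channels, not one of the matrices from this study, and the variable names are chosen for illustration only. It computes the eigendepartures and checks that the summation over them reproduces the cost function computed directly with the inverse of R.

```python
import numpy as np

rng = np.random.default_rng(1)
n_chan = 7                                   # hypothetical number of channels

# Synthetic symmetric positive definite error covariance matrix (not real data)
A = rng.normal(size=(n_chan, n_chan))
R = A @ A.T + n_chan * np.eye(n_chan)
d = rng.normal(size=n_chan)                  # departures for one observation

# Eq. (5): R = E Lambda E^T; eigh returns eigenvalues in ascending order
lam, E = np.linalg.eigh(R)

# Eigendepartures e_j^T d and their normalized form
eigendep = E.T @ d
normalized_eigendep = eigendep / np.sqrt(lam)

# Eq. (6): cost as a sum over independent eigendepartures
J_eig = 0.5 * np.sum(normalized_eigendep**2)

# Direct evaluation of 0.5 * d^T R^{-1} d for comparison
J_direct = 0.5 * d @ np.linalg.solve(R, d)

print(np.isclose(J_eig, J_direct))
```

Because the normalized eigendepartures are independent in this basis, a per-term treatment such as quality control could in principle be applied to each element of `normalized_eigendep` separately.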

Just as there is an equivalence between the departures and the eigendepartures, there is also what we can call an eigenjacobian that gives the sensitivities of each eigenvector to the state. Now, the gradient of the observation cost function can be computed from the following summation:

\begin{matrix} (7) & J^{o} (x)^{'} = - H^{T} E Λ^{- 1} E^{T} d = - \sum_{j = 1}^{N} H^{T} e_{j} \frac{e_{j}^{T} d}{λ_{j}} . \end{matrix}

By analogy to Eq. (4), the eigenjacobian for each eigendeparture is given by H^Te_j. A common problem with the eigendepartures is trying to understand what they respond to physically, so the eigenjacobian gives a useful tool to understand their physical sensitivities.
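The eigenjacobians and the summation in Eq. (7) can be sketched in the same way. Here H is a random stand-in for the Jacobian of a single observation with seven hypothetical channels and a short state vector; none of the numbers are taken from the experiments in this study.

```python
import numpy as np

rng = np.random.default_rng(2)
n_chan, n_state = 7, 10                      # hypothetical sizes

# Synthetic covariance matrix, Jacobian and departures (not real data)
A = rng.normal(size=(n_chan, n_chan))
R = A @ A.T + n_chan * np.eye(n_chan)
H = rng.normal(size=(n_chan, n_state))       # one row per channel
d = rng.normal(size=n_chan)

lam, E = np.linalg.eigh(R)

# Column j of this matrix is the eigenjacobian H^T e_j
eigenjacobians = H.T @ E

# Eq. (7): each eigenjacobian is weighted by its eigendeparture over lambda_j
eigendep = E.T @ d
grad_eig = -eigenjacobians @ (eigendep / lam)

# Direct evaluation of -H^T R^{-1} d for comparison
grad_direct = -H.T @ np.linalg.solve(R, d)

print(np.allclose(grad_eig, grad_direct))
```

Plotting a column of `eigenjacobians` against the vertical coordinate of the state would show what the corresponding eigendeparture is sensitive to. The division by `lam` also shows that the terms with the smallest eigenvalues receive the largest weights in the gradient.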

Finally, these equations have so far been written with one giant observation error covariance matrix, but this is not how current data assimilation systems work. When the assumption is made that observation errors are uncorrelated in space, but correlated across the channels of one instrument, the matrix becomes block diagonal. The same maths can then be applied to the sub-matrices of R and the sub-vectors of the departures d for each individual observation, as is done in the rest of this study. For further background, see Rodgers (2000) and Kalnay (2003).
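The block-diagonal simplification can be checked with a short sketch. It assumes four hypothetical observation locations with three channels each, and a different randomly generated covariance block at each location (as would be the case if the errors depended on the scene); the full matrix is only built to confirm that the per-observation calculation gives the same cost.

```python
import numpy as np

rng = np.random.default_rng(3)
n_loc, n_chan = 4, 3                         # hypothetical sizes

def random_spd(k):
    """Return a synthetic symmetric positive definite k x k matrix."""
    a = rng.normal(size=(k, k))
    return a @ a.T + k * np.eye(k)

blocks = [random_spd(n_chan) for _ in range(n_loc)]
d = rng.normal(size=(n_loc, n_chan))         # departures, one row per location

# Assemble the full block-diagonal R for comparison only
R_full = np.zeros((n_loc * n_chan, n_loc * n_chan))
for k, R_k in enumerate(blocks):
    s = slice(k * n_chan, (k + 1) * n_chan)
    R_full[s, s] = R_k

d_flat = d.ravel()
J_full = 0.5 * d_flat @ np.linalg.solve(R_full, d_flat)

# Per-observation evaluation using the eigendecomposition of each block
J_blocks = 0.0
for R_k, d_k in zip(blocks, d):
    lam, E = np.linalg.eigh(R_k)
    J_blocks += 0.5 * np.sum((E.T @ d_k) ** 2 / lam)

print(np.isclose(J_full, J_blocks))
```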


Table 2. Details of the error covariance matrices examined here.


3.2 Clear-sky and all-sky covariance matrices

The all-sky error covariance matrix has been in development for a while, so a number of different versions will be examined here (Table 2). The one used in active experimentation is based on background departures from a development version of the all-sky IR assimilation at an earlier cycle (cycle 43r1) using the Metop-A IASI data from a single 12 h analysis window on 3 May 2016. A number of other matrices have been derived using data from 1 to 20 June 2017 from Metop-A and Metop-B from a passive monitoring experiment using the cycle 45r1 configuration described in Sect. 2 (passive monitoring here means that the background is kept fixed and taken from a control experiment, so it remains unaffected by the change in observation usage). The variety allows an assessment of the robustness of the error estimates. In most cases the sample includes all land and ocean scenes except where orography is greater than 2500 m and excluding the whole of the Antarctic continent. However, the covariances from cycle 45r1 have also been computed using further reduced samples: one keeping clear skies only and the other using all-sky conditions but keeping ocean only. Also, the corresponding part of the operational clear-sky error matrix of Bormann et al. (2016) has been examined.


Figure 1. Error correlation matrices C for the seven selected IASI upper-tropospheric water vapour channels, ordered by ascending altitude of weighting function, from (a) operational clear-sky observation errors and (b) estimated all-sky errors from cycle 43r1 experiments.
