Impacts of AMSU-A , MHS and IASI data assimilation on temperature and humidity forecasts with GSI – WRF over the western United States

Using NOAA’s Gridpoint Statistical Interpolation (GSI) data assimilation system and NCAR’s Advanced Research WRF (Weather Research and Forecasting) (ARWWRF) regional model, six experiments are designed by (1) a control experiment (CTRL) and five data assimilation (DA) experiments with different data sets, including (2) conventional data only (CON); (3) microwave data (AMSUA+MHS) only (MW); (4) infrared data (IASI) only (IR); (5) a combination of microwave and infrared data (MWIR); and (6) a combination of conventional, microwave and infrared observation data (ALL). One-month experiments in July 2012 and the impacts of the DA on temperature and moisture forecasts at the surface and four vertical layers over the western United States have been investigated. The four layers include lower troposphere (LT) from 800 to 1000 hPa, middle troposphere (MT) from 400 to 800 hPa, upper troposphere (UT) from 200 to 400 hPa, and lower stratosphere (LS) from 50 to 200 hPa. The results show that the regional GSI–WRF system is underestimating the observed temperature in the LT and overestimating in the UT and LS. The MW DA reduced the forecast bias from the MT to the LS within 30 h forecasts, and the CON DA kept a smaller forecast bias in the LT for 2-day forecasts. The largest root mean square error (RMSE) is observed in the LT and at the surface (SFC). Compared to the CTRL, the MW DA produced the most positive contribution in the UT and LS, and the CON DA mainly improved the temperature forecasts at the SFC. However, the IR DA gave a negative contribution in the LT. Most of the observed humidity in the different vertical layers is overestimated in the humidity forecasts except in the UT. The smallest bias in the humidity forecast occurred at the SFC and in the UT. The DA experiments apparently reduced the bias from the LT to UT, especially for the IR DA experiment, but the RMSEs are not reduced in the humidity forecasts. Compared to the CTRL, the IR DA experiment has a larger RMSE in the moisture forecast, although the smallest bias is found in the LT and MT.


Introduction
Instead of the random distribution and heterogeneous spatial density in the traditional conventional radiosondes, satellite observations provide a large amount of data covering worldwide areas in order to improve the initialization of the weather forecast models through a data assimilation system.Many studies have demonstrated that the assimilation of satellite data has significantly improved weather forecasts (Eyre, 1992;Andersson et al., 1991;Derber and Wu, 1998;Zhou et al., 2011), especially over some areas with sparse conventional observations (McNally et al., 2000;Zapotocny et al., 2008;Liu et al., 2012).For example, using the operational European Centre for Medium-Published by Copernicus Publications on behalf of the European Geosciences Union.

Y. Bao et al.: Impacts of AMSU-A, MHS and IASI data assimilation
Range Weather Forecasts (ECMWF) system, Andersson et al. (1991) pointed out that the forecast shows a negative impact of the satellite sounding data in the Northern Hemisphere, and a strong positive impact in the Southern Hemisphere.Based on the National Centers for Environmental Prediction (NCEP) Global Data Assimilation/Forecast System (GDAS/GFS), Zapotocny et al. (2008) found a positive forecast impact from both the conventional in situ and remotely sensed satellite data in both hemispheres.The positive forecast impacts from the conventional and satellite data are of similar magnitude in the Northern Hemisphere; however, the contribution to forecast quality from satellite data is considerably larger than the conventional data in the Southern Hemisphere.The importance of satellite data also generally increases at longer forecast times relative to conventional data.It is clear that satellite data assimilation plays an important role in the improvement of weather forecasts.
The Meteorological Operational satellite program (MetOp) launched its first polar-orbiting satellite (MetOp-A) on 19 October 2006.MetOp-A is in a sun-synchronous orbit, carrying a payload of 10 scientific instruments, including the Advanced Microwave Sounding Unit-A (AMSU-A), Microwave Humidity Sounder (MHS) and the new-generation Infrared Atmospheric Sounding Interferometer (IASI) to make atmospheric soundings at various altitudes.IASI (Clerbaux, et al., 2009) measures the radiance emitted from the Earth in 8461 channels covering the spectral interval 645-2760 cm −1 at a resolution of 0.5 cm −1 (apodized) and with a spatial sampling of 18 km at nadir.Limited spectral data are currently being transmitted, stored and assimilated.Rabier et al. (2002) compared a number of techniques for channel selection from high-spectral-resolution infrared sounders and concluded that the channel-selection method of Rodgers (1996Rodgers ( , 2000) ) is the optimal one.Collard (2007) applied his method to select a subset of 300 channels for data assimilation, so that the total loss of information for a typical numerical weather prediction (NWP) state vector consisting of one or more of temperature and/or humidity is minimized.
This study focuses on assessing the effects of hyperspectral infrared and microwave radiance data assimilation on the weather forecasts, especially on the different performance of vertical structures, based on AMSU-A, MHS and IASI radiance data.The model, data and methodology are presented in Sects. 2 and 3, respectively.Section 4 describes the results of experiments.The results are summarized and discussed in Sect. 5.

The GSI system for ARW-WRF regional model
The assimilation system used here is the Gridpoint Statistical Interpolation (GSI) analysis system, which was developed by United States National Centers for Environmental Pre-diction (NCEP).The current GSI regional analysis system accepts NCEP's Nonhydrostatic Mesoscale Model (NMM) WRF and NCAR's Advanced Research WRF (Weather Research and Forecasting) (ARW-WRF) regional model mass core (Liu and Weng, 2006a;Xu and Powell, 2012;Wan and Xu, 2011).The interfaces are specialized separately for the WRF NMM core and the WRF ARW core.The analysis system produces an analysis through the minimization of an objective function given by where x is the analysis state; B is the background error covariance matrix; x b is the first guess that comes from GFS 6 h forecast field in this study; H is the transformation operator from the analysis variable to the form of the observations; and y o is the observation, such as AMSU-A, MHS, IASI, etc.
The minimization algorithm with two outer iterations proposed in John Deber's report (Deber, 2012) has been verified and used in NOAA operational forecasts.Therefore, the minimization algorithm is used to account for weak nonlinearities in the cost function.In the first external iteration the first guess is a 6 h forecast, while in the second one it is the solution from the previous outer iteration.In the cost function B has been estimated from scaled differences between 24 and 48 h forecasts valid at the same time (Parrish and Derber, 1992).The observation error covariance matrix (R) contains information on the observational error and errors in representativeness, which were calculated before running the GSI.

Radiative transfer model
The radiative transfer model incorporated into the GSI data assimilation system at the NCEP is the Community Radiative Transfer Model (CRTM).The CRTM was developed by the United States Joint Center for Satellite Data Assimilation (JCSDA) for rapid calculations of satellite radiances based on radiative transfer (RT) theory (Han et al., 2006).The forward model, tangent-linear, adjoint and K-matrix models were also developed for the data assimilation of satellite data: CRTM is always updated for new satellite data.It supports a large number of sensors onboard geostationary and polarorbiting satellites, covering the microwave, infrared and visible frequency regions.
The CRTM comprises four major modules: (1) RT solution module, (2) atmospheric transmittance module, (3) surface emissivity/reflectivity module, and (4) particle-scattering module.Six RT solution schemes were tested in the CRTM (Weng et al., 2007).According to several performance factors, the advance doubling and adding scheme (ADA; Liu and Weng, 2006a) was selected for the CRTM implementation.In CRTM, a fast and optimal spectral sampling (OSS) absorption model (Moncet et al., 2004) is used to calculate atmospheric transmittance.

Experiment design
The objective of this study is to explore the effect of satellite data assimilation on the main atmospheric state forecast by comparing the results from microwave (AMSU-A and MHS), hyperspectral infrared radiance (IASI) and conventional data assimilation.Over the contiguous United States of America (USA), there are many conventional observation stations, which can be used to validate the forecast results.Therefore, the west coast region of the USA is selected as the experimental region.There was more satellite data coverage of the experimental region around 18:00 UTC than at other times, such as 00:00, 06:00 and 12:00 UTC.The covered region at 18:00 UTC is 20-55 • N and 85-155 • W, which includes the western USA and sea area near the west coast (Fig. 1).
The experiment design includes six simulations (Table 1).The control (CTRL) experiment is first made with an initial time at 18:00 UTC from 30 June to 30 July and makes 6 h forecasts.The five data assimilation (DA) experiments and the continued control experiment are made with initial time at 00:00 UTC from 1 to 31 July 2012 and make a 72 h forecast for each day.The initial condition in all six exper-iments is obtained from the 6 h forecasts of the first control experiment.The five DA experiments are made with different data sets, including conventional data (CON); microwave data (AMSU-A + MHS) (MW); infrared data (IASI) (IR); a combination of microwave and infrared data (MWIR); and a combination of conventional, microwave and infrared observation data (ALL).The initial condition and lateral boundary conditions came from the operational GFS forecast at 6 h intervals and 0.5 • × 0.5 • resolution, which were downloaded from the NCEP data inventory (ftp://ftp.ncep.noaa.gov/pub/data/nccf/com/gfs/prod/).
In the ARW model, the physics of the model includes the Goddard Cumulus Ensemble (GCE) microphysics scheme, Yonsei University planetary boundary layer (PBL) scheme, Noah land surface model, Rapid Radiative Transfer Model (RRTM) longwave radiation, and the Goddard shortwave radiation scheme (Xu et al., 2009).The 15 km WRF model forecast with a mesh size domain of 718 × 373 (Fig. 1) was used.Forty-three (43) vertical layers were selected for use with a model top of 10 hPa.  3 Data and methodology

Conventional and satellite data
In this study the conventional observation data include atmospheric temperature (T ), moisture (Q) and wind speed (WSP) at various pressure levels and pressure data at the surface that were downloaded from NCEP data inventory (ftp: //ftp.ncep.noaa.gov/pub/data/nccf/com/gfs/prod/). Figure 1a shows the distribution of the conventional data on 1 July 2012 where the atmospheric temperature, moisture and surface pressure observations are rare.Most of atmospheric temperature and moisture observations are conducted at the surface level in the pressure range of 1000-1200 hPa.Most of the WSP data are found over the sea close to the west coast of the United States.
The satellite data include the AMSU-A, MHS and the new-generation IASI. Figure 1b shows the distribution of the AMSU-A, MHS and IASI data sets acquired at about 18:00 UTC on 1 July 2012.AMSU-A is a 15-channel crosstrack, stepped-line scanning, total power microwave radiometer.In this study the channels from 4 to 14 are assimilated, which were designed to detect atmospheric temperature at 11 layers from the surface to around 45 km.Their weighting function is illustrated in Fig. 2a.MHS on the other hand probes at millimetric frequencies between 89 and 183 GHz; channels 2 to 5 are assimilated, which were designed to detect atmospheric moisture at two layers from surface to around 400 hPa.Their weighting function is illustrated in Fig. 2b.Channel 4 of AMSU-A and channel 2 of MHS can detect the atmospheric temperature and humidity at the lowest layer of the troposphere.Channels 5 and 6 of AMSU-A and channels 3, 4 and 5 of MHS can represent the atmospheric temperature and humidity in the middle atmospheric layer of the troposphere.Channel 7 of AMSU-A can indicate the atmospheric temperature in the highest layer of troposphere.Channels 9 and 10 of AMSU-A can detect the atmospheric temperature in the lower layer of the stratosphere.The IASI instrument covers the spectral range from the thermal infrared at 3.62 µm (2760 cm −1 ) to 15.5 µm (645 cm −1 ), covering the peak of the thermal infrared and particularly the CO 2 band with the humidity (Q) branch around 666 cm −1 .Within these bands, the selected 279 bands (Table 2) correspond to atmospheric temperature and humidity.A band number smaller than 515 represents atmospheric temperature, and a band number larger than 2701 represents atmospheric humidity.Their weighting function is illustrated in Fig. 2c.

Radiance data quality control and bias correction
The radiance data have been preprocessed by NOAA's Satellite and Information Service (NESDIS) before becoming available for usage.The data have been statistically limb-corrected (adjusted to nadir) and surface-emissivitycorrected in the microwave channels and cloud-cleared in the tropospheric channels.Although the satellite data have undergone preprocessing, they need further bias correction before being ingested into the data assimilation system.The source of the biases can be related to instrument calibration problems, and predictor and zenith angle bias.It has been demonstrated that a successful bias correction scheme must take into account the spatially varying and air-massdependent nature of radiance biases (Kelly and Flobert, 1988;McMillin et al., 1989;Uddstrom, 1991).Eyre (1992) and Harris and Kelly (2001) categorized the bias into two types: scan bias and air-mass bias, and they presented a bias correction scheme.GSI uses this bias correction scheme to correct radiance bias.The radiance bias correction coefficients may be downloaded from the GDAS data directory (ftp://ftp.ncep.noaa.gov/pub/data/nccf/com/gfs/prod/),and it can be used to correct the radiance bias in GSI.For that purpose in this study monthly regional mean innovations, e.g., observation minus background (OMB) and observation minus analysis (OMA), are calculated with or without bias corrections.For example, Fig. 3 shows the scattering plots of surface pressure (Fig. 3a), atmospheric temperature at the height of 2 m (Fig. 3b) and wind speed at the height of 10 m (Fig. 3c) between OMB and OMA in the ALL experiment.The result shows that the slope of the simulated line is less than 1, which indicates the analysis fields are closer to observation than background fields.

Methodology
In order to evaluate the effects of radiance data assimilation on temperature and moisture at the different vertical layers, the surface (SFC) and four atmospheric layers are examined.The four layers include lower troposphere (LT) from 800 to 1000 hPa, middle troposphere (MT) from 400 to 800 hPa, upper troposphere (UT) from 200 to 400 hPa and lower stratosphere (LS) from 50 to 200 hPa.Similar to a previous study (Xu, et al., 2009), two statistical variables -bias and root mean square errors (RMSEs) -are investigated.1.
If X represents any of the parameters under consideration for a given time and vertical level, then the forecast error is defined as X = X f −X o , where the subscripts f and o denote forecast and observed quantities, respectively.Given N valid pairs of forecasts and observations, the bias is computed as bias the RMSE is computed as (2) The bias and RMSE at 00:00 and 12:00 UTC are calculated because more than enough observational data and approximately 3000 sounding stations can be used at the two times.

Impact of DA on temperature
At the SFC, the CON DA experiment shows (Fig. 4a) the smallest bias value in all six experiments.The three involved infrared satellite DA experiments (IR, IR + MW, IR + MW+CON) show a larger bias than the CTRL experiment.For the first 24 h, it seems that satellite radiance DA, especially for the infrared IASI data, gives a negative contribution to the temperature forecasts.In additon, the bias characterized a diurnal cycle feature for the 72 h forecasts, with the smaller bias appearing at 6, 30, 54 and 72 h, corresponding to 16:00 LT, while the higher bias appeared at 18, 42 and 66 h, corresponding to 04:00 local time.
Compared to the SFC, the LT shows a more clear diurnal variation (Fig. 4b), and all model forecasts underestimated the observed temperature.The CTRL and CON experiments obtained the smallest forecast bias.
Different from the SFC and LT, the diurnal variation of bias disappeared in the MT (Fig. 4c).Compared to the CTRL experiment, the bias is significantly reduced in all DA experiments, especially for the two combination experiment (MWRI and ALL); the bias is almost zero within the 30 h forecast.It implies that both MW (AMUS-A and MHS) and IR (IASI) DA experiments give a positive contribution to the accuracy of temperature forecasts in the MT.
In the UT, the smaller bias appeared in the CON and MW DA experiments (Fig. 4d), and the combination DA experiments (MWIR and ALL) show a larger bias than the CTRL experiment.The results indicate that the IR DA gave a negative contribution to the temperature forecasts and that the MW experiment improved the forecast accuracy in the UT.
In contrast, the bias in the LS indicates an opposite pattern to the SFC and LT, where all satellite DA experiments reduced the forecast bias (Fig. 4e).The result demonstrated that the conventional DA did not improve the forecasts because of the sparse observational data used in this layer.The MW DA obtained the smallest bias in the LS.
In order to clearly understand the different performance in the six experiments, the temperature forecast bias profile at 6, 30 and 54 h has been examined.Figure 5 indicates a similar pattern at the three forecast times, where the lower bias can be found at the SFC and in the MT while the larger bias appeared in the UT and LS.Generally, the model forecasts overestimated the observed temperature except in the LT.Compared to the CTRL experiment, the four satellite DA experiments (MW, IR, MWIR and ALL) show a smaller bias from the MT through LS, but the forecasts did not improve in the LT below 800 hPa.In contrast, the CON experiment has better performance in the LT, especially at the SFC.
It is obvious that the larger bias in temperature forecast appeared in the LT, UT and LS, but the model is underestimating the observed temperature in the LT and overestimating in the UT and LS (Fig. 5).The satellite DA, especially for the MW DA experiment using AMSU-A, reduced the forecast bias at the levels from the MT to LS.Meanwhile, the CON DA has a smaller forecast bias in the LT, especially at the SFC.Note the IR experiment using the IASI data produced a worse result in the LT.
The forecast RMSE demonstrated some different features (Fig. 6).First, the RMSE reduced the diurnal variation and significantly increased with the extended length of forecast time at the SFC.The RMSE in the CON and MW experiments is slightly less than that in the CTRL experiment and the other three satellite DA experiments within 24 h forecasts (Fig. 6a).Second, consistent with the larger negative bias in all the satellite DA experiments (Fig. 4b) in the LT, larger RMSEs are observed in these DA experiments (Fig. 6b) compared to the CRTL.Third, different from the smaller bias in the DA experiments, the larger RMSEs are maintained in the DA experiments in the MT (Fig. 6c).Fourth, the CON and MW experiments improved the temperature forecasts in the UT (Fig. 6d).But in the LS, the microwave DA experiments -including MW, MWIR and ALL -indicate smaller RM-SEs than the CTRL experiment (Fig. 6e).It is apparent that the CON DA gave a negative contribution to the temperature forecast in the LS.
Corresponding to the bias profile (Fig. 5), the forecast RMSE profile at 6, 30 and 54 h indicates (Fig. 7) that the smallest RMSE is observed at the MT and the largest RMSE appeared in the LT and SFC.Compared to the CTRL experiment, the smaller RMSEs are only found in the MW experiment in the UT and LS, and the CON DA gave a positive contribution at the SFC and in the UT.
The results clearly show that the IR DA experiment gives a negative contribution to the temperature forecast in the regional system.But the MW DA experiment shows a positive impact at the LS, and the CON experiment displays better performance at the SFC and in the UT.It is worth noticing that the RMSE is not always consistent with the bias in the temperature forecasts; for example, the smaller bias appeared at the SFC, while a larger RMSE is observed there.

Impact of DA on humidity
Similar to the temperature forecasts at the SFC, the diurnal variation of the moisture bias is observed and the smallest bias appeared in the CON and CTRL experiments within the 42 h forecast (Fig. 8a), with largest bias occurring in the MWIR experiment at 18 h.It is clear that all four satellite DA experiments do not improve the moisture forecast compared to the CTRL experiment.In contrast, the IR DA produced a larger bias significantly different from the other ex-  periments in the entire troposphere (Fig. 8b-d).It seems to tell us that the IR DA significantly impacts the humidity forecasts in the troposphere.However, the impact disappeared in the LS (Fig. 8e).
Compared to the bias profile of the temperature forecast (Fig 4), all model runs overestimated the observed humidity except for the UT.The smallest bias in the humidity forecast occurred at the SFC and in the UT (Fig. 9).Most of the DA experiments apparently reduced the bias from LT to UT, es- .Other definitions can be found in Table 1.pecially for the IR experiment.But it is worth noting that the MW DA has a larger bias than the CTRL experiment in the whole troposphere.
However, the RMSE in the humidity forecasts (Fig. 10) increases from the SFC to LS.The largest error in the UT and LS is almost double the amount at the SFC.In addition, most of the DA experiments demonstrated a larger RMSE than that in the CTRL experiment.In other words, the DA experiments gave a negative contribution to the humidity forecasts.The IR DA experiment did not improve moisture forecast, although its bias is very small at the LT and MT.

Summary
In this study six experiments were designed to assess the effects of data assimilation on atmospheric temperature and moisture forecasts over the western United States.The results are summarized as follows.
The regional model underestimates the observed temperature in the LT and overestimates it in the UT and LS.The MW experiment reduced the forecast bias from the MT to LS, and the CON DA obtained a smaller forecast bias in the LT, especially at the SFC.But the IR experiment using the IASI data obtained the largest bias in the LT.
However, the RMSE is not always consistent with the bias profile in the temperature forecasts: in fact, the RMSE profile shows that the largest RMSE appeared in the LT and the smallest error in the MT.Compared to the CTRL experiment, the smaller RMSEs are only found in the MW experiment in the UT and LS, and the CON DA gave a positive contribution at the SFC and in the UT.The IASI DA experiment has a negative impact on the temperature forecast in the regional forecast system.
In contrast, all model forecasts overestimated the observed humidity except in the UT.The smallest bias in the humidity forecast occurred at the SFC and in the UT.Most of the DA experiments apparently reduced the bias in the LT to UT, especially for the IR DA experiment.But the MW DA obtained a larger bias than the CTRL experiment in the entire troposphere.
The RMSE in the humidity forecasts increases from the SFC to the LS, which is similar to the bias profile except in the UT.The largest error in the UT and LS is almost double the amount at the SFC.The DA experiments give a limited contribution to the humidity forecasts.The IR DA experiment does not improve the moisture forecast, although its smallest bias is found in the LT and MT.

Discussion
This is a study using the WRF-ARW mesoscale model linked to GSI data assimilation system to explore the impact of AMSU-A-MHS and IASI radiance data assimilation on the temperature and humidity forecasts in the different vertical layers over the west coast of United States.Due to the complexity of measurements for satellite instruments (such as the IASI with 8461 channels) and lack of knowledge in the estimation of impacts of those data sets in this regional area, forecasters should be aware of the limitations of this data assimilation when forecasting in this region.
The results show that the bias and forecast error are substantially related to the vertical layer of the objective.For example, the AMSU-A data assimilation reduced the temperature forecast bias in the upper atmospheric layers, and the conventional data assimilation indicates the best performance in the lower layer, but the IASI data assimilation shows worst performance in the lower layer.Compared to the largest bias in the upper atmospheric layer, the largest RMSE appeared in the lower atmospheric layer.For the humidity forecast there is a different behavior: the IASI data assimilation significantly reduced the bias in the troposphere, but the RMSE tells us that the IASI data assimilation does not improve the moisture forecast in this layer.The reason is very complicated; it is partially attributed to the data selection in the processes of data assimilation.IASI data have 8461 channels, but only 279 of those channels were used, based on pre-vious studies.Until now, it is not clearly understood what the main reason is for the different performances since many factors have contributed to the overall result.More experiments are necessary as part of our future study to try to understand the contributions from the various factors and components.The results shown in this analysis demonstrate the partial impact of satellite data on temperature and humidity forecasts in this region, but the positive or negative impact depends on the atmospheric layer and forecasts variables.
It is worth noting that the results presented here are based on one month's forecasts with three satellite instruments.The model performance needs to be examined with longer experiments and more data selection that extend to all available satellite data sets and more experiments from the different areas.As expressed by Manning and Davis (1997), "These statistics would provide additional information to model users and alert model developers to those research areas that need more attention."

Figure 3 .
Figure 3.The scattering plot between observation minus background (OMB) and observation minus analysis (OMA) in the all-data (CON + AMSU-A + MHS + IASI ) experiment (a: surface pressure; b: atmospheric temperature at the height of 2 m; c: wind speed at the height of 10 m) for 1 July 2012.

Figure 5 .
Figure 5. Bias profile of the temperature (T ) forecasts at (a) 6 h, (b) 30 h, (c) 54 h forecasts.Unit: • C. Other definitions are the same as in Fig. 4.

Figure 7 .
Figure 7.The RMSE profile of the temperature forecasts at (a) 6 h , (b) 30 h, (c) 54 h forecasts.Unit: • C. Other definitions are the same as in Fig. 4.

Figure 9 .
Figure 9. Bias profile of the specific humidity forecasts at (a) 6 h, (b) 30 h, (c) 54 h forecasts.Unit: g kg −1 .Other definitions are the same as in Fig. 4.

Figure 10 .
Figure 10.The RMSE profile of the specific humidity forecasts at (a) 6 h , (b) 30 h, (c) 54 h forecasts.Unit: g kg −1 .Other definitions are the same as in Fig. 4.

Table 1 .
The experiment design includes six simulations (EXP1-EXP6).All experiments are made from 30 June to 31 July 2012 and make a 72 h forecast for each day.IR) Conventional + AMSU-A + MHS + IASI data assimilation 00:00 UTC from 1 to 31 July

Table 2 .
Listed below are the 279 channels in IASI corresponding to atmospheric temperature and humidity.The numbers indicate the order in which the channels were chosen in current data assimilation.