Journal topic
Atmos. Meas. Tech., 13, 2099–2117, 2020
https://doi.org/10.5194/amt-13-2099-2020
Atmos. Meas. Tech., 13, 2099–2117, 2020
https://doi.org/10.5194/amt-13-2099-2020

Research article 23 Apr 2020

Research article | 23 Apr 2020

# Shallow cumuli cover and its uncertainties from ground-based lidar–radar data and sky images

Shallow cumuli cover and its uncertainties from ground-based lidar–radar data and sky images
Erin A. Riley1, Jessica M. Kleiss1, Laura D. Riihimaki2,3, Charles N. Long2,3, Larry K. Berg4, and Evgueni Kassianov4 Erin A. Riley et al.
• 1Environmental Studies, Lewis & Clark College, Portland, OR 97219, USA
• 2Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, CO 80309, USA
• 3NOAA Global Monitoring Laboratory, Boulder, CO 80305, USA
• 4Atmospheric Sciences and Global Change Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA

Correspondence: Jessica M. Kleiss (jkleiss@lclark.edu) and Evgueni Kassianov (evgueni.kassianov@pnnl.gov)

Abstract

Cloud cover estimates of single-layer shallow cumuli obtained from narrow field-of-view (FOV) lidar–radar and wide-FOV total sky imager (TSI) data are compared over an extended period (2000–2017 summers) at the established United States Atmospheric Radiation Measurement mid-continental Southern Great Plains site. We quantify the impacts of two factors on hourly and sub-hourly cloud cover estimates: (1) instrument-dependent cloud detection and data merging criteria and (2) FOV configuration. Enhanced observations at this site combine the advantages of the ceilometer, micropulse lidar (MPL) and cloud radar in merged data products. Data collected by these three instruments are used to calculate narrow-FOV cloud fraction (CF) as a temporal fraction of cloudy returns within a given period. Sky images provided by TSI are used to calculate the wide-FOV fractional sky cover (FSC) as a fraction of cloudy pixels within a given image. To assess the impact of the first factor on CF obtained from the merged data products, we consider two additional subperiods (2000–2010 and 2011–2017 summers) that mark significant instrumentation and algorithmic advances in the cloud detection and data merging. We demonstrate that CF obtained from ceilometer data alone and FSC obtained from sky images provide the most similar and consistent cloud cover estimates; hourly bias and root-mean-square difference (RMSD) are within 0.04 and 0.12, respectively. However, CF from merged MPL–ceilometer data provides the largest estimates of the multiyear mean cloud cover, about 0.12 (35 %) and 0.08 (24 %) greater than FSC for the first and second subperiods, respectively. CF from merged ceilometer–MPL–radar data has the strongest subperiod dependence with a bias of 0.08 (24 %) compared to FSC for the first subperiod and shows no bias for the second subperiod. The strong period dependence of CF obtained from the combined ceilometer–MPL–radar data is likely results from a change in what sensors are relied on to detect clouds below 3 km. After 2011, the MPL stopped being used for cloud top height detection below 3 km, leaving the radar as the only sensor used in cloud top height retrievals. To quantify the FOV impact, a narrow-FOV FSC is derived from the TSI images. We demonstrate that FOV configuration does not modify the bias but impacts the RMSD (0.1 hourly, 0.15 sub-hourly). In particular, the FOV impact is significant for sub-hourly observations, where 41 % of narrow- and wide-FOV FSC differ by more than 0.1. A new “quick-look” tool is introduced to visualize impacts of these two factors through integration of CF and FSC data with novel TSI-based images of the spatial variability in cloud cover. The influence of cloud field organization, such cloud streets parallel to the wind direction, on narrow- and wide-FOV cloud cover estimates can be visually assessed.

1 Introduction

Shallow cumuli (ShCu) have a number of important roles in the Earth's climate system due to their complex interactions with radiation, the atmosphere and the surface (e.g., Arakawa, 2004; Vial et al., 2017; Park and Kwon, 2018). For example, the amount of surface moisture can influence the cloud properties by altering the humidity of the boundary layer and by modifying the partitioning of the sensible and latent heat fluxes (Zhang and Klein, 2013; Berg et al., 2013), while the presence of ShCu provides a negative feedback by shading the surface and reducing its solar heating (Berg et al., 2011; Xiao et al., 2018). The ShCu exhibit strong spatial and temporal variability, which has been a topic of increasing interest in recent years for both observational (Berg and Kassianov, 2008; Chandra et al., 2013) and model (Yun et al., 2017; Angevine et al., 2018) studies. To improve our understanding of cloud cover variability and its impact on the intricate cloud–atmosphere–surface interactions, both long-term and detailed diurnal observations of the ShCu properties are highly desirable. There are two conventional measurement-based estimates of cloud cover: (1) cloud fraction (CF) obtained from zenith-pointing narrow-FOV observations and defined as the fraction of time when a cloud is detected within a specified period and (2) fractional sky coverage (FSC) obtained from wide-FOV observations and defined as the fraction of cloudy pixels in a sky image. Note that FSC is similar to that estimated by a cloudy-sky observer (e.g., Henderson-Sellers and McGuffie, 1990; Kassianov et al., 2005; Long et al., 2006).

Long-term measurement-based statistics of ShCu have been employed for model–observation comparison (Zhang et al., 2017; Endo et al., 2019). The observational studies take advantage of the synergistic use of ground-based observations from the ceilometer, micropulse lidar (MPL) and millimeter cloud radar at the Atmospheric Radiation Measurement (ARM) mid-continental Southern Great Plains (SGP) site to obtain vertically resolved hydrometeor layers with high temporal resolution (https://www.arm.gov/, last access: 20 September 2018). A merged lidar–radar data product is available from November 1996 to the present at the SGP site and has served as a basis for developing ShCu climatology at the SGP site (Berg and Kassianov, 2008; Zhang and Klein, 2013) and observational test beds for model evaluation (Zhang et al., 2017).

Efforts in improving both ground-based observations and modeling of continental ShCu at the SGP site are currently underway (e.g., Gustafson et al., 2017; Zhang et al., 2017). Zhang et al. (2017) suggested that, on average, the modeled areal cloud cover tends to underestimate observed CF substantially (up to 0.1 or 65 % for clouds with vertical extent greater than 0.3 km). Although several potential factors for the obtained model–data discrepancy have been considered, including different model setups and selection of events with different vertical cloud extents, the role of observational uncertainties has not been directly addressed. For example, FSC obtained from the wide-field-of-view (FOV) total sky imager (TSI) data as the fraction of cloudy pixels in a hemispherical image provides a better agreement with model outputs than the narrow-FOV CF from merged ceilometer–MPL data (Gustafson et al., 2018), indicating a potential consequence of the FOV configuration. In addition, long-term averages of CF obtained from merged ceilometer–MPL data tend to be larger than FSC acquired from collocated TSI observations (Boers et al., 2010; Qian et al., 2012; Wu et al., 2014; Kennedy et al., 2014), indicating a potential consequence of instrument-dependent cloud detection differences. Moreover, sampling of large eddy simulation (LES)-generated cloud fields by a virtual instrument can be a helpful way to reconcile debated differences between the retrieved and predicted values of cloud cover (Oue et al., 2016).

It is expected that the impacts of instrument-dependent cloud detection and FOV configurations are entangled, and our study aims to assess their relative importance on cloud cover estimates of single-layer continental ShCu observed at the SGP site. In 2010, upgrades of three instruments (TSI, MPL and radar) took place, and data merging improvements were implemented on the ceilometer–MPL and ceilometer–MPL–radar (hereafter lidar–radar) merged datasets. Our study uses the natural separation provided by these upgrades and improvements to assess the relative impact of instrument-dependent cloud detection on the ShCu cover derived from three commonly used CF calculations and FSC. Our study also uses wide-FOV and narrow-FOV TSI-based observations to assess the relative impact of the FOV configuration on FSC. Two main questions guide our investigation: (1) have significant changes in the observations of ShCu cover occurred at the SGP site due to instrumental and algorithmic upgrades? (2) what is the impact of FOV configurations on hourly and sub-hourly observations of ShCu cover? These questions are addressed using statistical approaches as well as a new tool that integrates the narrow- and wide-FOV observations to visualize the two considered impacts for a given day of interest at user-specified temporal and spatial scales.

2 Data

Eighteen years (2000–2017) of summertime (May–September) observations of FSC, cloud base height (CBH), cloud top height, and wind speed and direction were collected at the ARM SGP Central Facility. The data described below are available at https://www.arm.gov/data (last access: 20 September 2018), and references to detailed instrument descriptions and records are in Table A1. Here is a short summary of data used in our analysis, and Appendix A contains all pertinent information.

1. Merged lidar and lidar–radar data products. A time-series of CBH and cloud top height values are obtained from the combined 34.86 GHz millimeter cloud radar (MMCR), MPL and ceilometer data in the Active Remotely-Sensed Cloud Locations (ARSCL) value-added product (data reference: Johnson and Giangrande, 1996). The merged dataset improves upon the MMCR vertical profiling of hydrometeors by employing an MPL cloud mask to identify CBH and aide in mitigating the significant impact of insect returns (Clothiaux et al., 2000). The screened high temporal (10 s) and vertical (45 m) radar reflectivity best estimate is used to identify the vertical distribution of hydrometeor layers above the SGP. The CBH best estimate is obtained from merged ceilometer–MPL data. A cloud top height estimate is derived from merging the radar reflectivity best estimate, CBH best estimate and MPL cloud mask. In 2011 the MMCR was replaced and renamed to Ka-band Zenith Radar (KAZR) (Kollias et al., 2016), and ARSCL was subsequently updated and renamed to KAZRARSCL (data reference: Johnson et al., 2011). A summary of the pertinent instrumental and algorithmic differences to the results herein for the periods before and after 2011 are provided in Appendix A.

2. Ceilometer. A time series of CBH values is obtained from the ceilometer (data reference: Ermold and Morris, 1997). The ceilometer records up to three cloud bases at heights up to a maximum of 7.7 km and 16 s resolution. Only the lowest cloud-base is used here.

3. Total sky imager cloud mask. In addition to the sky image, the TSI processing produces a cloud mask (data reference: Morris, 2005) every 30 s that classifies each image pixel as clear, opaque or thin cloud and identifies a 25 FOV circumsolar region and an inner zenith region (Long et al., 2006). The percent error in FSC retrieval is estimated to be 10 % for 95 % of observations (Long et al., 2001).

4. ShCu event selection. The newly released ARM Shallow Cumulus data product (data reference: Shi et al., 2000) identifies times of ShCu from lidar/radar cloud boundary heights and includes FSC from TSI observations (Sunny Lim et al., 2019). The dataset provides hourly flags for the presence of ShCu and a flag indicating the additional presence of other cloud types, such as commonly overlying cirrus that can be either intermittent or persistent throughout the day. From the Shallow Cumulus data product, we initially select days with single-layer (no upper-level) ShCu with at least 2 h duration. Once selected, all hours in this day with ShCu are included, additionally extending the start and end times by 1 h each. The extension allows for more accurate determination of the start and end times of the event on the finer timescale of the TSI FSC (15 min). Quality control procedures (Sect. 3.4) are used to censor multilayer clouds and clear sky conditions on the 15 min and hourly observations of cloud cover. The initial selection provided 614 candidate days with a total of 2393 h.

5. 915 MHz radar wind profiler. Wind data are obtained at CBH from the low power setting of the 915 MHz radar wind profiler which has a range-gate spacing of 60 m and hourly reporting intervals (data reference: Muradyan and Coulter, 1997). In the event of missing data at CBH, the first available reading above 500 m is used. In the event of missing data, 10 m surface wind data are used as a tertiary means to impute wind data (data reference: ARM, 1994). The wind data are used qualitatively in this analysis.

3 Methods

We calculate five estimates of the cloud cover from active and passive observations. Three estimates define narrow-FOV cloud fractions (CFs) retrieved from the lidar–radar observations (Sect. 3.1) to assess the impact of the instrument-dependent cloud detection and data merging on cloud cover estimates (Sect. 4.1). Two estimates define fractional sky cover (FSC) with 100 FOV (Sect. 3.2) and narrow-FOV (3 pixels × 3 pixels within 100 FOV) (Sect. 3.3) obtained from TSI observations to estimate the impact of the FOV configuration on CF and FSC comparisons (Sect. 4.2). The resulting five ShCu cloud cover estimates are then combined, and data selection and quality control procedures are applied to the dataset (Sect. 3.4). The cloud cover estimates are compared for the entire period (2000–2017) and two subperiods (2000–2010 and 2011–2017) separated by instrumental and algorithmic upgrades (Sect. 4). Finally, a heuristic tool is developed to visualize impacts of the cloud detection, data merging and FOV configuration on the cloud cover estimates (Sect. 4.3).

Table 1Variants of cloud fraction (CF) products used in this work: contributing instruments, thresholds on cloud base height (CBH) and cloud top height (CTH).

* In 2011–2017 period, the MPL is not used to determine cloud top height below 3 km (Sect. A4).

## 3.1 Narrow-FOV cloud fraction (CF)

Narrow-FOV observations of CF, CBH and wind are computed for two averaging periods: (1) a 30 min period centered on the 15 min averaging time for FSC, meaning the CF time period begins 7.5 min before the FSC averaging time-bin and ends 7.5 afterwards; and (2) a 60 min period. The extended observation time (30 min instead of 15 min) for the CF measurement is aimed to compensate for the additional sky area observed by the wide-FOV TSI during a 15 min period. Hourly CF observations and FSC averages are computed over identical averaging periods for consistency with previous studies.

## 3.2 Wide-FOV fractional sky cover (FSC)

FSCs obtained from TSI observations within the 100 and 160 FOV are available from routine TSI processing. The 100 FOV is used in this analysis because it was previously established to best correspond to nadir observations due to a reduced impact of cloud sides on estimated cloud cover (Kassianov et al., 2005). Prior to 2005, the TSI used 20 FOV rather than 100; therefore, we calculate 100 FOV FSC for an additional period (2000–2005) directly from the TSI cloud mask as the fraction of thin and opaque pixels over total number of pixels. In our calculations, we remove a 25 FOV ellipse surrounding the sun location for consistency with the routine TSI processing. Our product compares well with the available TSI product; only 2 % of the calculated FSCs exceed the processed FSCs by more than 0.05. The total (opaque + thin) FSC is calculated for each cloud mask and then averaged in nonoverlapping 15 min and 60 min intervals. Note that 15 min is consistent with the expected decorrelation time of a ShCu field (Kassianov et al., 2005). The TSI does not contain CBH information; therefore selection of single-layer ShCu time periods requires the joint use of active sensors.

## 3.3 Narrow-FOV FSC (“CF-like”)

A narrow-FOV FSC is calculated from the TSI cloud mask with the sampling, sensitivity and processing of the wide-FOV FSC. These calculations are aimed to mimic the narrow-FOV lidar–radar measurements (“CF-like”). The narrow-FOV represents a 3 pixel × 3 pixel region located at a zenith angle of 20 and azimuthal angle of 315 from each TSI cloud mask. There are three main reasons for selection of this region. This region (1) is close to zenith (the zenith region is obstructed by the camera box), (2) is located outside of the sun circle region during the study hours (local daylight 09:00–18:00) and (3) corresponds to the best agreement with the wide-FOV FSC observations (Wagner and Kleiss, 2016). The TSI image resolution has been increased from 352 pixels × 288 pixels (original resolution) to 480 pixels × 640 pixels (improved resolution) in August 2011. The narrow-FOV (3 pixel × 3 pixel region) “CF-like” observation has 4.1 (original) and 2.3 (improved) angular resolution. For comparison, the narrow-FOV ARSCL cloud products, including the lidar–radar CF, have much finer angular resolution (about 0.2). The corresponding spatial resolution of the lidar–radar CF can be estimated by multiplying wind speed at cloud base height by lidar–radar dwell time (about 10 s). For example, its spatial resolution is about 100 m for 10 m s−1 wind speed.

## 3.4 Data selection and quality control procedures

Data selection and quality control procedures are applied to the averaged and merged dataset. The scatterplots for 30 min CF and 15 min FSC (Fig. B1) illustrate application of the quality control criteria, while resulting data completeness is provided in Table B1. These screening criteria are executed on the times with ShCu identified in Sect. 2 and includes the following steps:

1. Periods with clear (averaged FSC < 0.05) or overcast (averaged FSC > 0.95) conditions are removed from this analysis for all variables.

2. Periods with multiple cloud layers are screened using merged ceilometer–MPL data where the CF from all clouds is required to be within 0.1 of the ShCu fraction (Berg et al., 2011) (Fig. B1a)

3. Individual 10 s values from the merged ceilometer–MPL product that were flagged as suspect are excluded from the analysis. Incomplete data contribute to errors in the CF calculation (Fig. B1b), and so only periods with 100 % available data are used in the analysis.

4. Periods with questionable TSI retrievals are censored using a threshold on allowed thin clouds. The thin cloud classification of the TSI represents an uncertain pixel, which is “not really clear or cloudy” and typically appears as a thin border surrounding the bright and easily visible ShCu clouds. However, camera degradation and normal automated white balance adjustments of the camera can lead to excessive thin cloud as well as erroneous opaque regions. Manual inspection of these time periods results in a QC criterion that discards intervals with thin cloud fraction greater than 0.3 (Fig. B1c).

5. Additional requirement for comparisons using the ceilometer dataset. Individual observations in which the quality-control flag was marked as good are retained, and those with faults are censored from the dataset. Averaging periods with missed or censored ceilometer data are discarded. The 10-year (2000–2010) period has a substantial (∼40 %) fraction of missed ceilometer data (Table B1).

Initial selection criteria for the partially cloudy events with single-layer ShCu (criteria 1 and 2) reduced the dataset from 614 to 609 d. Quality control (steps 3 and 4) on the 15 min averaged data further reduced it to 569 d and from 2393 to 2048 h (Table B1). It is important to note that this dataset is not strictly selective against cases that transitioned from another cloud type (i.e., stratus to cumulus), nor is it selective against cases that are driven by large-scale weather systems.

Table 2Total number of observations (N) and summary statistics for cloud amount estimated by five methods for three periods of interest (second–fourth rows) at coarser temporal scales: 60 min FSC, “CF-like” and CF. These statistics include the mean (second column) and root mean-square difference (RMSD) (third column). Values different from FSC by more than 0.05 are bold.

Each 60 min period excludes clear and overcast conditions using criteria 0.05 < FSC < 0.95. * Note that the ceilometer data in 2000–2010 had a large percentage of missing during quality control (see Table B1) and that the mean is calculated from a subset of times reflected by Nceil. CF abbreviations defined in Table 1.

Table 3The same as Table 2, except for fine temporal scales: 15 min FSC and 30 min “CF-like” and CF.

Each 15 min period excludes clear and overcast conditions using the following criteria 0.05 < FSC < 0.95. *This value is calculated from a much smaller subset of times, Nceil. CF abbreviations defined in Table 1.

4 Results and discussion

## 4.1 Cloud detection and data merging

Instrument-dependent values of the estimated CF (Table 1, Sect. 3.1) and the narrow- and wide-FOV FSCs (Sect. 3.2, 3.3) are generated for the whole 18-year period (2000–2017) and two subperiods (2000–2010 and 2011–2017). The comparison of the cloud cover estimates from the subperiods illustrates the joint and individual impacts of instrumental upgrades (Sects. A2–A3) and data merging schemes (Sect. A4) on the long-term statistics of cloud cover. The comparison includes both 15 min (e.g., Lareau et al., 2018) and 60 min (e.g., Zhang et al., 2017) averaging windows to assess the impact of the shorter averaging time on the CF-FSC agreement.

We begin our investigation of differences in cloud cover estimates by comparing mean values (Tables 2 and 3). The 60 min average FSCs obtained for the two subperiods are comparable (0.34 vs. 0.33), and their distributions are very similar (comparison not shown), indicating weak changes in the mean ShCu cloud cover for the two subperiods. The ceilometer-based CF supports this interpretation as mean values show good agreement with FSC, particularly for the later subperiod (2011–2017) where the ceilometer data have good completeness. However, the mean CF values calculated from the merged instruments are larger than FSC values by 0.12 for the ceilometer–MPL and 0.08 for the merged lidar–radar in the early subperiod (2000–2010). The larger values of the CF from the merged data in comparison with the ceilometer and FSC in the early subperiod are consistent with previous studies which focus on all cloud types and sky conditions (Boers et al., 2010; Qian et al., 2012; Wu et al., 2014; Kennedy et al., 2014). However, the FSC–CF correlation obtained for the single-layer ShCu is greatly improved over those from the previous studies with all cloud types. For example, Wu et al. (2014) reported a correlation coefficient of 0.54 comparing hourly CF and FSC when ignoring clear and overcast conditions. Here, the correlation coefficients for hourly data are much higher (0.8–0.84) (Table B3). For the later subperiod (2011–2017), the mean CF from the merged instruments decreases by 0.05 for the combined ceilometer–MPL and 0.09 for the combined lidar–radar. As a result, the combined lidar–radar CF shows an improved agreement with the FSC, whereas the CF from ceilometer–MPL is still larger than the FSC by 0.08 (Table 2).

The changes in mean values of the merged instrument CFs between the two subperiods are examined in the 1D and 2D histograms in Fig. 1. The figure shows strong correlation between merged ceilometer–MPL and merged lidar–radar CFs in the early subperiod, indicating that the cloud top height threshold employed has little influence on cloud fraction. In contrast, in the later subperiod the RMSD between the CFs is twice as large, and the correlation is only moderate, indicating a higher influence of the cloud top height criteria. The 1D histograms (Fig. 1a, b) show that both variables shifted to have more observations of low CF in the second subperiod, although this is most pronounced for the lidar–radar. These results suggest that changes in merging the ceilometer–MPL cloud base height detection impacted CF moderately, whereas changes in cloud top height detection in the merged lidar–radar data created significant differences in CF between the two periods.

Figure 1Relationship between CF from merged ceilometer–MPL (clouds identified by CBH) and from merged lidar–radar (clouds identified by both CBH and cloud top height). (a, b) Histograms of CF from merged ceilometer–MPL (blue) and merged lidar–radar (red). (c, d) Joint histograms (counts) for the two methods, magenta line is 1:1. (a, c) Present data obtained from 2000 to 2010 (N=4628), where the radar and MPL data are used jointly to determine cloud top height (Table 1). (b, d) Present data obtained from 2011 to 2017, (N=3564), where the cloud top height is determined from radar data alone below 3 km. Each observation represents a 30 min average. RMSD (Pearson's correlation coefficient) in (c) is 0.08 (0.98) and 0.17 (0.87) in (d).

The 2D histogram comparisons of merged lidar–radar CF and FSC (Fig. 2) show the elimination of bias across the entire range of FSC in the later subperiod; however, this may be due to the introduction of compensating errors when the overprediction of clouds by the combined MPL–ceilometer cloud base height is compensated by the under prediction of clouds by the radar cloud top height in the later period when only radar is used to detect cloud top height below 3 km. Though the mean bias is reduced by 0.08 (Table 2) in the later period, the scatter about the 1:1 line is little affected as evidenced by only a 0.04 reduction in the RMSD in the later subperiod; additionally, 51 % of CF values differ from FSC by more than 0.1 in the early period compared to 41 % in the later subperiod. The modest improvement in agreement indicates the continued presence of cloud detection differences. The 2D histograms for the merged-lidar data CF vs. FSC (Fig. 3) show better agreement across the entire range in the later subperiod, consistent with an improvement in ShCu cloud detection. The compensating error hypothesis is supported further by the comparisons of ceilometer CF to FSC (Fig. 4), which return the highest correlation coefficients (Table B3) and the lowest RMSD (Tables 2 and 3).

Figure 2Two-dimensional (2D) occurrence distributions of paired-in-time CF and FSC for the three periods of interest: (a, b) 2000–2017, (c, d) 2000–2010 and (e, f) 2011–2017. Solid magenta line is 1:1 and dashed is least-squares fit, with regression coefficients given in Appendix B (Tables B2 and B3). Left (a, c, e) and right (b, d, f) columns define fine (15 min FSC and 30 min CF) and coarser (60 min FSC and CF) temporal scales, respectively. Color scale represents counts in increments of 10. Each included data point excludes clear and overcast conditions using criteria 0.05 < FSC < 0.95. CF from merged lidar–radar method uses information about cloud base height (ceilometer–MPL) and cloud top height (radar–MPL).

Since ceilometer CF does not demonstrate dependence on subperiods, it is unlikely that the decrease in merged ceilometer–MPL CF between the two periods is due to the ceilometer. The merged ceilometer–MPL CF is always the same or greater than CF from the ceilometer alone, and the bias decreases between the two subperiods from 0.19 to 0.11 (comparing times with complete ceilometer observations). This decrease in bias also corresponds to a decrease in the RMSD from 0.25 to 0.16 and an increase in Pearson's correlation from 0.86 to 0.91. The improvement in agreement might be attributable to improvements in the MPL mask that reduce false cloudy returns from boundary layer aerosols, resulting in a lower MPL CF (Sect. A3). It is also consistent with an increased reliance on the ceilometer for boundary layer retrievals in the cloud base best estimate retrieval algorithm (Sect. A4). Specifically, the updates of the merging algorithm would likely result in more frequent “clear” returns. The TSI and ceilometer data in comparison with the merged ceilometer–MPL and/or ceilometer–MPL–radar data may be preferable for consistent estimates of ShCu cloud cover at the SGP site from 2000 to present.

## 4.2 Impact of FOV on cloud cover measurements

Recall that the CF obtained from lidar–radar observations with narrow FOV represents a transect of a cloudy sky along the wind direction, while FSC acquired from wide-FOV TSI data defines an area of cloudy sky. Both the CF and the FSC are widespread measurement-based estimates of cloud cover. Fractional sky cover (FSC) from a 100 image spans about a 3 km width of the sky for a 1.5 km cloud base, whereas the active remote sensing instruments (radar, MPL and ceilometer) observe only a “soda straw” approximately 10 m wide – essentially a point in the sky. Cloud fraction obtained from these narrow field of view instruments is thus essentially a 1D transect through clouds that pass overhead. Previous studies have reported sampling uncertainties in transect measurements for modeled random cloud fields (Astin et al., 2001; Berg and Stull, 2002; Kassianov et al., 2005). The term “random” refers to the random arrangement of clouds on a horizontal plane within a given domain. In particular, the previous model studies (Astin et al., 2001; Berg and Stull, 2002) have demonstrated that the cloud cover obtained from the transect measurements mimics the area-averaged cloud cover for non-organized (e.g., random) cloud fields well if the sample size is relatively large (or numerous individual clouds are sampled). Recently Oue et al. (2016) showed that 10 or more ceilometers equally spaced across a 25 km width in the cross-wind direction are required to estimate the simulated cloud cover in the small (30 km) domain. Certainly, the number of ceilometers, their locations and averaging time required for an accurate estimation of the cloud cover depend on the spatial arrangement of clouds and wind speed. Conversely, poor agreement is expected for organized cloud fields, such as so-called “cloud streets,” where individual clouds are arranged in rows along the mean wind direction within a given period of interest. To estimate the relative impact of the FOV configuration on the estimated cloud cover, we use the “CF-like” observations (Sect. 3.3) following an approach previously introduced by Wagner and Kleiss (2016). Recall that the narrow-FOV “CF-like” and the wide-FOV observations are both from the TSI and thus have identical sensitivities for ShCu detection. Thus, the differences between “CF-like” and FSC illustrate the FOV impact on the estimated cloud cover.

Comparison of “CF-like” and FSC shows symmetric variability around the 1:1 line (Fig. 5) and no indication of bias in average values (Tables 2 and 3). For the 1 h (30 min) observations, 23 % (41 %) of “CF-like” values differ from FSC by more than 0.1, and 5 % (14 %) differ by more than 0.2. Reducing the averaging time from 1 h to 30 min increases the RMSD by 0.05 (Table 2) and decreases the correlation from 0.92 to 0.85 (Table B3). Similar trends in the RMSD and correlation coefficients are seen for all the CF measurements as well. This result is consistent with a 15 min decorrelation time for the temporal FSC fluctuations (Kassianov et al., 2005). The “CF-like”–FSC comparison allows us to estimate that narrow-FOV observations are responsible for a ±0.1 uncertainty in hourly observations of CF (Table 2).

Using the passive “CF-like” observations, we also emphasize the relative impact of the instrument-dependent cloud detection and data merging for actively sensed CF measurements. The variability in the best fits in Figs. 2–4 (comparing FSC to CF) is greater than is expected from FOV impacts alone (Fig. 5) for both hourly and sub-hourly observations. For hourly measurements, 23 % of the “CF-like” values differ from FSCs by more than 0.1; this is the expected impact of FOV alone. For the earlier subperiod (2000–2010), the percentages of the CF values differing by more than 0.1 from the wide-FOV FSC are 62 %, 51 % and 34 % for the merged ceilometer–MPL, lidar–radar and ceilometer retrievals, respectively. For the later subperiod (2011–2017), which has the least bias, the corresponding percentages are 53 %, 41 % and 30 %. The large percentages of observations with significant uncertainty in both time periods highlight the importance of distinguishing between FOV effects from instrumental detection differences when using data for sub-hourly applications.

Figure 3The same as Fig. 2, except CF from merged ceilometer–MPL method only uses information about cloud base height.

The effective spatial area sampled by either narrow or wide FOV instruments is a function of both sampling duration and wind speed. High wind speed in comparison with low wind speed (1) increases sample size for a given period and (2) tends to organize horizontal arrangement of clouds (e.g., Weckworth et al., 1999; Atkinson and Zhang 1996). These two factors associated with sample size and spatial arrangement of clouds should be considered when differences between cloud cover obtained from narrow- and wide-FOV observations as function of wind speed are considered (Fig. 6). In particular, Fig. 6 illustrates that both CF–FSC and “CF-like”–FSC differences are reduced noticeably as the wind speed increases from 1 to 3 m s−1 and continue to reduce slightly as the wind speed grows up to 11 m s−1. The CF–FSC and “CF-like”–FSC differences obtained at a higher wind speed (above 11 m s−1) should be considered with caution due to limited number of the corresponding cases with high wind speed (e.g., fewer than 100 cases for 60 min time average). The increased sampling area associated with increased wind speed does not necessarily result in an improved agreement between the narrow- and wide-FOV observations for both hourly and sub-hourly observations due to the impact of wind speed on cloud organization.

Figure 4The same as Fig. 2, except CF only uses cloud base height information from the ceilometer.

Figure 5The same as Fig. 2, except for comparison between “CF-like” and FSC, where both are derived from the TSI instrument.

## 4.3 Heuristic tool to evaluate individual cloud cover estimates.

We developed a tool to help understand the impact of different sources on individual CF measurements. These sources include data quality control, detection differences and spatial cloud organization during the time period of interest (FOV-impact). Many factors contribute to cloud field organization including cloud cover, cloud size (Zhu et al., 1992), wind speed, cloud growth and decay rates, and the presence of cloud streets and clusters. The “quick-look” tool is used to distinguish the detection-based uncertainty from the FOV-impact during different sky conditions.

Figure 6Absolute difference between FSC and CF as a function of wind speed at CBH (2 m s−1 bins). Markers indicate median, and bars indicate 25th and 75th quartiles. Red: absolute difference between FSC and “CF-like” from the TSI. Blue: absolute difference between FSC and CF from the ceilometer. (a) The 15 min FSC and 30 min CF and (b) 60 min FSC and CF. Missing wind speed at CBH is replaced by RWP wind speeds at 500 m; missing RWP data are replaced using surface wind speeds.

TSI cloud mask images are analyzed for FSC differences along the cross-wind direction to visually assess the spatial inhomogeneity of the cloud field and its potential impact on disagreement between narrow- and wide-FOV observations. Prior to our analysis, the 25 FOV sun circle is removed and the cloud mask is cropped to the 100 FOV and undistorted from hemispherical to rectilinear coordinates (Chow et al., 2011). Mean composite images are generated by summing all images within 15 min and dividing by the number of images (Fig. 7a). Each pixel in the averaged image can be interpreted as a 15 min CF measurement from a narrow-FOV sensor. The variability of CF in the cross-wind direction can indicate the possible influence of cloud field organization on cloud cover estimates by narrow-FOV observations. The composite image is then divided into 21 equally spaced “lanes” parallel to the wind direction at CBH (lane FOV is 5). The mean and interquartile CF from each lane is then computed for this 15 min average. Figure 7b shows that the cross-wind variability represented by the lane CF means exceeds substantially the within-lane variability (vertical bars). The low variability within each lane indicates small changes in FSC within a lane for this period.

Figure 7(a) A composite 15 min average cloud cover image (Sect. 4.3) created from 100 FOV TSI cloud masks captured every 30 s with sun circle region removed and projected onto a rectilinear grid. Color bar indicates the fraction of time each pixel observes a cloud (thin or opaque). North is top of figure; streaks result from cloud motion across the image field. Image is divided into 21 lanes oriented parallel to the wind direction (red arrow points downwind) obtained from a radar wind profiler at CBH. (b) The mean and interquartile range (vertical bars) of CF observed within each of the 21 lanes. Note, the moderate ($\sim \mathrm{20}{}^{\circ }$) misalignment of the cloud streaks and the wind vector, illustrating the challenges of wind-based analyses.

Figure 8 shows an example of “quick-look” results generated for a single day (14 June 2017). The narrow-FOV (5) lane-averaged FSC (Fig. 7) is used to estimate the FOV impact for each 15 min interval. The variability in lane FSC for each 15 min interval is visually displayed as one row in the heatmap (Fig. 8a) and provides the spatial distribution of the narrow-FOV FSC perpendicular to the wind direction. The source of between-lane FSC variability can be inferred from the 15 min mean composite images themselves (Fig. 8b). For perfect data, the thumbnail images are a representation of the mean cloud field within the 100 FOV image. For typical wind speeds (4–15 ms−1) it is common to observe streaks, as is seen in this example day, generated by the motion of clouds (see also Fig. 9a, b). For low wind speeds (less than 4 ms−1), a mottled field of clouds emerges with pixel CF dependent upon the rate of cloud formation and decay (Fig. 9c). Less variability in cross-lane FSC is observed for large cloud cover (FSC > 0.7) especially for typical winds (Fig. 9d).

Figure 8Example of results from “quick-look” tool for comparing FSC and CF for 14 June 2017. (a) Heat map of 15 min average CF indicated by the color bar. Cross-wind distance is calculated from the average CBH of 1.5 km, with zero representing the center lane of the composite image – the row from 13:00–13:15 (LST) corresponds to the FSC values in Fig. 7b. (b) The 15 min composite images within the 100 FOV, projected onto rectilinear grid. Red arrows indicate wind direction at CBH. (c) Comparison of cloud amounts obtained from TSI and lidar–radar data, including the following: 30 min averages of CFbt (CF “cloud bases and tops”) from merged lidar–radar for ShCu only (red dashed with circles), CFceil (CF “from ceilometer”; red line with “x” symbol) and CFtot (CF “total”) from merged ceilometer–MPL for any cloud detection (blue line with squares). The 15 min average FSC from TSI data (black with circles) and 15 min minimum and maximum lane-averaged CF (light grey shading) and interquartile range of lane CF (dark grey shading). Markers are placed at time bin center.

Figure 9Illustrations of spatial patterns of ShCu manifest in 15 min averaged composite images (100 FOV, rectilinear coordinates) and a representative all-sky image from the averaging period. Color bar is 15 min mean FSC for each pixel in the composite image; black arrow indicates wind direction; red dot in sky image indicates location of the 3 pixels × 3 pixels for “CF-like”. The 15 min FSC, 30 min CF from merged lidar–radar method, “CF-like”, and wind speed are provided as text. (a) A bit of bird feces contaminates the mirror on the left-hand side of the image, fortunately not within the 100 FOV, resulting in no error of FSC. (a–c) *1: wind source is radar wind profiler at CBH, (d) *3 is 10 m surface wind speed.

The lane-averaged FSC is compared to actual 30 min CF measurements in Fig. 8c. The intention of this plot is to compare the narrow-FOV CF to the range of possible narrow-FOV FSC observed over different sky regions. By quantifying the uncertainty in cloud cover estimates due to the spatial variations in the mean cloud field, we are better equipped to separate FOV impacts from detection differences. For example, a noticeable cloud street appears from about 12:30–13:15 local time, characterized by a wider spread in the lane-FSC minmax and interquartile range (IQR). The mean FSC is approximately 0.5 during this time, and the CF varies between 0.25 and 0.3 and is nearly equal to the lane-FSC minimum. The corresponding thumbnails in Fig. 8b appear nonuniform, suggesting that the CF measurement is impacted by the narrow FOV configuration. In contrast, from 13:15 to 14:15 local time a more uniform mean cloud field is depicted, and the lane-FSC IQR is within 0.1 of the FSC. During this period, ceilometer CF is also within the lane IQR. Surprisingly, CF from the merged lidar–radar diverges from the ceilometer value for the first time in the day and exceeds the lane-FSC maximum significantly. We interpret the larger merged lidar–radar CF to be an overestimation of cloud cover associated with detection differences or data merging issues.

Figure 8 is also useful for quality checks as the quick looks are not censored based on the criteria of Sect. 3.4. For example, on this day we can see little evidence of upper-level clouds by the close agreement of the merged ceilometer–MPL total CF (blue) and ShCu CF (red dashed). The thumbnails illustrate image artifacts such as incomplete removal of sun glare (17:00–19:00 LT), which would cause FSC to be overestimated. In this same period, the images otherwise show relatively uniform cloud cover; therefore CFceil might be preferred over FSC. Sun glare is common in the TSI, and a method has been proposed to identify and correct its effects on the FSC (Long, 2010). The wind-direction, indicated by the red arrow, is provided to verify cloud motion with observed wind direction. This is particularly helpful if missing wind data at CBH have been imputed from other sources, as it can be incorrect. Sun glare and incorrect wind direction are significant limitations of the extensibility of this method to a statistical analysis.

There are two main expected applications of the introduced “quick-look” tool. The first potential application is a classification of spatial organization of cloud fields using, for example, cross-wind cloud field variability (e.g., peaks and valleys in Fig. 7b) and within-lane variance in cloud amount (e.g., vertical bars in Fig. 7b). Numerous images generated by the “quick-look” tool (e.g., Fig. 8b) for the extended period (2000–2017) can be considered as a valuable training dataset for machine learning with focus on automated detection of desired features of the cloud fields (e.g., cloud streets) and unwanted contaminations of TSI images (e.g., Fig. 9). The second potential application is a visual inspection of the generated images for a given period of interest (e.g., a short-term field campaign) to check for the impact of instrumental detection differences and cloud field organization on the observed cloud amount. Visual inspection may be feasible given a limited number (about 40) of ShCu events annually during the warm season. For example, a spread of the lane CFs (gray region in Fig. 8c) gives an idea about the cross-wind cloud field variability within a given FOV and thus aids in understanding the difference between cloud amounts obtained from the narrow- and wide-FOV observations.

5 Conclusions

We compare single-layer ShCu cover estimates obtained from the narrow-FOV lidar–radar and the wide-FOV TSI data collected at the continental SGP site. The data represent an extended period (2000–2017 summers) and two subperiods (2000–2010 and 2011–2017 summers), which mark the instrumentation upgrades and corresponding improvements in data merging. Our comparison includes the bias between the long-term mean values of the cloud cover estimates, the corresponding RMSDs and correlation coefficients. The main conclusions are organized along the two guiding questions highlighted in the introduction:

1. Have significant changes in the observations of ShCu cover occurred at the SGP site due to instrumental and algorithmic upgrades? We demonstrate that the best agreement (bias and RMSD are within 0.04 and 0.12, respectively) occurs between the CF obtained from the ceilometer alone and FSC obtained from TSI data for the entire period and two subperiods. In contrast, the CF calculated from combined ceilometer–MPL data has the largest disagreement with FSC in the first subperiod (positive bias of 0.12; RMSD of 0.2), with improvement in the second subperiod (positive bias of 0.08; RMSD of 0.17). This improvement is likely associated with updates to the MPL cloud mask and improved merging strategies that rely more on the ceilometer. When incorporating a threshold on the cloud top height, the CF obtained from the combined ceilometer–MPL–radar data has better agreement with FSC in the first subperiod (positive bias of 0.08; RMSD of 0.18),and good agreement in the second subperiod (no bias; RMSD of 0.14). The strong period dependence of CF obtained from the combined ceilometer–MPL–radar data, is likely due to a change in what sensors are relied on to detect cloud top below 3 km. Post 2011, KAZRARSCL stopped using the MPL cloud top detection below 3 km leaving the radar as the sole sensor for cloud detection in that region. However, the radar has known difficulties in detecting cumuli; therefore, the improved agreement is likely due to partial compensating errors in cloud detection.

2. What is the impact of FOV configurations on hourly and sub-hourly observations of ShCu cover? TSI-based narrow- and wide-FOV FSC comparisons have no bias and RMSD of 0.1 for hourly observations for all considered time periods. The obtained small RMSD shows that, on average, the narrow-FOV introduces an uncertainty in cloud cover of ±0.1 for 1 h CF measurements. The “penalty” for decreasing the averaging time to 30 min (15 min FSC) is to increase the average uncertainty in CF to ±0.15. The majority (77 %) of the comparisons are within 0.1 for hourly observations, whereas only 59 % of the 30 min (narrow-FOV) versus 15 min (wide-FOV) comparisons are within 0.1. This considerable impact of the FOV configuration on sub-hourly observations of ShCu, confounded with instrumental detection differences, motivates the introduction of a new “quick-look” tool.

The “quick-look” tool uses a spatially resolved analysis of the cross-wind variability in cloud cover in the TSI 100 FOV to provide an expected range of cloud cover that would be detected by a set of 21 narrow-FOV instruments with the same detection properties as the TSI. We demonstrate the utility of the “quick-look” tool to identify the impacts of the FOV configuration on the cloud cover estimates and to identify periods with organized cloud fields. In addition, the “quick-look” tool can identify times with potential issues associated with data merging and cloud detection, as well as periods with TSI data quality concerns. The developed data-centric tool may supplement outputs from the existing and future narrow-FOV lidar–radar simulators (Bouniol et al., 2010; Tatarevic and Kollias, 2015; Lamer et al., 2018).

The presented dataset provides a unique opportunity to (1) contrast the long-term cloud cover estimates with different instrumental sensitivities and data merging strategies and (2) describe the across-wind variability in cloud cover at user-specified temporal and spatial scales. These data provide an observational foundation for a better interpretation of quandaries such as the model-to-data discrepancy of cloud cover (Zhang et al., 2017). Finally, the developed dataset is expected to facilitate evaluation of the modeled shallow cumuli over small domains ($\sim \mathrm{4}\phantom{\rule{0.125em}{0ex}}\mathrm{km}×\mathrm{4}\phantom{\rule{0.125em}{0ex}}\mathrm{km}$) at the SGP site. These studies with strong model and observational components include the recent large eddy simulation (LES) ARM Symbiotic Simulation and Observation (LASSO) project (Gustafson and Vogelmann, 2015) and the Holistic Interactions of Shallow Clouds, Aerosols, and Land-Ecosystems (HI-SCALE) field campaign (Fast et al., 2017).

Appendix A: Description of millimeter cloud radar (MMCR), micropulse lidar (MPL) and merging algorithm 2000–2017

## A1 Active Remotely-Sensed Cloud Locations product overview

MMCR, MPL and ceilometer data are combined in the Active Remotely-Sensed Cloud Locations (ARSCL) value-added product (Clothiaux et al., 2000). The product improves upon the MMCR vertical profiling of hydrometeors by employing an MPL cloud mask to identify CBH and aide in mitigating the significant impact of insect returns. The screened high temporal (10 s) and vertical (45 m) radar reflectivity best estimate (Kollias et al., 2016) is further distilled to generate a user-friendly data stream of the vertical distribution of hydrometeors above the SGP site (sgparsclbnd1clothC1.c1). Within this data stream is a field for best estimate of CBH (“CloudBaseBestEstimate”) from merging MPL and ceilometer data and two fields containing up to 10 hydrometeor layer determinations of cloud bottom (“CloudLayerBottomHeightMplCloth”) and top (“CloudLayerTopHeightMplCloth”) heights. These two fields are derived from the reflectivity best estimate, cloud base best estimate and MPL cloud mask. In 2011, the ARSCL was replaced with the KAZRARSCL with corresponding data stream name “sgparsclkazrbnd1kolliasC1.c1”. Although the reporting interval of the MPL and ceilometer are 30 and 16 s, respectively, the ARSCL reporting interval is 10 s corresponding to the MMCR and 4 s for KAZRARSCL corresponding to the KAZR.

Each instrument (MMCR, MPL and ceilometer) has challenges for detecting ShCu; therefore the data merging process can significantly impact the resulting cloud fraction. Radar reflectivity is weak for shallow cumuli with low liquid water path (below 50 gm−2) due to the small cloud droplet radii (3–7 µm); for this reason, approximately half of the clouds identified by the ceilometer are not seen by the radar (Chandra et al. 2013). Consequently, the lidar data (ceilometer and MPL) are used for CBH information.

The MPL has a greater sensitivity to cloud droplets than the ceilometer and MMCR, resulting in a larger CF (Donovan and van Lammeren, 2001; Kennedy et al., 2014). Some weaknesses of the MPL include the following: (1) misidentification of aerosol layers as clouds, due to the increased sensitivity to aerosols of shorter wavelength of the laser; (2) difficulty retrieving cloud bases in the “blind spot” below 500 m (Welton and Campbell, 2002; Sivaraman and Comstock, 2011); (3) quick attenuation of the backscatter signal within clouds resulting in inaccurate cloud top height; and (4) longer integration time (30 s), which may complicate returns in partially cloudy conditions. It was previously found that when MPL is merged with the ceilometer, the combined lidar CF is larger by 0.03–0.06 than CF from the MPL alone (Kennedy et al., 2014). The ceilometer has better visibility and accuracy in the lower atmosphere than the MPL, but its range is limited to 7.7 km. Below 3 km, the ceilometer is used to define CBH if both MPL and ceilometer detect clouds; otherwise if just one instrument detects cloud then a CBH is returned from that instrument. Therefore, there is a preference to report a 10 s period as cloudy if the lidars disagree.

Table A1References to instrumentation descriptions and service records at the SGP site.

The lidars are limited in determining cloud top height owing to the attenuation of the backscatter signal. Cloud top height from the MPL is typically assessed by setting a threshold on the attenuation of the return; for this reason, MPL cloud top heights are usually considered effective top heights. Multiple cloud layer detection is better performed by the radar, and the algorithmic updates in KAZRARSCL processing reflect this strength (Sect. A4).

The MPL cloud mask calculates cloud boundaries predominately from the attenuated backscatter signal. The original cloud mask involved a signal-to-noise threshold based on cloud droplet scattering (Campbell et al., 1998), whereas the updated cloud mask incorporated information on different scattering properties of aerosol and clouds (Wang and Sassen, 2001). In 2004 the MPL non-polarized implementation was replaced with a polarized system. The MPL hardware was upgraded in 2010 to a fast switching mode that allowed for switching between linear and circular polarization channels (Coulter, 2012). The cloud mask algorithm was also updated at this time to a methodology developed by Wang and Sassen (2001), which has been used to process the entire MPL record at the ARM facilities, though this cloud boundary processing has only been used in ARSCL and KAZRARSCL since 2010. It is expected that the updated cloud mask would improve cloud returns. Interested readers are directed to the technical documentation for the MPL mask (Sivaraman and Comstock, 2011).

## A4 Cloud detection algorithm developments (2011)

Many changes occurred between the summers of 2010 and 2011 including the following: (1) ARSCL was changed to use the MPL cloud mask of Wang and Sassen (2001) in 2010; (2) the new KAZR was deployed, and the MMCR was decommissioned; and (3) KAZRARSCL replaced ARSCL in 2011, including updates to how retrievals of cloud layer boundaries (bases and tops) are handled in the MPL cloud mask. It is beyond the scope of this paper to identify all the possible changes that could result in differences in the ARSCL- and KAZARSCL-based cloud detections. However, it is important to note that both the MPL and MMCR are used to determine the cloud top heights within the boundary layer in ARSCL, whereas in KAZRARSCL only the KAZR is used to determine cloud top heights below 3 km due to well-known MPL difficulties detecting low clouds (Berg and Kassianov, 2008; Karen Johnson, private communication, 2018). In KAZRARSCL the MPL mask is still used for removing clutter in the radar returns.

Updates to the cloud base best estimate algorithm are also implemented. In particular, when the ceilometer observed clear skies and the MPL had no return, the ARSCL data are flagged as “possibly clear”, whereas the KAZRARSCL data are flagged as clear. The ceilometer is used to judge clear or cloudy below 500 m in KAZRARSCL. Otherwise in both versions the cloud base of the ceilometer is preferred if both instruments return cloud below 3 km, and if either detects cloud then a cloud base height is returned (with the exception noted above).

Appendix B: Supporting figures and tables

Scatter plots of sequential application of quality control procedures (Sect. 3.4) for 30 min CF and 15 min FSC are shown in Fig. B1, while data completeness for each summer in this study is provided in Table B1. The regression coefficients and correlation coefficients for Figs. 3–6 are provided in Tables B2 and B3, respectively.

Table B1Summary of single-layer ShCu events by year and percent data completeness after applying quality control steps. Entries correspond to scatterplots in Fig. B1. In particular these periods comply to the requirement that 0.05 < FSC < 0.95.

a TSI data begins 2 July. b TSI missing most data in June and July. c In 2010, 19 d with shallow cumuli are identified, of which six contained events longer than 2 h that also satisfied single layer conditions, CF(all clouds) – CF (ShCu) < 0.1. Only 14 % of the observations satisfied the 100 % instrument validity criterion. The values in this table are generated from the 15 min dataset. These values do not correspond to the 1 h values (i.e., the 1 h values are not generated from the 15 min values and are subject to passing the same quality control thresholds but on the hourly timescale). d Data in Fig. B1a. e Percent of observations that pass a 100 % quality control (QC) percent valid test for ARSCL/KAZRARSCL; data shown in Fig. B1b. f Data shown in Fig. B1c. g “Ceil-ok” is the field in the original dataset indicating the ceilometer data passed automated quality control.

Table B2Regression coefficients for Figs. 3–6.

CF abbreviations defined in Table 1.

Table B3Pearson's correlation coefficient for FSC vs. CF for fine and hourly timescale.

* This value is calculated from a much smaller subset of times (see Nceil, Tables 2 and 3). CF abbreviations defined in Table 1.

Figure B1Comparison of 15 min FSC and 30 min CF from the merged ceilometer–MPL method after data screening steps. (a) Single-layer cases: CF (all clouds) – CF (ShCu) < 0.1 and 0.05< FSC <  0.95 (N=9571); (b) Data as in (a), with additional requirement to discard periods with missing CBH returns (N=9128); (c) Data as in (b), with periods when 15 min FSC from thin cloud exceeds 0.3 are removed (N=8192). Each point represents 15 min period.

Data availability
Data availability.

The analysis dataset is derived from observations collected at the Atmospheric Radiation Measurement (ARM) Southern Great Plains (SGP) site Central Facility in northern Oklahoma, USA, and are freely available at links listed below. Specific data streams in Sect. 2 can be requested through the interactive search tool at https://adc.arm.gov/discovery/ (last access: 16 April 2020). The individual datasets are as follows:

The data from the figures and novel “quick-look” tool can be accessed online for all 614 d with detected shallow cumuli: TSI composite images merged cloud fraction product for shallow cumulus cases (tsiQLtable) (Kleiss, 2020), available on the ARM archive at https://doi.org/10.5439/1523254.

Author contributions
Author contributions.

All authors contributed to the experimental design, analysis plan and editing of the paper. EAR conducted the statistical data analysis. JMK implemented the “quick-look” tool. LDR and LKB advised on the active remote sensing instrumentation. CNL advised on the TSI quality control. EK provided project oversight and direction. EK and EAR lead the composition of the paper.

Competing interests
Competing interests.

The authors declare that they have no conflict of interest.

Acknowledgements
Acknowledgements.

We would like to thank radar experts Joseph Hardin and Nitin Bhawadraj at Pacific Northwest National Laboratory for researching (and many conversations explaining) the updates to the MMCR and replacement by KAZR. We would also like to thank Karen Johnson for diagramming the changes made to the cloud base best estimate algorithm in the ARSCL processing. We also appreciate all of the technicians, software engineers, data managers and scientists who work together to produce ARM data, in particular Victor Morris who is in charge of the ceilometer and TSI measurements for ARM that were pivotal to this work. This research was supported by the U.S. Department of Energy Atmospheric System Research grants DE-SC0016084 and KP1701000/57131.

Financial support
Financial support.

This research has been supported by the U.S. Department of Energy Atmospheric System Research (grant nos. DE-SC0016084 and KP1701000/57131).

Review statement
Review statement.

This paper was edited by Piet Stammes and reviewed by two anonymous referees.

References

Angevine, W. M., Olson, J., Kenyon, J., Gustafson, W. I., Endo, S., Suselj, K., and Turner, D. D.: Shallow Cumulus in WRF Parameterizations Evaluated against LASSO Large-Eddy Simulations, Mon. Weather Rev., 146, 4303–4322, https://doi.org/10.1175/MWR-D-18-0115.1, 2018.

Arakawa, A.: The Cumulus Parameterization Problem: Past, Present, and Future, J. Climate, 17, 2493–2525, https://doi.org/10.1175/1520-0442(2004)017<2493:RATCPP>2.0.CO;2, 2004.

ARM: ARM Best Estimate Data Pproducts (ARMBE) at Southern Great Plains (SGP) Central Facility, Lamont, OK (C1), Atmospheric Radiation Measurement (ARM) user facility, available at: https://www.arm.gov/capabilities/vaps/armbe (last access: 25 February 2019), 1994.

Astin, I., Girolamo, L. D., and van de Poll, H. M.: Bayesian confidence intervals for true fractional coverage from finite transect measurements: Implications for cloud studies from space, J. Geophys. Res.-Atmos., 106, 17303–17310, https://doi.org/10.1029/2001JD900168, 2001.

Atkinson, B. W. and Zhang, J. W.: Mesoscale shallow convection in the atmosphere, Rev. Geophys., 34, 403–431, https://doi.org/10.1029/96RG02623, 1996.

Berg, L. K. and Kassianov, E. I.: Temporal Variability of Fair-Weather Cumulus Statistics at the ACRF SGP Site, J. Climate, 21, 3344–3358, https://doi.org/10.1175/2007JCLI2266.1, 2008.

Berg, L. K. and Stull, R. B.: Accuracy of Point and Line Measures of Boundary Layer Cloud Amount, J. Appl. Meteorol., 41, 640–650, https://doi.org/10.1175/1520-0450(2002)041<0640:AOPALM>2.0.CO;2, 2002.

Berg, L. K., Kassianov, E. I., Long, C. N., and Mills, D. L.: Surface summertime radiative forcing by shallow cumuli at the Atmospheric Radiation Measurement Southern Great Plains site, J. Geophys. Res., 116, D01202, https://doi.org/10.1029/2010JD014593, 2011.

Berg, L. K., Gustafson, W. I., Kassianov, E. I., and Deng, L.: Evaluation of a Modified Scheme for Shallow Convection: Implementation of CuP and Case Studies, Mon. Weather Rev., 141, 134–147, https://doi.org/10.1175/MWR-D-12-00136.1, 2013.

Boers, R., de Haij, M. J., Wauben, W. M. F., Baltink, H. K., van Ulft, L. H., Savenije, M., and Long, C. N.: Optimized fractional cloudiness determination from five ground-based remote sensing techniques, J. Geophys. Res., 115, D24116, https://doi.org/10.1029/2010JD014661, 2010.

Bouniol, D., Protat, A., Delanoë, J., Pelon, J., Piriou, J.-M., Bouyssel, F., Tompkins, A. M., Wilson, D. R., Morille, Y., Haeffelin, M., O'Connor, E. J., Hogan, R. J., Illingworth, A. J., Donovan, D. P., and Baltink, H.-K.: Using Continuous Ground-Based Radar and Lidar Measurements for Evaluating the Representation of Clouds in Four Operational Models, J. Appl. Meteorol. Clim., 49, 1971–1991, https://doi.org/10.1175/2010JAMC2333.1, 2010.

Campbell, J. R., Hlavka, D. L., Spinhirne, J. D., Turner, D. D., and Flynn, C. J.: Operational Cloud Boundary Detection and Analysis from Micropulse Lidar Data, in: Proc. Eighth ARM Science Team Meeting, U.S. Department of Energy, Tucson, AZ, USA, 119–122, 1998.

Chandra, A. S., Kollias, P., and Albrecht, B. A.: Multiyear Summertime Observations of Daytime Fair-Weather Cumuli at the ARM Southern Great Plains Facility, J. Climate, 26, 10031–10050, https://doi.org/10.1175/JCLI-D-12-00223.1, 2013.

Chen, X. and Xie, S.: ARM Best Estimate Data Products (ARMBEATM), Atmospheric Radiation Measurement (ARM) user facility, https://doi.org/10.5439/1095313, 1 January 1994.

Chow, C. W., Urquhart, B., Lave, M., Dominguez, A., Kleissl, J., Shields, J., and Washom, B.: Intra-hour forecasting with a total sky imager at the UC San Diego solar energy testbed, Sol. Energy, 85, 2881–2893, https://doi.org/10.1016/j.solener.2011.08.025, 2011.

Clothiaux, E. E., Ackerman, T. P., Mace, G. G., Moran, K. P., Marchand, R. T., Miller, M. A., and Martner, B. E.: Objective Determination of Cloud Heights and Radar Reflectivities Using a Combination of Active Remote Sensors at the ARM CART Sites, J. Appl. Meteorol., 39, 645–665, https://doi.org/10.1175/1520-0450(2000)039<0645:ODOCHA>2.0.CO;2, 2000.

Coulter, R., Muradyan, P., and Martin, T.: Radar Wind Profiler (915RWPWINDCON), Atmospheric Radiation Measurement (ARM) user facility, https://doi.org/10.5439/1025135, 3 August 1998.

Donovan, D. P. and van Lammeren, A. C. A. P.: Cloud effective particle size and water content profile retrievals using combined lidar and radar observations: 1. Theory and examples, J. Geophys. Res.-Atmos., 106, 27425–27448, https://doi.org/10.1029/2001JD900243, 2001.

Endo, S., Zhang, D., Vogelmann, A. M., Kollias, P., Lamer, K., Oue, M., Xiao, H., Gustafson, W. I., and Romps, D. M.: Reconciling differences between large-eddy simulations and Doppler-lidar observations of continental shallow cumulus cloud-base vertical velocity, Geophys. Res. Lett., 46, 11539–11547, https://doi.org/10.1029/2019GL084893, 2019.

Ermold, B. and Morris, V. R.: Ceilometer (CEIL) at Southern Great Plains (SGP) Central Facility, Lamont, OK (C1), Atmospheric Radiation Measurement (ARM) user facility, https://doi.org/10.5439/1181954, 1997.

Fairless, T., Giangrande, S., and Johnson, K.: Active Remote Sensing of CLouds (ARSCL) product using Ka-band ARM Zenith Radars (ARSCLKAZR1KOLLIAS), Atmospheric Radiation Measurement (ARM) user facility, https://doi.org/10.5439/1350629, 18 January 2011.

Fast, J. D., Berg, L. K., Burleyson, C., Fan, J., Feng, Z., Hagos, S., Huang, M., Guenther, A., Laskin, A., and Ovchinnikov, M.: Holistic Interactions of Shallow Clouds, Aerosols, and Land-Ecosystems (HI-SCALE) Field Campaign Report DOE/SC-ARM-17-014, ARM Climate Research Facility, available at: https://www.osti.gov/servlets/purl/1354732 (last access: 12 April 2020), 2017.

Giangrande, S. and Johnson, K.: Active Remotely-Sensed Cloud Locations (ARSCLCBH1CLOTH), Atmospheric Radiation Measurement (ARM) user facility, https://doi.org/10.5439/1027284, 8 November 1996.

Gustafson, W., Cheng, X., Krishna, B., Toto, T., Vogelmann, A., Endo, S., Li, Z., and Xiao, H.: Description of the LASSO Alpha 2 Release: DOE/SC-ARM-TR-199, available at: https://www.arm.gov/publications/tech_reports/doe-sc-arm-tr-199.pdf (last access: 12 April 2020), 2018.

Gustafson, W. I. and Vogelmann, A.: LES ARM Symbiotic Simulation and Observation (LASSO) Implementation Strategy: DOE/SC-ARM-15-039, ARM Climate Research Facility, available at: https://www.arm.gov/publications/programdocs/doe-sc-arm-15-039.pdf (last access: 12 April 2020), 2015.

Gustafson, W. I., Vogelmann, A., Cheng, X., Endo, S., Krishna, B., Li, Z., Toto, T., and Xiao, H.: Recommendations for Implementation of the LASSO Workflow. DOE/SC-ARM-17-031, available at: https://www.osti.gov/servlets/purl/1406259/ (last access: 12 April 2020), 2017.

Henderson-Sellers, A. and McGuffie, K.: Are cloud amounts estimated from satellite sensor and conventional surface-based observations related?, Int. J. Remote Sens., 11, 543–550, https://doi.org/10.1080/01431169008955038, 1990.

Johnson, K. and Giangrande, S. E.: Active Remotely-Sensed Cloud Locations (ARSCLBND1CLOTH) at Southern Great Plains (SGP) Central Facility, Lamont, OK (C1), Atmospheric Radiation Measurement (ARM) user facility, Lamont, OK, https://doi.org/10.5439/1027283, 1996.

Johnson, K., Giangrande, S., and Toto, T.: Active Remote Sensing of CLouds (ARSCL) product using Ka-band ARM Zenith Radars (ARSCLKAZRBND1KOLLIAS) at Southern Great Plains (SGP) Central Facility, Lamont, OK (C1)., Atmospheric Radiation Measurement (ARM) User Facility, Oak Ridge, Tennessee, USA, https://doi.org/10.5439/1350630, 2011.

Kassianov, E., Long, C. N., and Ovtchinnikov, M.: Cloud Sky Cover versus Cloud Fraction: Whole-Sky Simulations and Observations, J. Appl. Meteorol., 44, 86–98, https://doi.org/10.1175/JAM-2184.1, 2005.

Kennedy, A. D., Dong, X., and Xi, B.: Cloud fraction at the ARM SGP site, Theor. Appl. Climatol., 115, 91–105, https://doi.org/10.1007/s00704-013-0853-9, 2014.

Kleiss, J.: TSI composite images merged cloud fraction product for shallow cumulus cases (tsiQLtable), https://doi.org/10.5439/1523254, 2020.

Kollias, P., Clothiaux, E. E., Ackerman, T. P., Albrecht, B. A., Widener, K. B., Moran, K. P., Luke, E. P., Johnson, K. L., Bharadwaj, N., Mead, J. B., Miller, M. A., Verlinde, J., Marchand, R. T., and Mace, G. G.: Development and Applications of ARM Millimeter-Wavelength Cloud Radars, Meteorol. Monogr., 57, 17.1–17.19, https://doi.org/10.1175/AMSMONOGRAPHS-D-15-0037.1, 2016.

Lamer, K. and Kollias, P.: Observations of fair-weather cumuli over land: Dynamical factors controlling cloud size and cover, Geophys. Res. Lett., 42, 2015GL064534, https://doi.org/10.1002/2015GL064534, 2015.

Lamer, K., Fridlind, A. M., Ackerman, A. S., Kollias, P., Clothiaux, E. E., and Kelley, M.: (GO)2-SIM: a GCM-oriented ground-observation forward-simulator framework for objective evaluation of cloud and precipitation phase, Geosci. Model Dev., 11, 4195–4214, https://doi.org/10.5194/gmd-11-4195-2018, 2018.

Lareau, N. P., Zhang, Y., and Klein, S. A.: Observed Boundary Layer Controls on Shallow Cumulus at the ARM Southern Great Plains Site, J. Atmos. Sci., 75, 2235–2255, https://doi.org/10.1175/JAS-D-17-0244.1, 2018.

Long, C. N.: Correcting for Circumsolar and Near-Horizon Errors in Sky Cover Retrievals from Sky Images, Open Atmos. Sci. J., 4, 45–52, 2010.

Long, C. N., Slater, D. W., and Tooman, T. P.: Total sky imager model 880 status and testing results, Pacific Northwest National Laboratory, available at: https://arm.gov/publications/tech_reports/arm-tr-006.pdf (last access: 28 May 2015), 2001.

Long, C. N., Sabburg, J. M., and Calbó, J.: Retrieving cloud characteristics from ground-based daytime color all-sky images, J. Atmos. Ocean. Tech., 23, 633–652, 2006.

Morris, V.: Total Sky Imager (TSICLDMASK), Atmospheric Radiation Measurement (ARM) user facility, https://doi.org/10.5439/1025306, 2 July 2000.

Morris, V., Flynn, C., and Ermold, B.: Ceilometer (CEIL), Atmospheric Radiation Measurement (ARM) user facility, https://doi.org/10.5439/1181954, 14 September 1997.

Morris, V. R.: Total Sky Imager (TSI) Handbook, ARM TR-017, available at: https://www.wmo.int/pages/prog/gcos/documents/gruanmanuals/Z_instruments/tsi_handbook.pdf (last access: 28 May 2015), 2005.

Muradyan, P. and Coulter, R.: Radar Wind Profiler (915RWPWINDCON) at Southern Great Plains (SGP) Central Facility, Lamont, OK (C1), Atmospheric Radiation Measurement (ARM) user facility, Oak Ridge, Tennessee, USA, https://doi.org/10.5439/1025135, 1997.

Oue, M., Kollias, P., North, K. W., Tatarevic, A., Endo, S., Vogelmann, A. M., and Gustafson, W. I.: Estimation of cloud fraction profile in shallow convection using a scanning cloud radar, Geophys. Res. Lett., 43, 2016GL070776, https://doi.org/10.1002/2016GL070776, 2016.

Park, R.-S. and Kwon, Y. C.: The Implications for Radiative Cloud Forcing via the Link Between Shallow Convection and Planetary Boundary Layer Mixing, J. Geophys. Res.-Atmos., 123, 13203–13218, https://doi.org/10.1029/2018JD028678, 2018.

Qian, Y., Long, C. N., Wang, H., Comstock, J. M., McFarlane, S. A., and Xie, S.: Evaluation of cloud fraction and its radiative effect simulated by IPCC AR4 global models against ARM surface observations, Atmos. Chem. Phys., 12, 1785–1810, https://doi.org/10.5194/acp-12-1785-2012, 2012.

Shi, Y., Flynn, D., and Riihimaki, L.: Fair-Weather Shallow Cumulus Identification (SHCUSUMMARY) at Southern Great Plains (SGP) Central Facility, Lamont, OK (C1), Atmospheric Radiation Measurement (ARM) user facility, Oak Ridge, Tennessee, USA, https://doi.org/10.5439/1392569, 2000.

Sivaraman, C. and Comstock, J.: Micropulse Lidar Cloud Mask Value-Added Product Technical Report DOE/SC-ARM/TR-098, ARM Climate Research Facility, https://doi.org/10.2172/1019283, 2011.

Sunny Lim, K., Riihimaki, L. D., Shi, Y., Flynn, D., Kleiss, J. M., Berg, L. K., Gustafson Jr., W. I., Zhang, Y., and. Johnson, K. L: Long-Term Retrievals of Cloud Type and Fair-Weather Shallow Cumulus Events at the ARM SGP Site, J. Atmos. Ocean. Tech., 36, 2031–2043, https://doi.org/10.1175/JTECH-D-18-0215.1, 2019.

Tatarevic, A. and Kollias, P.: User's Guide CR-SIM SOFTWARE v 2.0, McGill University Clouds Research Group, available at: http://radarscience.weebly.com/radar-simulators.html (last access: 4 December 2020), 2015.

Vial, J., Bony, S., Stevens, B., and Vogel, R.: Mechanisms and Model Diversity of Trade-Wind Shallow Cumulus Cloud Feedbacks: A Review, Surv. Geophys., 38, 1331–1353, https://doi.org/10.1007/s10712-017-9418-2, 2017.

Wagner, T. J. and Kleiss, J. M.: Error characteristics of ceilometer-based observations of cloud amount, J. Atmos. Ocean. Tech., 33, 1557–1567, https://doi.org/10.1175/JTECH-D-15-0258.1, 2016.

Wang, Z. and Sassen, K.: Cloud Type and Macrophysical Property Retrieval Using Multiple Remote Sensors, J. Appl. Meteorol., 40, 1665–1682, https://doi.org/10.1175/1520-0450(2001)040<1665:CTAMPR>2.0.CO;2, 2001.

Wainwright, C. E., Stepanian, P. M., Reynolds, D. R., and Reynolds, A. M.: The movement of small insects in the convective boundary layer: linking patterns to processes, Sci. Rep.-UK, 7, 5438, https://doi.org/10.1038/s41598-017-04503-0, 2017.

Weckwerth, T. M., Horst, T. W., and Wilson, J. W.: An Observational Study of the Evolution of Horizontal Convective Rolls, Mon. Weather Rev., 127, 2160–2179, https://doi.org/10.1175/1520-0493(1999)127<2160:AOSOTE>2.0.CO;2, 1999.

Welton, E. J. and Campbell, J. R.: Micropulse Lidar Signals: Uncertainty Analysis, J. Atmos. Ocean. Tech., 19, 2089–2094, https://doi.org/10.1175/1520-0426(2002)019<2089:MLSUA>2.0.CO;2, 2002.

Wu, W., Liu, Y., Jensen, M. P., Toto, T., Foster, M. J., and Long, C. N.: A comparison of multiscale variations of decade-long cloud fractions from six different platforms over the Southern Great Plains in the United States, J. Geophys. Res.-Atmos., 119, 3438–3459, https://doi.org/10.1002/2013JD019813, 2014.

Xi, B., Dong, X., Minnis, P., and Khaiyer, M. M.: A 10 year climatology of cloud fraction and vertical distribution derived from both surface and GOES observations over the DOE ARM SPG site, J. Geophys. Res., 115, D12124, https://doi.org/10.1029/2009JD012800, 2010.

Xiao, H., Berg, L. K., and Huang, M.: The Impact of Surface Heterogeneities and Land-Atmosphere Interactions on Shallow Clouds Over ARM SGP Site, J. Adv. Model. Earth Sy., 10, 1220–1244, https://doi.org/10.1029/2018MS001286, 2018.

Yun, Y., Fan, J., Xiao, H., Zhang, G. J., Ghan, S. J., Xu, K.-M., Ma, P.-L., and Gustafson, W. I.: Assessing the Resolution Adaptability of the Zhang-McFarlane Cumulus Parameterization With Spatial and Temporal Averaging, J. Adv. Model. Earth Sy., 9, 2753–2770, https://doi.org/10.1002/2017MS001035, 2017.

Zhang, Y. and Klein, S. A.: Factors Controlling the Vertical Extent of Fair-Weather Shallow Cumulus Clouds over Land: Investigation of Diurnal-Cycle Observations Collected at the ARM Southern Great Plains Site, J. Atmos. Sci., 70, 1297–1315, https://doi.org/10.1175/JAS-D-12-0131.1, 2013.

Zhang, Y., Klein, S. A., Fan, J., Chandra, A. S., Kollias, P., Xie, S., and Tang, S.: Large-Eddy Simulation of Shallow Cumulus over Land: A Composite Case Based on ARM Long-Term Observations at Its Southern Great Plains Site, J. Atmos. Sci., 74, 3229–3251, https://doi.org/10.1175/JAS-D-16-0317.1, 2017.

Zhu, T., Lee, J., Weger, R. C., and Welch, R. M.: Clustering, randomness, and regularity in cloud fields: 2. Cumulus cloud fields, J. Geophys. Res.-Atmos., 97, 20537–20558, https://doi.org/10.1029/92JD02022, 1992.