![]() |
CALIPSO Quality Statements |
![]() |
This document provides a high-level quality assessment of the cloud and aerosol layer products derived from the CALIPSO lidar measurements, as described in section 2.4 of the CALIPSO Data Products Catalog (Version 3.2) (PDF). As such, it represents the minimum information needed by scientists and researchers for appropriate and successful use of these data products. We strongly suggest that all authors, researchers, and reviewers of research papers review this document periodically, and familiarize themselves with the latest status before publishing any scientific papers using these data products.
These data quality summaries are published specifically to inform users of the accuracy of CALIOP data products as determined by the CALIPSO Science Team and Lidar Science Working Group (LSWG). This document is intended to briefly summarize key validation results; provide cautions in those areas where users might easily misinterpret the data; supply links to further information about the data products and the algorithms used to generate them; and offer information about planned algorithm revisions and data improvements.
The primary geophysical variables reported by Cloud and Aerosol Layer Products are the spatial locations of layers (e.g., layer base and top altitudes), the surrounding meteorological conditions (e.g., temperature and pressure) and a number of measured and derived optical properties. Optical properties that are directly measured include integrated attenuated backscatter, volume depolarization ratio, and attenuated total color ratio. Derived optical properties are those that can only be obtained via application of the CALIPSO extinction retrieval. Optical depth is the primary derived optical property reported in the Layer Products. Others include ice water path, particulate depolarization ratio, and particulate color ratio. PLEASE NOTE: Users of those CALIOP parameters that are produced by or otherwise depend on the extinction retrieval(s) should read and thoroughly understand the information provided in the Profile Products Data Quality Summary. This summary contains an expanded description of the extinction retrieval process from which the layer optical depths are derived, and provides essential guidance in the appropriate use of all CALIOP extinction-related data products.
Because validation for different parameters can require different levels of effort, and because the uncertainties inherent in some retrievals can be substantially larger than in others, the maturity levels of the parameters reported in the different layer products files are not uniform. Therefore, within this document, maturity levels are provided separately for each scientific data set (SDS) included with the data files. The data product maturity levels for the CALIPSO layer products are defined in the table below.
| Beta: | Early release products for users to gain familiarity with data formats and parameters. Users are strongly cautioned against the indiscriminate use of these data products as the basis for research findings, journal publications, and/or presentations. |
| Provisional: | Limited comparisons with independent sources have been made and obvious artifacts fixed. |
| Validated Stage 1: | Uncertainties are estimated from independent measurements at selected locations and times. |
| Validated Stage 2: | Uncertainties are estimated from more widely distributed independent measurements. |
| Validated Stage 3: | Uncertainties are estimated from independent measurements representing global conditions. |
| External: | Data are not CALIPSO measurements, but instead are either obtained from external sources (e.g., the Global Modeling and Assimilation Office (GMAO)) or fixed constants in the CALIPSO retrieval algorithm (e.g., the 532 nm calibration altitude). |
As a general (but not immutable) rule, the parameters in the cloud and aerosol layer products whose derivation depends wholly or in part on the extinction retrieval are assigned a product maturity level of provisional. Those products derived from the layer detection and scene classification algorithms only are designated as ValStage1.
The CALIPSO Cloud and Aerosol Layer Products are built around two tightly coupled data types. The first of these is a set of column properties, which describe the temporal and spatial location of the vertical column (or, for averaged data, curtain) of atmosphere being sampled. Column properties include satellite position data and viewing geometry, information about the surface type and lighting conditions, and the number of features (e.g., cloud and/or aerosol layers) identified within the column. For each set of column properties, there is an associated set of layer properties. These layer properties specify the spatial and optical characteristics of each feature found, and include quantities such as layer base and top altitudes, integrated attenuated backscatter, layer-integrated volume depolarization ratio, and optical depth. Below we provide brief descriptions of each of the column properties and the layer properties. Where appropriate, we also provide an assessment of the quality and accuracy of the data in the current release.
The layer products are generated at three different spatial resolutions.
The 1/3 km layer products report cloud detection information obtained at the highest spatial resolution of the lidar: 1/3 km horizontally and 30-m vertically. Due to constraints on CALIPSO's downlink bandwidth, this full resolution data is only available from ~8.3 km above mean sea level, down to -0.5 km below sea level.
The 1 km layer products report cloud detection information obtained at a horizontal resolution of 1 km, over a vertical range extending from ~20.2 km above mean sea level, down to -0.5 km below sea level.
The 5 km layer products report (separately) cloud and aerosol detection information on a 5 km horizontal grid. At present there is no separate stratospheric data product. Stratospheric features are recorded in the 5 km aerosol product.
Users should be aware that while the 5 km layer products are reported on a uniform 5 km grid, the amount of horizontal averaging required to detect a layer may exceed 5 km. For example, detection of subvisible cirrus during daylight operations may require averaging to 20 km or even 80 km horizontally. In these cases, the layer properties of the feature detected are replicated as necessary to span the full extent of the averaging interval required for detection. For example, the layer properties for an aerosol layer that could only be detected after averaging over 20 km horizontally will be repeated over four consecutive 5 km columns.
The fundamental data product provided by the CALIPSO layer products is the vertical location of cloud and aerosol layer boundaries. All other layer properties -- e.g., integrated attenuated backscatters and layer two-way transmittances -- are computed with reference to these boundaries. To make proper use of the CALIPSO layer products, all users must be aware of the uncertainties inherent in the fully automated recognition of layer boundaries. Note too that clouds and aerosols are reported separately in the CALIPSO layer products. Therefore, to obtain a complete representation of all features detected within any region, users must use both the cloud and the aerosol layer products.
In the remainder of this document we provide brief descriptions of the individual parameters contained within the layer products files. Accompanying these descriptions are qualitative summaries of the product maturity level. Where appropriate, specific quality flags are included in the data products, and these too are described in some detail. The data descriptions are grouped into several major categories, as follows:
The CALIOP surface detection routine uses a digital elevation map (DEM), GTOPO30 as the starting point in its search for the lidar surface echo, and thus the reliability of the lidar surface elevations depends to some extent on the accuracy of the information recorded in GTOPO30. The GTOPO30 data is very reliable over oceans, but can be considerably less so in rugged terrain, such as in the Andes mountains of Peru, and over the polar regions. Note too that due to aberrations in the signal caused by a non-ideal transient response in the 532 nm detectors, the geometric thickness associated with the lidar surface elevation (i.e., surface top - surface base) can be extremely misleading. This non-ideal transient response must be carefully considered whenever the (apparent) subsurface portions of the lidar signals analyzed
Bits 1, 2, and 3 indicate the horizontal resolution at which the surface was detected:
Bits 4 and 5 are not used and are set to zero. Taken together, bits 6, 7, and 8 report the 5-km detection frequency:
| Figure 1: Distribution of γ′column at 532 nm |
|
The CALIOP layer detection algorithm used for the Version 1 and Version 2 data releases is described in detail in Vaughan et al., 2009 and in the CALIPSO Feature Detection ATBD (PDF). For Version 3, an additional refinement has been incorporated into the base determination procedure. Under certain conditions, described here, the initial estimate of base altitude for those layers identified as aerosols will be extended to a new, lower altitude located 90 m above the local surface. The Layer Base Extended flag identifies those layers for which the base altitude has been altered by this procedure.
The uncertainties associated with detection of cloud and aerosol layers in backscatter lidar data are examined in detail in Section 5 of the CALIPSO Feature Detection ATBD (PDF). The ATBD contains quantitative assessments of feature finder performance derived using simulated data sets, for which all layer boundaries were known exactly. In the real world of layer detection, we do not have access to this underlying truth. Therefore in this document we provide the following set of "rules of thumb" that users can apply to the data products to obtain a qualitative understanding of the layer boundaries reported, and of the optical properties associated with these layers.
Strongly scattering features are easier to detect than weakly scattering features. The scattering intensity of each layer is reported in the 532 nm and 1064 nm attenuated backscatter statistics and by the integrated attenuated backscatter at 532 nm and 1064 nm.
Detection of layers during the nighttime portion of the orbits is more reliable than during the daytime portion of the orbits. Due to solar background signals, the noise levels in the daytime measurements are much larger than those at night, and this additional noise can obscure faint features, and can lead to boundary detection errors even in more strongly scattering layers.
Features become increasingly difficult to detect with increasing optical depth above feature top. Put another way, detection of the lower layers in a multi-layer scene is made more difficult by the signal losses that occur as the laser light passes through the upper layers. (In a sense, this is a restatement of (a), since the backscatter intensity of secondary features is reduced from what it otherwise might be by the signal attenuation caused by the overlying features.) The Overlying Integrated Attenuated Backscatter and the Layer Integrated Attenuated Backscatter QA factor serve as proxies for the optical depth above each feature, and thus provide qualitative assessments of the confidence that users should assign to the reported layer properties.
In general, our confidence in the location of the top of a layer is somewhat greater than our confidence in the location of the base of the same layer. For transmissive features, one reason for this is that the backscatter signal is attenuated by traversing the feature, thus degrading the potential contrast between feature and "non-feature" at the base. Additionally, in strongly scattering layers, multiple scattering effects and signal perturbations introduced by the non-ideal transient response of the 532 nm detectors can also make base determination less certain.
The Opacity Flag is used to indicate features that completely attenuate the backscatter signal. For these features, the base altitude reported must be considered as an "apparent" base rather than a true base.
In those cases where the layer base has been extended to 90 m above the local surface, the assumption is that extended region contains aerosol that lies below the detection limits of the standard algorithm. The resulting increase in aerosol optical depths indicates that this procedure is appropriate far more often than not.
Stratospheric features reported during daylight -- especially those reported above 20 km between 60N and 60S -- are often noise artifacts and should be treated with suspicion.
Interpretation of the number of layers found parameter is straightforward for the 1-km and 1/3-km layer products: individual layers are always separated by regions of "clear air", and layer boundaries never overlap in the vertical dimension. However, this simplicity of interpretation does not always carry over into the 5-km cloud and aerosol layer products. CALIPSO uses a nested multi-grid feature finding algorithm (see the layer detection ATBD), and thus the search for layer boundaries is conducted at multiple horizontal averaging resolutions. While the 1-km and 1/3-km layer products report only those features detected at, respectively, averaging resolutions of 1-km and 1/3-km, the 5-km products report layers detected at multiple averaging resolutions (5-km, 20-km, and 80-km in the version 3 products). Because the reporting resolution (5-km) is not always identical to the detection resolution, layers may appear to overlap in the vertical dimension.
Figure 2 shows a wholly fictitious but heuristically useful schematic of layer detection results for a data segment extending 80-km horizontally and 465-m vertically. Yellow/orange/brown colors indicate an aerosol layer detected at horizontal averaging resolutions of, respectively, 80, 20 or 5 km. Shades of blue likewise represent clouds detected at 80, 20, and 5 km resolutions. The white regions are (presumably) clear air, where no features were found. The labeled rows at the bottom indicate the 'number of layers found' that will be reported in the cloud and aerosol layer products for each 5-km column.
| Figure 2: interpreting the number of layers parameters |
![]() |
In column 16, the layer labeled F5 (top altitude = 0.285 km, base altitude = 0.165 km) appears to vertically overlap F6 (top altitude = 0.255 km, base altitude = 0.135 km), which in turn appears overlap F7 (top altitude = 0.225 km). However, F5 was detected at an averaging resolution of 5-km, and hence the backscatter data that comprises F5 is removed from consideration before construction the 20-km horizontally averaged profile in which F6 was detected. Similarly, the backscatter data from both F5 and F6 were removed from consideration before constructing the 80-km averaged profile in which F7 was detected. Layers detected at higher spatial resolutions are thus seen to overwrite, rather than overlap apparently collocated layers detected at coarser spatial resolutions. More details can be found in the Feature Detection and Layer Properties ATBD (PDF) and in Vaughan et al., 2009.
For the uppermost layer in any column, the quality of the estimate for γ′532 is determined by the accuracy of the top and base identification, the reliability of the 532 nm channel calibrations, and by the signal-to-noise ratio (SNR) of the backscatter data within the layer. For layers beneath the uppermost, the quality of our estimate for γ′532 also depends on either obtaining an independent estimate of the two-way transmittance, T2, for all overlying layers, or by estimating this quantity directly from the lidar backscatter data. In those situations where an extended region of clear air exists between successive layers, and where the uppermost layer has no more than a moderate optical depth of -- say -- between 0.4 and 2.0, T2 can be estimated directly from the attenuated backscatter data (albeit with some uncertainty due to noise and the possibility of aerosol contamination of the clear air regions). Otherwise, the only way to estimate T2 is to compute a full extinction retrieval for the profile being examined. In this case, additional error can be introduced into the estimate of γ′532 by uncertainties in the approximation of the lidar ratio(s) for the overlying layer(s). Furthermore, the effects of errors caused by misestimating T2 can increase sharply as the optical thickness above a layer increases. For the 5 km layer products, the CALIOP processing scheme always attempts to correct estimates of γ′532 for the attenuation imparted by previously identified overlying features. As a consequence, we will occasionally report unrealistically large values for γ′532 in the 5 km layer products. However, because extinction solutions are only derived for data averaged to a 5-km (or greater) resolution, the γ′532 values reported in the 1 km and 1/3 km layer products are not corrected for the signal attenuation effects imparted by overlying layers.
The values reported for γ′532 should always be positive, and for the results derived directly from the layer detection algorithm (i.e., in the 1 km and 1/3 km layer products) this is indeed always true. However, in the 5 km products there are certain rare and pathological cases where a secondary layer could only be detected after averaging to 20 km or even 80 km horizontally, and where the overlying layers were detected at 5 km and have vastly different optical depths. In these cases, integrating the reaveraged data within the secondary layer will occasionally yield a negative γ′532. Such layers can be identified by a special CAD score of 105. All measured and derived optical properties for these layers are unreliable, and should be ignored. In evaluating the reliability of the spatial properties of these layers, users should carefully consider the layer IAB QA factor.
As is the case for γ′532, in the uppermost layer within any column, the quality of the estimate for γ′1064 is determined by the accuracy of the top and base identification, the reliability of the 1064 nm calibration constant, and by the signal-to-noise ratio (SNR) of the backscatter data within the layer. However, unlike the measurements at 532 nm, reliable estimates of T2 cannot be derived from an analysis of the 1064 nm backscatter signal in the (assumed to be) clear air regions, and thus in the 5 km products, the T2 corrections for the attenuation from overlying layers are always obtained from an extinction solution that uses prescribed values of the lidar ratios for all overlying layers. As is the case at 532 nm, no T2 corrections are applied to the γ′1064 values reported in the 1 km and 1/3 km layer products. Furthermore, because the CALIOP layer detection algorithm typically examines only the 532 nm backscatter signals, negative (i.e., non-physical) values may occasionally be reported for γ′1064 in all resolutions of the layer products. Unlike the layers for which γ′532 is negative, layers with negative γ′1064 are not indicated by a special CAD score. Negative values of γ′1064 occur most often for very weakly scattering layers (e.g., subvisible cirrus and faint aerosols) and in those layers for which the backscatter signal has been highly attenuated by other, overlying layers.
The quality of the estimate for δv is determined by the accuracy of the top and base identification, the reliability of the polarization gain ratio calibration, and by the signal-to-noise ratio (SNR) of the backscatter data within the layer. In general, the CALIOP δv estimates are highly reliable. Histograms of δv compiled for midlatitude cirrus in the northern hemisphere compare very well with previously reported distributions, e.g., Sassen & Benson, 2001 (PDF).
In regions with acceptable SNR, the accuracy with which the range resolved depolarization ratios can be determined will depend almost entirely on the accuracy of the polarization gain ratio calibration.
Users can have high confidence in the calculation of all of the values in the depolarization ratio statistics fields. However, the meaning of these numbers can be somewhat obscure. This is because each of the range resolved depolarization ratios within any layer is the ratio of two noisy numbers. Especially where the feature is relatively faint, and in regions of low SNR, data values in both the numerator (the 532 nm perpendicular channel) and the denominator (the 532 nm parallel channel) can randomly and independently approach zero, which in turn can generate extremely large or extremely small (and even non-physical) depolarization ratios. When computing layer means, standard deviations, and centroids, these values can dominate the calculation, and thus return entirely unrealistic estimates. When assessing the depolarization ratio that characterizes a layer, δv and the layer median are both more reliable indicators than the mean.The quality of the estimate for χ′layer is determined by the accuracy of the top and base identification, the reliability of the 532 nm calibration constant and the 1064 nm calibration constant, and by the signal-to-noise ratio (SNR) of the backscatter data within the layer. For the 5 km layer products, the attenuated backscatter coefficients used in the calculation of χ′layer are corrected for the estimated overlying two-way transmittance. No such correction is attempted for the 1 km and 1/3 km values, as no extinction solution is computed at these resolutions.
Users can have high confidence in the calculation of all of the values in the attenuated total color ratio statistics fields. However, as with the 532 nm depolarization ratio statistics, the meaning of the various numbers can be somewhat misleading. Like the depolarization ratios, the attenuated total color ratios are produced by dividing one noisy number (the 1064 nm attenuated backscatter coefficient) by a second noisy number (the 532 nm attenuated backscatter coefficient). Depending on the noise in any pair of samples, the resulting values can range from large negative values to extremely large positive values. When computing layer means, standard deviations, and centroids, these outliers can dominate the calculation, and thus return entirely unrealistic estimates.
Because all features reported in 1/3-km and 1-km layer products are detected at a single horizontal averaging resolution (i.e., either at 1/3-km or 1-km), the opacity flag is not reported. When using these products, opacity, in the sense described above, can be assessed as follows. If the surface was detected (i.e., the lidar surface altitude field does not contain fill values) then there are no opaque layers in the column. If the surface was not detected, then the lowest layer in the column is considered to be opaque.
For the vast majority of cases, CALIOP cannot provide a direct measurement of layer optical depth. In these cases, estimates of optical depth are derived using extinction-to-backscatter ratios (i.e., lidar ratios) that are specified based on an assessment of layer type and subtype. Uncertainties in the value of the lidar ratio, which can arise both from natural variability and from occasional misclassification of layer type, propagate non-linearly into subsequent estimates of layer optical depth.
Retrievals of optical depth from space-based lidar measurements must account for contributions from multiple scattering that are generally considered negligible in ground-based and aircraft based measurements. The theoretical basis for CALIPSO's treatment of multiple scattering is provided in the extinction retrieval ATBD (PDF) and in Winker, 2003 (PDF)
Similar to the layer detection problem, estimates of layer optical depth become increasingly fraught with error in multiple layer scenes, as errors incurred in overlying layers are propagated into the solutions derived for underlying features.
IMPORTANT NOTICE: before proceeding, all users of the CALIOP optical depth data should read and thoroughly understand the information provided in the Profile Products Data Quality Summary. This summary contains an expanded description of the extinction retrieval process from which the layer optical depths are derived, and provides essential guidance in the appropriate use of all CALIOP extinction-related data products.
Despite these caveats, users should not be unduly pessimistic about the quality and usability of the CALIPSO optical depth estimates. Figure 3 (below) shows a preliminary comparison of CALIPSO version 3 aerosol optical depths with the optical depths derived from MODIS for all daytime measurements acquired during January 2007. The comparison is generally good, with MODIS appearing to slightly over-estimate values at the lower end of the optical depth range.
|
Figure 3: Comparison of CALIPSO aerosol optical depths to those derived
from MODIS (Preliminary - January 2007, daytime data only) final lidar ratio = initial lidar ratio only) |
![]() |
Calculation of the layer optical depth uncertainty is an iterative process. On some occasions when the SNR is poor, or an inappropriate lidar ratio is being used, the iteration will attempt to converge asymptotically to positive infinity. Whenever this situation is detected, the iteration is terminated, and the layer optical depth uncertainty is assigned a fixed value of 99.99. Any time an uncertainty of 99.99 is reported, the extinction calculation should be considered to have failed. The associated optical depths cannot be considered reliable, and should therefore be excluded from all science studies.
Note: optical depth uncertainties are reported as absolute errors, not relative errors.
| Initial lidar ratios used in the version 3.01 extinction solver | |||
|---|---|---|---|
| Type | Subtype | Initial 532 nm lidar ratio | Initial 1064 nm lidar ratio |
| cloud | water | 19 ± 10 sr | N/A |
| cloud | ice | 25 ± 10 sr | N/A |
| cloud | unknown phase | 22 ± 11 sr | N/A |
| aerosol | marine | 20 ± 6 sr | 45 ± 23 sr |
| aerosol | desert dust | 40 ± 20 sr | 55 ± 17 sr |
| aerosol | polluted continental | 70 ± 25 sr | 30 ± 14 sr |
| aerosol | clean continental | 35 ± 16 sr | 30 ± 17 sr |
| aerosol | polluted dust | 55 ± 22 sr | 48 ± 24 sr |
| aerosol | biomass burning | 70 ± 28 sr | 40 ± 24 sr |
| stratospheric | all | 25 ± 10 sr | 25 ± 10 sr |
The aerosol lidar ratios used for the CALIOP analyses represent well established mean values that are characteristic of the natural variability exhibited for each aerosol species (e.g., see Omar et al., 2005 (PDF); Cattrall et. al., 2005; and Figure 4 below). The clear implication of this natural variability is that even for those cases where the aerosol type is correctly identified, the initial lidar ratio represents an imperfect estimate of the layer-effective lidar ratio of any specific aerosol layer. These same caveats apply equally to the mean values used for the initial cloud lidar ratios. For all layer types, cloud-aerosol discrimination errors can exacerbate the error associated with the specification of the initial lidar ratio. Uncertainty can also be introduced by the cloud ice-water phase classification and the aerosol subtype identification procedures. However, the CALIOP extinction algorithm incorporates some error-correcting mechanisms that in many cases will adjust the initial estimate of lidar ratio so that a more suitable value is ultimately used in the retrieval. Details of the lidar ratio adjustment scheme are provided in the extinction retrieval ATBD (PDF). Algorithm architectural information and generalized error analyses for CALIPSO's cloud-aerosol discrimination algorithms, cloud ice-water phase algorithms, and aerosol subtyping algorithms can be found in the CALIPSO Scene Classification ATBD (PDF).
| Figure 4: Distributions for AERONET-derived lidar ratios, computed for aerosol types described in Omar et al., 2005 (PDF); | ||
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
The version 3.0 aerosol subtyping scheme is different from the previous versions. Two of the aerosol models, dust and polluted dust, discussed in Omar et al. (2009) and used in prior versions have been updated in light of the latest advances in the science. Recent measurements of size distributions of dust aerosol during NAMMA and T-Matrix calculations of the phase functions allowed a more realistic estimate of the dust lidar ratio at 1064 nm which, thus far, was based on a single measurement during SAFARI 2000. The dust phase function (squares in Figure 4a) is determined by T-Matrix calculations using NAMMA measurements of size distributions and refractive indices and the smoke phase functions (circles in Figure 4a) are determined from Mie calculations using size distributions and refractive indices of the biomass burning cluster of the AERONET measurements. The dust lidar ratios determined partly using NAMMA measurements are 40 sr and 55 sr at 532 nm and 1064 nm, respectively. The 55 sr at 1064 nm is a significant departure from the 30 sr used to calculate 1064 nm extinction coefficients in previous versions.
Since the polluted dust model is built from a smoke fine component and a dust coarse component, the above adjustment in the dust model is also reflected in the polluted dust model. A composite phase function (Figure 5b) of dust coarse mode and smoke fine mode from the individual phase functions is shown in Figure 5a. The resulting polluted dust lidar ratios are 55 sr at 532 nm and 48 sr at 1064 nm. The former is a departure from the old values of 65 sr at 532 nm. This is because in the original model most of the polluted dust aerosol was comprised of smoke, while this new model, partly based on NAMMA observations, apportions a significant surface area to the coarse mode dominated by dust. The old model used Mie calculations to generate polluted dust phase functions and lidar ratios while the new model uses Mie model calculations for the fine mode (smoke) and T-Matrix calculations for the coarse mode (dust) to generate the phase functions and lidar ratios.
| Figure 5: (a) Smoke and Dust phase functions determined by Mie and T-Matrix calculations respectively, and (b) composite dust and smoke phase functions representative of the polluted dust aerosol model | |
![]() |
![]() |
For weakly scattering features, the lidar ratio is most often left unchanged by the extinction solver, as a physical solution is usually obtained on the first iteration. In these cases, the uncertainties in the final lidar ratio are the same as the uncertainties in the initial lidar ratio. The exception to this statement would be if either the cloud-aerosol discrimination or the layer sub-typing procedures have misclassified the layer. However, for weak layers, the relative error in the lidar ratio is (approximately) linearly related to the resulting error in the derived optical depth estimate.
Retrievals of opaque and strongly scattering layers are very sensitive to the initial lidar ratio selection. Too large a value will cause the retrieval algorithm to, in effect, extinguish all available signal before reaching the measured base of the feature. When that point is reached the retrieval becomes numerically unstable and the calculated extinction coefficients will asymptote toward positive infinity. In these cases, a successful solution can only be obtained by reducing the lidar ratio. The CALIOP extinction routine does this automatically, and will repeat the process until a stable solution is achieved. When this happens, the final lidar ratio reported in the data products is the first one for which a physically meaningful (albeit not necessarily correct) solution was obtained for the entire measured depth of the layer.
The optical depths and extinction profiles derived in those cases where the layer lidar ratio must be reduced are generally not accurate. The current lidar ratio reduction scheme terminates after identifying (a very close estimate of) the largest lidar ratio for which a physically meaningful solution can be generated for the backscatter measured in the layer. However, the optical depths and extinction profiles reported in these situations can only be considered as upper bounds; the true values are somewhat, or perhaps even significantly, lower. Because the associated optical depth uncertainties cannot be reasonably estimated, these data should be excluded from statistical analyses of layer optical properties, and even the most sophisticated users are advised to treat these cases with extreme caution.
When an independent estimate of layer optical depth is available from a measured layer two-way transmittance, the CALIOP extinction algorithm will retrieve the optimal estimate of the layer-effective lidar ratio, irrespective of layer type, and use this retrieved lidar ratio in the extinction retrieval. These so-called 'constrained' retrievals are more accurate than unconstrained retrievals. For constrained retrievals, the uncertainty in the final lidar ratio can be well estimated using equation 7.4 from the CALIPSO Scene Classification ATBD (PDF). An extinction QC value of 1 indicates a successful constrained retrieval.
| Value | Method |
|---|---|
| 0 | not determined |
| 1 | constrained retrieval (using two-way transmittance) |
| 2 | based on cloud phase |
| 3 | based on aerosol species |
| 99 | fill value |
Ice clouds: for the CALIOP viewing geometry, simulations show multiple scattering effects are nearly independent of range and, as parameterized in the CALIOP retrieval algorithm (Winker et al. 2009), are nearly independent of extinction. In Version 2, ice clouds were assigned a range-independent multiple scattering factor of η532 = 0.6. Validation comparisons indicate this is an appropriate value and the same value is used in Version 3.
Water clouds: in Version 2 the multiple scattering factor for water clouds was set to unity, resulting in large errors in retrievals of extinction and optical depth. In Version 3, a value of η532 = 0.6 is used. Based on Monte Carlo simulations of multiple scattering, this value appears to be appropriate for semitransparent water clouds (τ < 1). (It is purely coincidental this is the same value used for ice clouds.) For denser water clouds (τ > 1) the multiply-scattered component of the signal becomes much larger than the single-scattered component, η532 becomes dependent on both cloud extinction and range into the cloud, and the retrieval becomes very sensitive to errors in the multiple scattering factor used. In these cases the multiple scattering cannot be properly accounted for in the current retrieval algorithm and retrieval results are unreliable.
Aerosols: simulations of multiple scattering effects on retrievals of aerosol layer optical depth indicate the effects are small in most cases. There is uncertainty in these estimates, however, due to poor knowledge of aerosol scattering phase functions. Validation comparisons conducted to date do not indicate significant multiple scattering effects on aerosol extinction profile retrievals. Multiple scattering effects may become significant in dense aerosol layers (σ > 1 /km), but in these cases retrieval errors are usually dominated by uncertainties in the lidar ratio or failure to fully penetrate the layer. In Version 3, as in Version 2, multiple scattering factors for both wavelengths are set to unity.
While the volume depolarization ratio is a direct measurement, the layer integrated 532 nm particulate depolarization ratio, δp, is a post-extinction quantity, calculated from ratio of the layer integrated perpendicular and parallel polarization components of particulate backscatter coefficient within the layer, using
![]() |
Here β⊥,P and β||,P are the perpendicular and parallel components of particulate backscatter coefficient at 532 nm, respectively.
The quality of the estimate for δp is determined not only by the SNR of the backscatter measurements in parallel and perpendicular channels, but also the accuracy of the range-resolved two-way transmittance estimates within the layer. The two-way transmittances due to molecules and ozone can be well characterized via the model data obtained from the GMAO. The two-way transmittances due particulates, however, are only as accurate as the CALIOP extinction retrieval. Opaque cirrus cloud layers can be particularly prone to errors in the particulate depolarization ratio, as very large attenuation corrections are applied to the weak signals at the base of the layers, and on those occasions where one channel or the other becomes totally attenuated, this situation can generate very large, negative particulate depolarization ratio estimates. For layers that are not opaque, δp is generally reliable. However, in weakly scattering layers, the quality of the daytime estimate can be degraded by a factor of 2-4 due to the larger background noise compared with the nighttime estimate.
Based on a one month test data set (January 2007), the median particulate depolarization ratio uncertainties in the aerosol layer products is typically ~0.04 and ~0.16 for nighttime and daytime measurements, respectively.
Note: both in the layer products and the profile products, particulate depolarization ratio uncertainties are reported as absolute errors, not relative errors.
![]() |
Much like the integrated attenuated total color ratio, the quality of χp is governed by the accuracy to which layer top and base altitudes are determined and by the signal-to-noise ratios of the backscatter data within the layer. Additionally, since all of the βP,λ(z) values are derived by the Hybrid Extinction Retrieval Algorithm (HERA), the quality of χp also depends on the success of the HERA profile solver in deriving accurate solutions for βP,λ(z). As such, the quality of χp can be partially assessed via the extinction QC flags which report the final state of the HERA solution attempt. In general, solutions where the final lidar ratio is unchanged (extinction QC = 0) or the extinction solution is constrained (extinction QC = 1) yield physically plausible solutions more often. Conversely, solutions tend to be more uncertain in those cases where the lidar ratio for either wavelength must be reduced.
Note: Uncertainties for layer-integrated particulate backscatter color ratios are reported as absolute errors, not relative errors.
Users can have high confidence in the calculation of all the values in the attenuated particulate color ratio statistics fields. However, particulate color ratios are produced by dividing one noisy number (the 1064 nm mean particulate backscatter coefficient) by a second noisy number (the 532 nm mean particulate backscatter coefficient), resulting in values that can range from large negative values to extremely large positive values, depending on the noise in any pair of samples. When computing layer means, standard deviations, and centroids, these outliers can dominate the calculation, and thus return entirely unrealistic estimates. Therefore, the integrated particulate ratio characterizes the particulate color ratio of a layer more reliably than does the mean value of the individual particulate color ratios within a layer.
In the current release (version 3), the CAD algorithm uses newly developed five-dimensional (5D) probability density functions (PDFs), rather than the three-dimensional (3D) PDFs used in previous versions. In addition to the parameters used in the earlier 3D version of the algorithm (layer mean attenuated backscatter at 532 nm, layer-integrated attenuated backscatter color ratio, and altitude), the new 5D PDFs also include feature latitude and the layer-integrated volume depolarization ratio. Detailed descriptions of the CAD algorithm can be found in Sections 4 and 5 of the CALIPSO Scene Classification ATBD (PDF). Enhancements made to incorporate the 5D PDFs used in version 3 release are described in Liu et al., 2010 (PDF). For further information on the CAD algorithm architecture and the three-dimensional (3D) PDFs used in versions 1 and 2 of the data products, please see Liu et al., 2004 (PDF) and/or Liu et al., 2009.
The standard CAD scores reported in the CALIPSO layer products range between -100 and 100. The sign of the CAD score indicates the feature type: positive values signify clouds, whereas negative values signify aerosols. The absolute value of the CAD score provides a confidence level for the classification. The larger the magnitude of the CAD score, the higher our confidence that the classification is correct. An absolute value of 100 therefore indicates complete confidence. Absolute values less that 100 indicate some ambiguity in the classification; that is, the scattering properties of the feature are represented to some degree in both the cloud PDF and in the aerosol PDF. In this case, a definitive classification cannot be made; that is, although we can provide a "best guess" classification, this guess could be wrong, with a probability of error related to the absolute value of the CAD score. A value of 0 indicates that a feature has an equal likelihood of being a cloud and an aerosol. Users are encouraged to refer to the CAD score when the cloud and aerosol classification results are used and interpreted.
Beginning with the version 2.01 release, several "special" CAD score values have been added. These are listed in the table below. Each of these new values represents a classification result that is based on additional information beyond that normally considered in the standard CAD algorithm.
| CAD score | Interpretation |
|---|---|
| -101 | negative mean attenuated backscatter encountered; layer is most likely an artifact, and its spatial and optical properties should be excluded from all science analyses. |
| 101 | initially classified as aerosol, but layer integrated depolarization mandates classifying layer as cloud (version 2 only; obsolete in version 3) |
| 102 | layer exhibits very high integrated backscatter and very low depolarization characteristic of oriented ice crystals (version 2 only; obsolete in version 3) |
| 103 | layer integrated attenuated backscatter at 532 nm is suspiciously high; feature authenticity and classification are both highly uncertain |
| 104 | layers with CAD scores of 104 are boundary layer clouds that were found to be opaque at the initial 5-km horizontal averaging resolution used by the layer detection algorithm; however, these layers are not uniformly filled with high-resolution clouds (i.e., layers detected at a 1/3-km horizontal resolution), and the 532 nm mean attenuated backscatter coefficient of the data that remains after cloud clearing is negative. Studies examining the spatial properties and distributions of clouds can safely include the spatial properties of these layers; however, the associated measured and derived optical properties should be excluded from all science studies. |
| 105 | a CAD score of 105 designates a layer detected at one of the coarser averaging resolutions (20-km or 80-km) for which the initial estimates of measured properties have been negatively impacted by either (a) the attenuation corrections applied to account for the optical depths of overlying layers, or (b) the extension of the layer base altitude |
| Bit | Value | Interpretation |
|---|---|---|
| 1 | 0 | unconstrained retrieval; initial lidar ratio unchanged during solution process |
| 1 | 1 | constrained retrieval |
| 2 | 2 | Initial lidar ratio reduced to prevent divergence of extinction solution |
| 3 | 4 | Initial lidar ratio increased to reduce the number of negative extinction coefficients in the derived solution |
| 4 | 8 | Calculated backscatter coefficient exceeds the maximum allowable value |
| 5 | 16 | Layer being analyzed has been identified by the feature finder as being totally attenuating (i.e., opaque) |
| 6 | 32 | Estimated optical depth error exceeds the maximum allowable value |
| 7 | 64 | Solution converges, but with an unacceptably large number of negative values |
| 8 | 128 | Retrieval terminated at maximum iterations |
| 9 | 256 | No solution possible within allowable lidar ratio bounds |
| 16 | 32768 | Fill value or no solution attempted |
The bit assignments are additive, so that (for example) an extinction QC value of 18 represents an unconstrained retrieval (bit 1 is NOT set) for which the lidar ratio was reduced to prevent divergence (+2; bit 2 is set), and for which the feature finder has indicated that the layer is opaque (+16; bit 5 is set). For the version 2.01 release, bits 10-15 are not used. Complete information about the conditions under which each extinction QC bit is toggled can be found in the CALIPSO Extinction Retrieval ATBD (PDF)
Correct interpretation of the feature subtype bits depends on the status of the feature type; e.g., the interpretation is different for clouds and aerosols. For aerosols, the feature subtype is one of eight types: desert dust, biomass burning, background, polluted continental, marine, polluted dust, other, and 'not determined'. Desert dust is mostly mineral soil. Biomass burning is an aged smoke aerosol consisting primarily of soot and organic carbon (OC), clean continental (also referred to as background or rural aerosol) is a lightly loaded aerosol consisting of sulfates (SO42-), nitrates (NO3-), OC, and Ammonium (NH4+), polluted continental is background aerosol with a substantial fraction of urban pollution, marine is a hygroscopic aerosol that consists primarily of sea-salt (NaCl), and polluted dust is a mixture of desert dust and smoke or urban pollution. Extensive test data generated prior to the version 3 release revealed a negligibly minute number of layers with spurious lidar ratios and aerosol type designations. These trace layers are currently labeled 'not determined'. The 'other' designation is a place-holder for another, yet to be determined, aerosol type. While this set does not cover all possible aerosol mixing scenarios, it accounts for a majority of mesoscale aerosol layers. In essence the algorithm trades off complex transient multi-component mixtures for relatively stable layers with large horizontal extent (10-1000 km).
In Version 1 and 2 data, the base altitudes of optically thick aerosol layers were sometimes biased high due to lidar signal attenuation or signal perturbations, causing aerosol optical depth (AOD) underestimates. In Version 3, to compensate for this, layer base altitudes of aerosol layers meeting the following criteria (hereafter termed, 'extended layers') are lowered to three range bins (90 m) from the surface as reported by the lidar surface elevation. The criteria for aerosol layer base extension are:
These criteria attempt to ensure that only boundary-layer aerosol layers with useable signals beneath their original bases are extended. The possibility of introducing surface contamination in layer optical properties is reduced by assigning the extended base to an altitude 90 m above the local surface height. Hence, surface contamination in extended layers is minimal and confined to regions with rugged terrain. However, this also means that profiles within layers with extended bases always stop 90 meters above the surface. In four months of global test data, the base altitudes of 8.6% of all layers originally classified as aerosol were extended, with average base altitudes lowered by 0.54 km.
Layer descriptors are re-computed after base extension and each extended layer is re-analyzed by the scene classification algorithms to assign feature type, subtype, and CAD Score. Consequently, the feature type or subtype of these layers may change since their optical properties have changed. Hence, layer base extended descriptors are populated with the feature classification flags of these layers prior to base extension so their previous type and subtype can be discerned. In four months of test data, 17% of extended aerosol layer subtypes changed due to base extension. Additionally, 12% of all extended aerosol layers were reclassified as cloud layers, accounting for 0.3% of all cloud layers. This typically occurs in scenes with low SNR, or when surface contamination is suspected. Extended aerosol layers which are reclassified as cloud layers tend to have very low CAD scores before and after base extension (65% have |CAD score| less than 10), so their type was never certain to begin with. These layers cannot be used with confidence. Conversely, 85% of extended layers have |CAD score| > 90 when the type does not change (extended aerosol layer remains an aerosol layer). Users are advised to consult CAD scores to assess confidence in all feature types.
Impact on Version 3 Aerosol Optical Depths
A focused study of two test days (2007-01-01 and 2007-08-27) found that the median optical depth of extended aerosol layers increased by 22%, resulting in a 1% AOD increase in all aerosol layers globally. The figure below shows the change in optical depth for extended aerosol layers (unconstrained retrievals) where the type or subtype did not change for the two test days. By design, optical depth values have increased due to the aerosol base extension algorithm.
|
Figure 6: Histogram of aerosol optical depth at 532 nm with and without base extension for all extended aerosol layers in the two day test. |
![]() |
γ′above provides a qualitative assessment of the confidence that users should assign to each layer reported. As noted earlier (see the discussion for layer base and top heights), layer detection, and the assessment of the associated layer descriptors, becomes increasingly uncertain as the overlying optical depth increases. This uncertainty cannot be easily quantified, because backscatter lidars such as CALIOP cannot measure optical depth directly, and must instead derive optical depth estimates in subsequent data processing. However, γ′above can easily be obtained directly from the calibrated backscatter signal, and hence can provide a qualitative proxy for the optical depth above each layer detected.
| Lidar Level 2 Cloud and Aerosol Layer Information Half orbit (Night and Day) lidar cloud and aerosol layer products describe both column and layer properties |
|||
|---|---|---|---|
| Release Date | Version | Data Date Range | Maturity Level |
| December 2011 | 3.02 | November 1, 2011 to present |
|
| May 2010 | 3.01 | June 13, 2006 to February 16, 2009 March 17, 2009 to October 31, 2011 |
|
The CALIPSO Team is releasing Version 3.02 which represents a transition of the Lidar, IIR, and WFC processing and browse code to a new cluster computing system. No algorithm changes were introduced and very minor changes were observed between V 3.01 and V 3.02 as a result of the compiler and computer architecture differences. Version 3.02 is being released in a forward processing mode beginning November 1, 2011.
Version 3.01 of the Lidar Level 2 data products is a significant improvement over previous versions. Major code and algorithm improvements include
In addition to the numerous algorithm updates, several new parameters have been added to the layer products. These include
The sections below highlight important changes to the layer detection, scene classification, and extinction algorithms that have implications for the overall quality of the Lidar Level 2 data products.
| Figure 7: Histograms of CAD scores for Version 2 (red) and Version 3 (blue) |
![]() |
| Figure 8: Relation between CAD score and Layer IAB QA Factor |
![]() |
Dense aerosol layers (primarily very dense dust and smoke over and close to the source regions), which are sometimes labeled as cloud in the V2 release, are now correctly identified as aerosol, largely because of the addition of the integrated volume depolarization ratio to the diagnostic parameters used for cloud-aerosol discrimination. In addition, in the open oceans, dense aerosols that were previously classified as clouds are now frequently observed in the marine boundary layer. Improvements are also seen for these maritime aerosols. Note, however, dense dust/smoke layers found at single-shot (0.333 km) resolution will be classified as cloud by default. This issue will be revisited for post-V3 releases.
Because the V2 CAD algorithm used a latitude-independent set of 3D PDFs, a class of optically thin clouds encountered in the polar regions that can extend from the surface to several kilometers were sometimes misclassified as aerosols. In version 3, these features are now correctly classified as cloud.
Correct classification of heterogeneous layers is always difficult. An example of a heterogeneous layer would be an aerosol layer that is vertically adjacent to a cloud or contains an embedded cloud, but which is nonetheless detected by the feature finder as a single entity in the V2 release. By convention, heterogeneous layers should be classified as clouds. The version 3 feature finding algorithm has also been improved greatly, and can now much better separate the embedded or adjacent single-shot cloud layers from the surrounding aerosol. This improvement in layer detection contributes significantly to the improvement of the CAD performance.
Some so-called features identified by the layer detection scheme are not legitimate layers, but instead are artifacts due to the noise in the signal, multiple scattering effects, or to artificial signal enhancements caused by non-ideal detector transient response or an over estimate of the attenuation due to overlying layers. These erroneous "pseudo-features" are neither cloud nor aerosol and are distributed outside of the cloud and aerosol clusters in the PDF space. The V3 CAD algorithm can better identify these outlier features by assigning a small CAD score (the bump between -10 and 20 in the V3 CAD histogram) and classify most of them as cloud by convention. A CAD threshold of 20 can effectively filter out these outliers.
The extinction products are produced by first identifying an aerosol type and then using the appropriate values of Sa and the multiple scattering factor, η(z). Note that multiple scattering corrections have not yet been implemented for the current data release, so that η(z) = 1 for all aerosol types. The accuracy of the Sa value used in the lidar inversions depends on the correct identification of the type of aerosol. In turn, the accuracy of the subsequent optical depth estimate depends on the accuracy of Sa.
The underlying paradigm of the type classification is that a variety of emission sources and atmospheric processes will act to produce air masses with a typical, identifiable aerosol 'type'. This is an idealization, but one that allows us to classify aerosols based on observations and location in a way to gain insight into the geographic distribution of aerosol types and constrain the possible values of Sa for use in aerosol extinction retrievals.
The aerosol subtype product is generated downstream of the cloud-aerosol discrimination (CAD) scheme and, therefore, depends on the cloud-aerosol classification scheme in a very fundamental way. If a cloud feature is misclassified as aerosol, the aerosol subtype algorithm will identify this 'aerosol' as one of the aerosol subtypes. The user must exercise caution where the aerosol subtype looks suspicious or unreasonable. Such situations can occur with some frequency in the southern oceans and the polar regions.The version 2 algorithm included a rudimentary ability to identify a specific subset of high confidence instances of HOI. These clouds were classified as ice clouds, and flagged with a 'special CAD score' of 102, indicating that they had been further classified as HOI. The new version 3 algorithm implements a much more sophisticated scheme for recognizing HOI that correctly identifies many more instances of these sorts of ice clouds. The special CAD score of 102 is no longer used to identify these layers. Instead, the "ice cloud" and "mixed phase cloud" classifications have been eliminated, and replaced as shown in the table below.
| Value | Version 2 Interpretation | Version 3 Interpretation |
|---|---|---|
| 0 | unknown/not determined | unknown/not determined |
| 1 | ice | randomly oriented ice (ROI) |
| 2 | water | water |
| 3 | mixed phase | horizontally oriented ice (HOI) |
The Ice/water Phase QA flags have also been redefined slightly for Version 3, as follows:
| Value | Version 2 Interpretation | Version 3 Interpretation |
|---|---|---|
| 0 | no confidence | no/low confidence |
| 1 | low confidence | phase based on temperature only |
| 2 | medium confidence | medium confidence |
| 3 | high confidence | high confidence |
A confidence flag of QA=1 indicates the phase classification is based on temperature. Initial classification tests are based on layer depolarization, layer-integrated backscatter, and layer-average color ratio. Layers classified as water with temperature less than -40 C are forced to ROI and given a confidence flag of QA=1. Layers classified as ROI or HOI with temperature greater than 0 C are forced to water and also given a confidence flag of QA=1. Clouds for which the phase is 'unknown/not determined' are assigned a confidence value of 0 (no/low confidence).
Layers classified as HOI based on anomalously high backscatter and low depolarization are assigned QA=3. These layer characteristics are rarely detected after the CALIOP viewing angle was changed to 3° in November 2007. The Version 3 algorithm computes the spatial correlation of depolarization and integrated backscatter, and uses this as an additional test of cloud phase. Layers classified as HOI using this test are assigned QA=2. The spatial correlation test is responsible for the majority of the layers classified as HOI. These layers typically have higher backscatter than ROI but similar depolarization, and are common even at a viewing angle of 3°. We interpret this as clouds with significant perpendicular backscatter from ROI but containing enough HOI to produce enhanced backscatter. These layers tend to be found at much colder temperatures than the high confidence HOI (see Hu et al. 2009).
PLEASE NOTE: Users of the CALIOP optical depths should read and thoroughly understand the information provided in the Profile Products Data Quality Summary. This summary contains an expanded description of the extinction retrieval process from which the layer optical depths are derived, and provides essential guidance in the appropriate use of all CALIOP extinction-related data products. Validation and improvements to the profile products QA are ongoing efforts, and additional data quality information will be included with future releases.