application of principal component analysis to high …web.gps.caltech.edu/~vijay/agu/vijay agu...

1
Application of Principal Component Analysis to High Spectral Resolution Radiative Transfer: A Case Study of the O 2 A band Abstract Radiative transfer computation is the rate-limiting step in most high spectral resolution remote sensing retrieval applications. While several techniques have been proposed to speed up radiative transfer calculations, they all suffer from accuracy considerations. We propose a new method, based on a principal component analysis of the optical properties of the system, that addresses these concerns. Taking atmospheric transmission in the O2 A band as a test case, we reproduced the radiance spectrum, obtained using the multiple scattering code DISORT, with an accuracy of 0.3%, while achieving an order of magnitude improvement in speed. Vijay Natraj 1 , Xun Jiang 1 , Run-lie Shia 1 , Xianglei Huang 2 , Jack S. Margolis 3 and Yuk L. Yung 1 ; 1 Department of Geology and Planetary Sciences, California Institute of Technology, 2 Program in Atmospheric and Oceanic Sciences, Princeton University, 3 RSA Systems, Altadena In our analysis, we seek an accurate and efficient characterization of near-infrared (NIR) absorption in the O 2 A band centered at 760 nm. We use a 23-level model atmosphere, obtained from the ECMWF data for a sub- tropical northern hemisphere (15°N) summer with 15 levels in the stratosphere and the remaining in the troposphere. The levels are spaced linearly in log(pressure) from 1 mbar to 1 bar. Rayleigh scattering by air molecules and scattering by aerosols are taken into account. The total aerosol optical depth and the surface albedo are assumed to be 0.05 and 0.2 respectively. The spectroscopic data are taken from the HITRAN2K line list . Two codes were used to generate the O 2 A band spectrum: the multi-stream, line-by-line multiple scattering code DISORT, and a multiple scattering code which uses only two streams (one up, one down), to be henceforth called TWOSTR. DISORT is on average two to three orders of magnitude slower than TWOSTR. Fig. 1 shows the correlation between the upwelling radiance spectra obtained from DISORT and TWOSTR at the top of the atmosphere (TOA), and the difference between the two calculations. It is clear that the TWOSTR spectrum has a very good correlation with that from DISORT. We observe that the variance in the radiance is much lower for the residual than for the radiances directly obtained from DISORT. We exploit this feature in the principal component analysis. Figure 1a: Correlation plot between DISORT and TWOSTR TOA reflectance spectra Figure 1b: Difference between TWOSTR and DISORT TOA reflectance spectra. The diamonds represent the P branch, and the crosses represent the R branch. The residuals have been plotted as a function of the DISORT reflectance to show systematic deviations more clearly. Figure 3a: EOF1. The figures on the left and the right show the first EOF for the optical depth and the single scattering albedo respectively. Figure 3b: PC1 Computed O 2 A band Spectrum A three-term Taylor expansion is used to compute the TOA radiance from the radiance computed for the mean optical properties and the EOFs and PCs. An error criterion of less than 1.0% was chosen for all but the most saturated lines, to simulate expected results for space-based detection [2]. The total number of monochromatic wavenumber grid points for the radiative transfer calculation was 10616; only 105 cases were needed to perform the principal component analysis. An order of magnitude improvement in speed is achieved. Fig. 5 shows the error in the computed spectrum with respect to a line by line calculation. Conclusion A novel technique based on principal component analysis has been introduced to increase the computational efficiency of radiative transfer calculations. Using the optical depth and the single scattering albedo as the EOF parameters, and taking advantage of the correlation between a two-stream and a multi-stream approach, the O 2 A-band spectrum was reproduced with an accuracy of 0.3% while achieving an order of magnitude speed improvement. Figure 4a: EOF2. The figures on the left and the right show the first EOF for the optical depth and the single scattering albedo respectively. Figure 4b: PC2 Figure 5: Error in the spectrum computed by principal component analysis. The root mean square error is less than a percent. When convolved with a typical instrument lineshape function, the error is of the order of a tenth of a percent. References [1] Kuang ZM, Margolis JS, Toon GC, Crisp D, Yung YL. Spaceborne measurements of atmospheric CO2 by high- resolution NIR spectrometry of reflected sunlight: An introductory study. Geophys Res Lett 2002; 29(15): 1716. [2] O'Brien, DM, Mitchell RM, English SA, da Costa GA. Airborne measurements of air mass from O2 A-band absorption spectra. J Atmos Ocean Tech 1998; 15(6): 1272. Model Description Multiple Scattering Codes The data set consists of optical properties in M atmospheric layers at N wavenumbers. The EOFs are unit eigenvectors of the mean-removed covariance matrix. The principal components (PCs), are the projections of the original data set onto the associated EOFs (scaled by the eigenvalue). The EOFs are just a new basis to represent the original data, so there is no loss of information provided that a complete set is used. A few EOFs are sufficient to reproduce nearly all the information. In practice, principal component analysis is performed in logarithmic space for reasons of computational efficiency. As with spectral mapping techniques, our aim is to reduce the number of radiative transfer calculations by grouping wavenumbers at which the optical properties are similar. Fig. 2 shows the layer optical depth and single scattering albedo profiles, with different lines denoting different wavenumbers. It is clear from this figure that the maximum variability in the optical depth occurs in the bottom half of the atmosphere and that of the single scattering albedo is fairly constant across the atmosphere (ignoring extremely small values). Keeping this in mind, our grouping criteria are as follows: c 1 < ln(2τ 2 ) < c 2 , where τ 2 is the cumulative optical depth of the lower half of the atmosphere (layers 11 to 22) c 3 < ω 1 < c 4 , where ω 1 is the single scattering albedo of the top layer c 1 - c 4 are to be picked by the user. A particular choice of these parameters defines a 'case'. Figure 2: Layer optical depth (dτ) and single scattering albedo (ω) profiles. The different lines represent different wavenumbers. For clarity of presentation, the profiles are shown only for every 25th wavenumber. EOFs and PCs for Sample Case Figs. 3-4 show the first two EOFs (and the corresponding PCs) for the sample case where c 1 = 0.25, c 2 = 0.5, c 3 = 0.7 and c 4 = 1 (in the P branch). The EOFs reflect the vertical variations in the gas density and the half-width of the spectral lineshape, while the PCs display the dependence of the line shape on frequency. The first two EOFs capture more than 99% of the variance. Empirical Orthogonal Functions Grouping Criteria

Upload: others

Post on 14-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Application of Principal Component Analysis to High …web.gps.caltech.edu/~vijay/AGU/Vijay AGU Poster_1.pdf[1] Kuang ZM, Margolis JS, Toon GC, Crisp D, Yung YL. Spaceborne measurements

Application of Principal Component Analysis to High Spectral Resolution Radiative Transfer: A Case Study of the O2 A band

AbstractRadiative transfer computation is the rate-limiting step in most high spectral resolution remote sensing retrieval applications. While several techniques have been proposed to speed up radiative transfer calculations, they all suffer from accuracy considerations. We propose a new method, based on a principal component analysis of the optical properties of the system, that addresses these concerns. Taking atmospheric transmission in the O2 A band as a test case, we reproduced the radiance spectrum, obtained using the multiple scattering code DISORT, with an accuracy of 0.3%, while achieving an order of magnitude improvement in speed.

Vijay Natraj1, Xun Jiang1, Run-lie Shia1, Xianglei Huang2, Jack S. Margolis3 and Yuk L. Yung1;1Department of Geology and Planetary Sciences, California Institute of Technology, 2Program in Atmospheric and Oceanic Sciences, Princeton University,3 RSA Systems, Altadena

In our analysis, we seek an accurate and efficient characterization of near-infrared (NIR) absorption in the O2 A band centered at 760 nm. We use a 23-level model atmosphere, obtained from the ECMWF data for a sub-tropical northern hemisphere (15°N) summer with 15 levels in the stratosphere and the remaining in the troposphere. The levels are spaced linearly in log(pressure) from 1 mbar to 1 bar. Rayleigh scattering by air molecules and scattering by aerosols are taken into account. The total aerosol optical depth and the surface albedo are assumed to be 0.05 and 0.2 respectively. The spectroscopic data are taken from the HITRAN2K line list .

Two codes were used to generate the O2 A band spectrum: the multi-stream, line-by-line multiple scattering code DISORT, and a multiple scattering code which uses only two streams (one up, one down), to be henceforth called TWOSTR. DISORT is on average two to three orders of magnitude slower than TWOSTR. Fig. 1 shows the correlation between the upwelling radiance spectra obtained from DISORT and TWOSTR at the top of the atmosphere (TOA), and the difference between the two calculations. It is clear that the TWOSTR spectrum has a very good correlation with that from DISORT. We observe that the variance in the radiance is much lower for the residual than for the radiances directly obtained from DISORT. We exploit this feature in the principal component analysis.

Figure 1a: Correlation plot between DISORT and TWOSTR TOA reflectance spectra

Figure 1b: Difference between TWOSTR and DISORT TOA reflectance spectra. The diamonds represent the P branch, and the crosses represent the R branch. The residuals have been plotted as a function of the DISORT reflectance to show systematic deviations more clearly.

Figure 3a: EOF1. The figures on the left and the right show the first EOF for the optical depth and the single scattering albedo respectively.

Figure 3b: PC1

Computed O2 A band Spectrum

A three-term Taylor expansion is used to compute the TOA radiance from the radiance computed for the mean optical properties and the EOFs and PCs. An error criterion of less than 1.0% was chosen for all but the most saturated lines, to simulate expected results for space-based detection [2]. The total number of monochromatic wavenumber grid points for the radiative transfer calculation was 10616; only 105 cases were needed to perform the principal component analysis. An order of magnitude improvement in speed is achieved. Fig. 5 shows the error in the computed spectrum with respect to a line by line calculation.

ConclusionA novel technique based on principal component analysis has been introduced to increase the computational efficiency of radiative transfer calculations. Using the optical depth and the single scattering albedo as the EOF parameters, and taking advantage of the correlation between a two-stream and a multi-stream approach, the O2 A-band spectrum was reproduced with an accuracy of 0.3% while achieving an order of magnitude speed improvement.

Figure 4a: EOF2. The figures on the left and the right show the first EOF for the optical depth and the single scattering albedo respectively.

Figure 4b: PC2

Figure 5: Error in the spectrum computed by principal component analysis. The root mean square error is less than a percent. When convolved with a typical instrument lineshape function, the error is of the order of a tenth of a percent.

References[1] Kuang ZM, Margolis JS, Toon GC, Crisp D, Yung YL. Spaceborne measurements of atmospheric CO2 by high-resolution NIR spectrometry of reflected sunlight: An introductory study. Geophys Res Lett 2002; 29(15): 1716.[2] O'Brien, DM, Mitchell RM, English SA, da Costa GA. Airborne measurements of air mass from O2 A-band absorption spectra. J Atmos Ocean Tech 1998; 15(6): 1272.

Model Description

Multiple Scattering Codes

The data set consists of optical properties in M atmospheric layers at N wavenumbers. The EOFs are unit eigenvectors of the mean-removed covariance matrix. The principal components (PCs), are the projections of the original data set onto the associated EOFs (scaled by the eigenvalue).The EOFs are just a new basis to represent the original data, so there is no loss of information provided that a complete set is used. A few EOFs are sufficient to reproduce nearly all the information. In practice, principal component analysis is performed in logarithmic space for reasons of computational efficiency.

As with spectral mapping techniques, our aim is to reduce the number of radiative transfer calculations by grouping wavenumbers at which the optical properties are similar. Fig. 2 shows the layer optical depth and single scattering albedo profiles, with different lines denoting different wavenumbers. It is clear from this figure that the maximum variability in the optical depth occurs in the bottom half of the atmosphere and that of the single scattering albedo is fairly constant across the atmosphere (ignoring extremely small values). Keeping this in mind, our grouping criteria are as follows:

c1 < ln(2τ2 ) < c2, where τ2 is the cumulative optical depth of the lower half of the atmosphere (layers 11 to 22)c3 < ω1 < c4, where ω1 is the single scattering albedo of the top layer

c1- c4 are to be picked by the user. A particular choice of these parameters defines a 'case'.

Figure 2: Layer optical depth (dτ) and single scattering albedo (ω) profiles. The different lines represent different wavenumbers. For clarity of presentation, the profiles are shown only for every 25th wavenumber.

EOFs and PCs for Sample CaseFigs. 3-4 show the first two EOFs (and the corresponding PCs) for the sample case where c1 = 0.25, c2 = 0.5, c3 = 0.7 and c4 = 1 (in the P branch). The EOFs reflect the vertical variations in the gas density and the half-width of the spectral lineshape, while the PCs display the dependence of the line shape on frequency. The first two EOFs capture more than 99% of the variance.

Empirical Orthogonal Functions

Grouping Criteria