assessing nonresponse bias and measurement error in estimates of employment

38
Assessing Nonresponse Bias and Measurement Error in Estimates of Employment John Dixon Clyde Tucker Polly Phipps Bureau of Labor Statistics any opinions expressed in this paper are those of the authors and do not constitute policy of the Bureau of Labor Statistics

Upload: velvet

Post on 08-Jan-2016

35 views

Category:

Documents


2 download

DESCRIPTION

Assessing Nonresponse Bias and Measurement Error in Estimates of Employment. John Dixon Clyde Tucker Polly Phipps Bureau of Labor Statistics any opinions expressed in this paper are those of the authors and do not constitute policy of the Bureau of Labor Statistics. Types of Nonresponse. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

John DixonClyde TuckerPolly Phipps

Bureau of Labor Statisticsany opinions expressed in this paper are those of the authors

and do not constitute policy of the Bureau of Labor Statistics

Page 2: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Types of Nonresponse

• Ignorable: Conditioning on known auxiliary variables some bias may be eliminated from estimates—accomplished by making weighting adjustments.

• Nonignorable: Bias that cannot be eliminated by conditioning on auxiliary variables—remains in estimates and contributes to mean squared error

• Weighting adjustments may be of only limited utility

Page 3: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Previous Research

• Household surveys– Haraldsen, et al. 1999;Keeter, et al.

2000;Curtin, et al. 2000 found little nonresponse bias

– Brick and Bose (2001) found weighting was effective

– Most results confined to response rates between 40% and 70%

Page 4: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Previous Research

• Establishment Surveys– Nonresponding units can have more effect than in household

surveys– Tomaskovic-Devey, et al. (1994) found differential nonresponse by

size, industry, and profitability of firm– Most research focused on characterizing nonrespondents

(Hidiroglou, et al. 1993 and Phipps et al. 2007, adjusting for nonresponse (Sommers, et al. 2004) or understanding why establishments don’t respond (McCarthy, et al. 1999 and McCarthy and Beckler 2000)

– Copeland (2003), in contrast, looked at the bias in early estimates that resulted from late responders in the Current Employment Statistics Program (CES)

– Tucker, et al. (2005) began an examination of bias in CES using data from the QCEW (State UI files). This paper is a more thorough report on that work.

Page 5: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

The CES

• Collects employment, hours and earnings monthly from a current sample of over 300,000 establishments

• Tracks the gains and losses in jobs in various sectors of the economy

• In this paper, nonresponse bias work on this survey focuses on estimating bias for establishment subpopulations with different patterns of nonresponse using data from the 2003 CES and QCEW (over 400,000 respondents)

Page 6: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Nonresponse Error for Sample Mean

In simplest terms

OR

Respondent Mean = Full Sample Mean +

(Nonresponse Rate)*(Respondent Mean – Nonrespondent Mean)

r n r m

mY Y Y Y

n

Page 7: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Theory

• Levels of bias will differ by subpopulations

• The difference between respondent and nonrespondent estimates will be greatest on either end of the nonresponse continuum, but potential bias greatest when response rates are low

• Bias in business surveys may be greatest in the Services sector

Page 8: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

The Current Study of Nonresponse Bias in the CES

• Because this work is theoretically driven, the paper just examines the difference in employment of respondents and nonrespondents but not the ultimate effect on estimates after accounting for response rate.

• Furthermore, the CES benchmarks the estimates to the QCEW (the data used in this study) using estimation cells based on industry and region.

• Thus, the eventual aggregate bias should be small.

Page 9: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

CES Response Rates by NAICS Categories

Page 10: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Measurement error

• Administrative data for employment counts is thought to be less error-prone than survey data.– Errors in the unemployment insurance counts have

consequences for taxes or fines.– Surveys are often handled by less experienced staff.

• Differences between the survey and the administrative data could be due to – definitional differences (e.g.:students usually aren’t

counted in the unemployment insurance counts, but may be in the payroll counts)

– reporting period

Page 11: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Measurement error (continued)

Both the administrative and the survey data vary from month to month, but if the survey has more measurement error, then the variability of the survey data relative to the administrative data could be used as a rough indicator of measurement error.

Page 12: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Distribution of relative variance

Page 13: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Measurement error measures and closing

missing

mix no All

lrvdces mlpces lrvdces mlpces lrvdces mlpces

close

1 0.0938 -2.006 0.1709 -2.585 0.1476 -2.417

2 0.1807 -1.873 0.6552 -2.237 0.4911 -2.114

3 0.2877 -1.798 0.6702 -2.118 0.4325 -1.909

All 0.1695 -1.903 0.4665 -2.374 0.3617 -2.211

Page 14: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Estimate of Biases (nonresponse and measurement error)

• Using the most recent employment reports in the QCEW (not CES) for both responders and nonresponders

• Compare the employment reports for respondents to that for nonrespondents

• Compare the variability of estimates within companies for the CES survey relative to the variability of the QCEW as an estimate of measurement bias.

• Results presented are not weighted by probability of selection, but weighted results show similar patterns

Page 15: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Nonresponse and Measurement Bias

Survey Administrative

Median (N) Median (N)

Missingness

Always missing 08.84 (294410)

Mix 12.92 (144203) 12.84 (144512)

No missing 15.65 (222228) 15.47 (221121)

Page 16: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Quantile Regression• Bias analysis performed at the establishment level on

subpopulations defined by size and industry• Testing for the difference in employment between CES

responders and nonresponders. Y=a+Bx+e where x is an indicator of nonresponse (essentially a t-test).

• Since size of firm is theorized to relate to nonresponse, the coefficients relating nonresponse to employment is likely to be different for different size firms.

• Quantile regression examines the coefficients for different quantiles of the distribution of the sizes of firms.

• Since industries can be expected to have different patterns, the quantile regressions are done by industry group.

Page 17: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Interpretation of Quantile Regression of size on bias

• At the micro level, the quantile regression shows the coefficients relating nonresponse (coded 0) to the size of firm. Each point on the curve is a regression relating nonresponse to size conditional on the rest of the distribution. The skewness affects the standard errors, so a stabilizing transformation is needed. This can be seen in the box & whisker plot to the right.

Page 18: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Distribution of size and the quantile regression curve

Page 19: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Employment and Measurement error

nonr esponse bi as ( mi xed mi ss i ng) over empl oymentmi ss=mi x

l memp

- 3

- 2

- 1

0

1

2

3

4

5

6

7

8

9

10

ml pces

- 20 - 10 0 10

Page 20: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Measures of Measurement error

Page 21: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Measures of Nonresponse

Page 22: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Interpretation of a Log Transformation

• Using the log of size, the proportionate effect is greater for smaller firms. Since the coefficients are based on linear models, the transformation makes the distribution of responders and nonresponders more reasonable, as seen in the box & whisker plot to the right.

• The quantile regression curve shows smaller firms have proportionately more bias than larger firms.

Page 23: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Quantile regression using the log of size.

l i nk r el at i ve based es t i mat esnai cs2=Agr i cul t ur e, For es t r y, Fi shi ng and Hunt i ng

0 1

- 5. 0

- 2. 5

0

2. 5

5. 0

7. 5

logrel1

nfl ag1

Page 24: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Industry patterns

• In almost all cases, there is some significant positive bias.

• The most common pattern of the quantile curves is rapidly accelerating coefficients with increasing establishment size for almost half the industries.

• Another pattern is flat coefficients until past the middle of the size distribution followed by accelerating coefficients.

• A third pattern is a relatively gradual increase in the coefficients to an asymptote.

Page 25: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Retail trade

Page 26: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Accelerating patterns

• The Agriculture, Forestry, Fishing and Hunting industries typify the rapidly accelerating pattern. The logged values show decreasing coefficients. The statistical significance shown by the confidence intervals varied.

Page 27: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Accelerating pattern industries

• Agriculture; mining; metal manufacturing; transportation and warehousing; information; finance and insurance; professional, scientific and technical services; administrative services and waste management; and accommodation and food services.

Page 28: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Late accelerating pattern

• The Health Care and Social Assistance industries typified the late accelerating pattern. While health care had positive coefficients, The real estate industry started negative and only became positive at the 80th quantile.

• While most other patterns of the logged values showed decline, these showed an increase.

Page 29: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Late Accelerating Industries

• Health Care and social assistance; retail trade; real estate, rental and leasing; education services; and other services

Page 30: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Asymptote pattern

• The construction industry typifies the asymptote pattern of gradually increasing coefficients until the highest quantiles.

• The logged values showed an early decline to an asymptote.

Page 31: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Asymptote industries

• The construction; food manufacturing; wood and mineral manufacturing; wholesale trade; and arts, entertainment and recreation showed the asymptote pattern.

• They may have a critical mass, where once the size reaches the 90th quantile, there is a more uniform participation rate.

Page 32: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Utilities

• The utilities industry shows a relatively flat curve of coefficients.

• The logged values were higher for small firms.

Page 33: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Management of Companies and Enterprises

• The coefficients is virtually flat until the highest quantiles and then turns decidedly .

• The coefficients from the logged values linearly declined.

Page 34: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Measurement error and nonresponse

Page 35: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Flat patterns

• Measurement error stayed mostly constant over the likelihood of nonresponse for Agriculture, Arts, Information, and Transportation

Page 36: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Declining pattern

• Only Real-estate had a declining pattern of measurement error over the likelihood of nonresponse.

Page 37: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Summary

• Quantile regression is a useful technique for studying nonresponse bias

• The patterns of differences between industries varied• The most common pattern indicated efforts to adjust for

nonresponse should be spent on larger firms to better estimate overall changes in employment

• Yet, proportionate differences were higher for smaller firms

• The patterns weren’t related to the nonresponse rate for the industry

• Measurement error relates to smaller differences than nonresponse bias. Mixed responders had lower measurement error, contrary to expectations.

• Most industries had an increase in measurement error as the probability of nonresponse increased.

Page 38: Assessing Nonresponse Bias and Measurement Error in Estimates of Employment

Assessing Nonresponse Bias and Measurement Error in Estimates

of Employment

John [email protected]

Clyde [email protected]

Polly [email protected]

Bureau of Labor Statistics