m01 stockwatson123635 03 econ part01 1
TRANSCRIPT
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
1/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
Introductionto Econometrics
Chapters 1, 2 and 3
The statistical analysis of
economic (and relateddata
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
2/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-2
Brief Overview of the Course
! "conomics s#ggests important relationships$ often %ith policyimplications$ t virt#ally never s#ggests '#antitativemagnit#des of ca#sal effects.
What is the quantitativeeffect of red#cing class si)e onst#dent achievement*
+o% does another year of ed#cation change earnings*
What is the price elasticity of cigarettes*
What is the effect on o#tp#t gro%th of a 1 percentage pointincrease in interest rates &y the ,*
What is the effect on ho#sing prices of environmentalimprovements*
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
3/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-3
This course is about using data tomeasure causa effects!
! deally$ %e %o#ld li/e an eperiment
What %o#ld &e an eperiment to estimate the effect of class si)e onstandardi)ed test scores*
! #t almost al%ays %e only have o&servational (non-eperimentaldata.
ret#rns to ed#cation
cigarette prices
monetary policy
! ost of the co#rse deals %ith diffic#lties arising from #sing
o&servational to estimate ca#sal effects confo#nding effects (omitted factors
sim#ltaneo#s ca#sality
correlation does not imply ca#sation3
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
4/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"
! 4earn methods for estimating ca#sal effects #singo&servational data
! 5oc#s on applications theory is #sed only as needed to#nderstand the %hys of the methods6
! 4earn to eval#ate the regression analysis of others thismeans yo# %ill &e a&le to read7#nderstand empiricaleconomics papers in other econ co#rses6
! 8et some hands-on eperience %ith regression analysis inyo#r pro&lem sets.
In this course #ou wi$
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
5/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%
! Empirica probem$
Class si)e and ed#cational o#tp#t
Policy '#estion9 What is the effect on test scores (orsome other o#tcome meas#re of red#cing class si)e &yone st#dent per class* &y : st#dents7class*
We m#st #se data to find o#t (is there any %ay to ans%er
this withoutdata*
&eview of 'robabiit# and (tatistics)(* Chapters 2, 3+
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
6/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-
The Caifornia Test (core ata (et
All ;-< and ;-: California school districts (n= >20
?aria&les9
! @thgrade test scores (tanford-B achievement test$com&ined math and reading$ district average
! t#dent-teacher ratio (T, = no. of st#dents in thedistrict divided &y no. f#ll-time e'#ivalent teachers
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
7/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-.
Initia oo/ at the data$(You should already know how to interpret this table)
This ta&le doesnt tell #s anything a&o#t the relationship&et%een test scores and the STR.
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
8/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-0
o districts with smaer casses havehigher test scores
(catterpot of test score v. st#dent-teacher ratio
What does this figure show?
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
9/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-
*e need to get some numerica evidence on whetherdistricts with ow (T&s have higher test scores but how
1. Compare average test scores in districts %ith lo% T,s to
those %ith high T,s (estimation3
2. Test the n#ll3 hypothesis that the mean test scores in the
t%o types of districts are the same$ against thealternative3 hypothesis that they differ (hypothesis
testing3
D. "stimate an interval for the difference in the mean test
scores$ high v. lo% T, districts (confidence interval3
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
10/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-14
Initia data ana#sis$ Compare districts %ith small3 (T, E20 and large3 (T, F 20 class si)es9
1. Estimationof = difference &et%een gro#p means
2. Test the hypothesisthat = 0
3. Constr#ct a confidence intervalfor
Class i)e Average score
(
tandard deviation
(sY
n
mall 1B.> 2D:
4arge
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
11/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-11
1! Estimation
=
=
s this a large difference in a real-%orld sense*
tandard deviation across districts = 1B.1
Hifference &et%een
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
12/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-12
2! 5#pothesis testing
t=Ys Y
l
ss2
ns
+sl2
nl
=Ys Y
l
SE(Ys Y
l)
Hifference-in-means test9 comp#te the t-statistic$
! %here SE( is the standard error3 of $ the
s#&scripts sand lrefer to small3 and large3 T, districts$
and
Ys Yl Ys Yl
ss
2
=
1
ns 1 (Yi Ys )2
i=1
ns
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
13/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-13
Comp#te the difference-of-means t-statistic9
= >.0@ItI J 1.B 2D:
large
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
14/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-1"
3! Confidence interva
A B@L confidence interval for the difference &et%eenthe means is$
( M 1.B M 1.B
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
15/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-1%
*hat comes ne6t7
! The mechanics of estimation$ hypothesis testing$ andconfidence intervals sho#ld &e familiar
! These concepts etend directly to regression and its variants
! efore t#rning to regression$ ho%ever$ %e %ill revie% someof the #nderlying theory of estimation$ hypothesis testing$and confidence intervals9
Why do these proced#res %or/$ and %hy #se these ratherthan others*
We %ill revie% the intellect#al fo#ndations of statisticsand econometrics
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
16/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
&eview of (tatistica Theor#
1! The probabiit# framewor/ for statistica inference
2. "stimation
D. Testing
>. Confidence ntervals
1-1
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
17/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-1.
The probabiit# framewor/ for statisticainference
a Pop#lation$ random varia&le$ and distrition
& oments of a distrition (mean$ variance$ standarddeviation$ covariance$ correlation
c Conditional distritions and conditional means
d Histrition of a sample of data dra%n randomly from apop#lation9 Y1$ O$ Yn
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
18/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-10
)a+ 'opuation, random variabe, anddistribution
Population
! The gro#p or collection of all possi&le entities of interest(school districts
! We %ill thin/ of pop#lations as infinitely large ( is an
approimation to very &ig3
Random variable Y
! Q#merical s#mmary of a random o#tcome (district averagetest score$ district T,
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
19/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-1
Population distribution of Y
! The pro&a&ilities of different val#es of Ythat occ#r in thepop#lation$ for e. PrRY=
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
20/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-24
)b+ 8oments of a popuation distribution$ mean,variance, standard deviation, covariance,correation
mean= epected val#e (epectation of Y= E(Y = Y
= long-r#n average val#e of Yover repeated reali)ations
of Y
Variance= E(Y Y2=
= meas#re of the s'#ared spread of the distrition
tandard deviation= = Y
Y2
variance
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
21/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-21
!oments" ctd.
s#e$ness=
= meas#re of asymmetry of a distrition
! skewness= 09 distrition is symmetric
! skewnessJ (E 09 distrition has long right (left tail
#urtosis=
= meas#re of mass in tails= meas#re of pro&a&ility of large val#es
! kurtosis= D9 normal distrition
! skewnessJ D9 heavy tails (lepto#urtotic9+
E Y Y
( )
3
Y
3
E Y Y( )
4
Y
4
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
22/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-22
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
23/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-23
2 random variabes$ :oint distributionsand covariance
! ,andom varia&les"and #have a%oint distribution
! The covariance&et%een"and #is
cov("$# = ER(" "(# #S = "#
! The covariance is a meas#re of the linear association&et%een"and #6 its #nits are #nits of"N#nits of #
! cov("$# J 0 means a positive relation &et%een"and #
! f"and #are independently distrited$ then cov("$# = 0(t not vice versa
! The covariance of a r.v. %ith itself is its variance9
cov("$" = ER(" "(" "S = ER(" "2S = X2
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
24/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-2"
The covariance &et%een Test S$oreand STRis negative9
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
25/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-2%
The correlation coefficientis defined in terms ofthe covariance9
corr("$# = = r"#
! 1 corr("$# 1
! corr("$# = 1 mean perfect positive linear association
! corr("$# = 1 means perfect negative linear association
! corr("$# = 0 means no linear association
cov(X,Z)
var(X)var(Z)= XZ
X
Z
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
26/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-2
Thecorrelation coefficient measures linearassociation
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
27/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-2.
)c+ Conditiona distributions andconditiona means
&onditional distributions
! The distrition of Y$ given val#e(s of some other randomvaria&le$"
! "9 the distrition of test scores$ given that T, E 20
&onditional e'pectations and conditional moments
! $onditional ean= mean of conditional distrition
= E(YI"=% (important concept and notation
! $onditional varian$e= variance of conditional distrition
! E%aple9 E(Test s$ores I STRE 20 = the mean of test scoresamong districts %ith small class si)es
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
28/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-20
The differen$e in eans is the differen$e between the eans oftwo $onditional distributions!
= E(Test s$oresISTRE 20 E(Test s$oresISTRF 20
Uther eamples of conditional means9
! Wages of all female %or/ers (Y= %ages$"= gender
! ortality rate of those given an eperimental treatment(Y= live7die6"= treated7not treated
! f E("I# = const$ then corr("$# = 0
(not necessarily vice versa ho%ever for eample considerthe f#nction
The $onditional ean is a different ter for the failiar idea ofthe group ean
2XY=
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
29/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-2
)d+ istribution of a sampe of data drawnrandom# from a popuation$ Y1,7,Yn
(e $ill assume simple random sampling
! Choose an individ#al (district$ entity at random from thepop#lation
Randomness and data
! Prior to sample selection$ the val#e ofYis random &eca#sethe individ#al selected is random
! Unce the individ#al is selected and the val#e of Yiso&served$ then Yis K#st a n#m&er not random
! The data set is (Y1$ Y2$O$ Yn$ %here Yi= val#e of Yfor the ith
individ#al (district$ entity sampled
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
30/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-34
)istribution of Y1,7,Ynunder simple random sampling
! eca#se individ#als V1 and V2 are selected at random$ theval#e of Y1has no information content for Y2. Th#s9
Y1and Y2are independently distributed
Y1and Y2come from the same distrition$ that is$ Y1$ Y2
are identically distributed
That is$ #nder simple random sampling$ Y1and Y2are
independently and identically distrited (i.i.d..
ore generally$ #nder simple random sampling$ YiX$ i=
1$O$ n$ are i.i.d.
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
31/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
1. The pro&a&ility frame%or/ for statistical inference
2! Estimation
D. Testing
>. Confidence ntervals
1-31
&eview of (tatistica Theor#
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
32/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-32
This frame$or# allo$s rigorous statistical inferencesabout moments of population distributions using a sampleof data from that population7
Estimation
is the nat#ral estimator of the mean. #t9
a What are the properties of *
& Why sho#ld %e #se rather than some other estimator*! Y1(the first o&servation
! may&e #ne'#al %eights not simple average
! median(Y1$O$ Yn
The starting point is the sampling distrition of O
Y
Y
Y
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
33/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-33
)a+ The samping distribution of
is a random varia&le$ and its properties are determined &ythe
sampling distributionof
The individ#als in the sample are dra%n at random.
Th#s the val#es of (Y1$ O$ Yn are random
Th#s f#nctions of (Y1$ O$ Yn$ s#ch as $ are random9 had a
different sample &een dra%n$ they %o#ld have ta/en on adifferent val#e
The distrition of over different possi&le samples of si)e niscalled thesampling distributionof .
The mean and variance of are the mean and variance of itssampling distrition$ E( and var( .
The concept of the samping distribution underpins a ofeconometrics.
Y
YY
YY
Y
Y
Y
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
34/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-3"
The sampling distribution of " ctd.
E'ample9 #ppose Yta/es on 0 or 1 (a *ernoullirandom varia&le
%ith the pro&a&ility distrition$ PrRY= 0S = .22$ Pr(Y=1 = .G:
Then
E(Y =pN1 Y (1 p N0 =p= .G:
= ERY E(YS2=p(1 p = .G:N (1.G: = 0.1G1:>
Pr( = = 2N.22N.G: = .D>D2
Pr( = 1 = .G:2= .
Y
Y
2
Y
Y
Y
Y
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
35/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-3%
The sampling distrition of %hen Yis erno#lli (p=.G:9
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
36/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-3
Things we want to /now about thesamping distribution$
! What is the mean of *
f E( = tr#e = .G:$ then is an unbiasedestimator of
! What is the variance of *
+o% does var( depend on n?
! Hoes it &ecome close to %hen nis large*
4a% of large n#m&ers9 is a consistentestimator of
! appears &ell shaped for nlargeOis this generally tr#e*
n fact$ is approimately normally distrited for nlarge(Central 4imit Theorem
Y
Y
Y
Y
Y
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
37/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-3.
The mean and variance of the sampingdistribution of
! 8eneral case that is$ for Yii.i.d. from any distrition$ not
K#st erno#lli9
! mean9 E( = E( = = = Y
! ?ariance9 var( = ER E( S2
= ER YS2
= E
= E
Y
Y
1
nYi
i=1
n
1
nE(Y
i)
i=1
n
1
n
Yi=1
n
Y Y Y
1
n Yii=1
n
Y
2
1
n(Y
i
Y)
i=1
n
2
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
38/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-30
so var( = E
=
=
=
=
=
1
n(Y
i
Y)
i=1
n
2
Y
E1
n(Y
i
Y)
i=1
n
1
n(Y
j
Y)
j=1
n
1
n
2E (Y
i
Y)(Y
j
Y)
j=1
n
i=1
n
1
n2cov(Y
i,Y
j)
j=1
n
i=1
n
1
n2 Y
2
i=1
n
2
Y
n
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
39/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-3
!ean and variance of samplingdistribution of " ctd.
E( = Y
var( =
&pli$ations9
1. is an unbiasedestimator of Y(that is$ E( = Y
2. var( is inversely proportional to n
1. the spread of the sampling distrition is
proportional to 17
2. Th#s the sampling #ncertainty associated %ithis proportional to 17 (larger samples$ less#ncertainty$ t s'#are-root la%
Y
Y
Y
Y
2
n
Y
n
n
Y
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
40/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"4
The samping distribution of when nisarge
5or small sample si)es$ the distrition of is complicated$ tif nis large$ the sampling distrition is simpleZ
1. As nincreases$ the distrition of &ecomes moretightly centered aro#nd Y(the 'aw of 'arge ubers
2. oreover$ the distrition of Y&ecomes normal
(the entral 'iit Theore
Y
Y
Y
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
41/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"1
The +a$ of +arge ,umbers$
An estimator is consistentif the pro&a&ility that its falls %ithin aninterval of the tr#e pop#lation val#e tends to one as the sample si)eincreases.
f (Y1$O$Yn are i.i.d. and E $ then is a consistent estimator of
Y$ that is$
PrRI YI E S 1 as n
%hich can &e %ritten$ Y
( Y3 means converges in pro&a&ility toY3.
(the ath9 as n$ var( = 0$ %hich implies thatPrRI YI E S 1.
Y2 Y
Y
p
Y
Y Y
p
Y
Y
2
nY
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
42/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"2
The &entral +imit Theorem)C;T+9
f (Y1$O$Yn are i.i.d. and 0 E E $ then %hen nis large
the distrition of is %ell approimated &y a normaldistrition.
[ (Y$
normally distrited %ith mean Yand variance 7n3
( Y7Yis \ approimately distrited (0$1
(standard normal
That is$ standardi)ed3 = = is
approimately distrited as (0$1
The larger the n$ the &etter is the approimation.
n Y
Y2
Y
Y
Y
2
n
Y2
YYE(Y)
var(Y)
Y Y
Y / n
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
43/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"3
ampling distrition of %hen Yis erno#lli$p= 0.G:9
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
44/66
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
45/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"%
(ummar#$ The (amping istribution of
5or Y1$O$Yni.i.d. %ith 0 E E $
! The eact (finite sample sampling distrition of has mean Y(
is an #n&iased estimator of Y3 and variance 7n
! Uther than its mean and variance$ the eact distrition of is
complicated and depends on the distrition of Y(the pop#lationdistrition
! When nis large$ the sampling distrition simplifies9
Y
Y2
Y
Y
2
Y
*Y (4a% of large n#m&ers
p
Y
YE(Y)
var(Y) is approimately (0$1 (C4T
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
46/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"
)b+ *h#
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
47/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-".
Estimator of the variance of Y$
= = sample variance of Y3
5act9
f (Y1$O$Yn are i.i.d. and E(Y> E $ then
Why does the la% of large n#m&ers apply*
! eca#se is a sample average6
! Technical note9 %e ass#me E(Y> E &eca#se herethe average is not of Yi$ t of its s'#are6
sY2
1n 1
(Yi Y)
2
i=1
n
sY2
p
Y2
sY
2
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
48/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
1.The pro&a&ility frame%or/ for statistical inference
2."stimation
3!Testing
>.Confidence intervals
1-"0
&eview of (tatistica Theor#
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
49/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"
5#pothesis Testing
The hypothesis testing pro&lem(for the mean9 ma/e aprovisional decision &ased on the evidence at hand %hether an#ll hypothesis is tr#e$ or instead that some alternativehypothesis is tr#e. That is$ test
+09 E(Y = Y$0vs. +19 E(Y J Y$0(1-sided$ J
+09 E(Y = Y$0vs. +19 E(Y E Y$0(1-sided$ E
+09 E(Y = Y$0vs. +19 E(Y ] Y$0(2-sided
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
50/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
T#pes of errors$
! Type "rror - n a hypothesis test$ a type error occ#rs %henthe n#ll hypothesis is reKected %hen it is in fact tr#e6 that is$ +0
is %rongly reKected.
! Type "rror - n a hypothesis test$ a type error occ#rs %hen
the n#ll hypothesis +0$ is not reKected %hen it is in fact false.
! The follo%ing ta&le gives a s#mmary of possi&le res#lts of anyhypothesis test9
1-%4
ecision
&e:ect 54 on=t re:ect 54
Truth54 Type "rror ,ight Hecision
51 ,ight Hecision Type "rror
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
51/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%1
ome terminology for testing statisticalhypotheses-
p-value= pro&a&ility of dra%ing a statistic (e.g. at leastas adverse to the n#ll as the val#e act#ally comp#ted %ith yo#rdata$ ass#ming that the n#ll hypothesis is tr#e.
Thesignificance level(or the alpha level of a test is a pre-
specified pro&a&ility of incorrectly reKecting the n#ll$ %hen then#ll is tr#e.
&alculating the pvalue&ased on 9
p-val#e =
Where is the val#e of act#ally o&served (nonrandom
PrH0 [Y Y,0 >Yact
Y,0 !
Y
Y
Yact Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
52/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%2
&alculating the pvalue" ctd.
! To comp#te thep-val#e$ yo# need the to /no% the sampling
distrition of $ %hich is complicated if nis small.
! f nis large$ yo# can #se the normal approimation (C4T9
p-val#e = $
=
=
pro&a&ility #nder leftYright (0$1 tails
%here = std. dev.of the distrition of = Y7 .
Y
PrH
0
[Y Y,0
>Yact Y,0
!
PrH
0
[Y
Y,0
Y
/ n>
Yact Y,0
Y
/ n!
PrH
0
[Y
Y,0
Y
>Yact
Y,0
Y
!
Y Y n
[=
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
53/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%3
&alculating the pvalue $ith Y#no$n-
! 5or large n$p-val#e = the pro&a&ility that a (0$1 randomvaria&le falls o#tside I( Y$07 I
! n practice$ is #n/no%n it m#st &e estimated
Yact
Y
Y
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
54/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%"
&omputing the pvalue $ith estimated9
p-val#e = $
=
(large n
so
pro&a&ility #nder normal tails o#tside Ita$tI
%here t= (the #s#al t-statistic
Y2
PrH0 [YY,0 >Y
act Y,0 !
PrH
0
[Y
Y,0
Y
/ n>
Yact Y,0
Y
/ n!
PrH
0
[ Y Y,0s
Y/ n
> Yact
Y,0s
Y/ n
!
Y Y,0
sY/ n
[=
Pr
H0
[ t> tact ! Y2p-val#e = ( estimated
[=
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
55/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%%
*hat is the in/ between thep-vaue andthe significance eve
! The significance level is prespecified.
! 5or eample$ if the prespecified significance level is @L$ yo#reKect the n#ll hypothesis if ItI F 1.B
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
56/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%
The (tudent t distribution
f Yi$ i= 1$O$ nis i.i.d. (Y$ $ then the t-statistic has the
t#dent t-distrition %ith n 1 degrees of freedom.
t=
The critical val#es of the t#dent t-distrition is talated inthe &ac/ of all statistics &oo/s.
1. Comp#te the t-statistic
2. Comp#te the degrees of freedom$ %hich is n 1
D. 4oo/ #p the @L critical val#e
>. f the t-statistic eceeds (in a&sol#te val#e this criticalval#e$ reKect the n#ll hypothesis.
Y2
Y Y,0
sY/ n
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
57/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%.
&omments on tudent t distribution
! f the sample si)e is moderate (several do)en or large(h#ndreds or more$ the difference &et%een the t-distrition and Q(0$1 critical val#es is negligi&le. +ere aresome @L critical val#es for 2-sided tests9
degrees of freedom
(n 1
@L t-distrition
critical val#e
10 2.2D
20 2.0BD0 2.0>
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
58/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%0
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
59/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%
The tudentt distribution / ummary
! The ass#mption that Yis distrited (Y$ is rarely pla#si&lein practice (ncome* Q#m&er of children*
! 5or nJ D0$ the t-distrition and (0$1 are very close (as ngro%s large$ the tn(1distrition converges to (0$1
! Thet-distrition is an artifact from days %hen sample si)es %eresmall and comp#ters3 %ere people
! 5or historical reasons$ statistical soft%are typically #ses the t-distrition to comp#tep-val#es t this is irrelevant %hen thesample si)e is moderate or large.
! 5or these reasons$ in this class %e %ill foc#s on the large-napproimation given &y the C4T
Y2
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
60/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
(ome other distributions
! Ta&les of +ypothesis Tests
1-4
http://mathnstats.com/index.php/hypothesis-testing/130-table-of-hypothesis-tests.htmlhttp://mathnstats.com/index.php/hypothesis-testing/130-table-of-hypothesis-tests.html -
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
61/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
&eguar e6ampes of h#pothesis testing
! Airport sec#rity systems are designed to detect %eapons and&om&s. When yo# %al/ thro#gh an airport metal detector$ thesystem is trying to discriminate &et%een the hypothesis ^thisperson is not carrying a %eapon^ and the hypothesis ^this person iscarrying a %eapon$^ on the &asis of electromagnetic fieldmeas#rements.
! A dr#g company %ants to determine %hether a ne% headacheremedy %or/s. The concl#sion %ill &e &ased on %hat happens %hena gro#p of s#&Kects #se the remedy or a place&o to treatheadaches.
1-1
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
62/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved.
1.The pro&a&ility frame%or/ for statistical inference
2."stimation
D.Testing
"!Confidence intervas
1-2
&eview of (tatistica Theor#
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
63/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-3
Confidence Intervas
! A B@L confidence intervalfor Yis an interval thatcontains the tr#e val#e of Yin B@L of repeated samples.
! ,igression9 What is random here* The val#es of Y1$...$Ynand
th#s any f#nctions of them incl#ding the confidence
interval. The confidence interval %ill differ from one sampleto the net. The pop#lation parameter$ Y$ is not random6
%e K#st dont /no% it.
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
64/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-"
&onfidence intervals" ctd.
A B@L confidence interval can al%ays &e constr#cted as theset of val#es of Ynot reKected &y a hypothesis test %ith a
@L significance level.
Y9 1.B
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
65/66
Copyright 2011 Pearson Addison-Wesley. All rights reserved. 1-%
(ummar#9
5rom the t%o ass#mptions of9
1. simple random sampling of a pop#lation$ that is$
Yi$ i=1$O$nX are i.i.d.
2. 0 E E(Y> E
%e developed$ for large samples (large n9 Theory of estimation (sampling distrition of
Theory of hypothesis testing (large-n distrition of t-statistic and comp#tation of thep-val#e
Theory of confidence intervals (constr#cted &y invertingthe test statistic
Are ass#mptions (1 _ (2 pla#si&le in practice* >es
Y
;et?s go bac/ to the origina poic#
-
8/10/2019 M01 StockWatson123635 03 Econ Part01 1
66/66
;et?s go bac/ to the origina poic#@uestion$
What is the effect on test scores of red#cing T, &y onest#dent7class*
+ave we answered this question?