![Page 1: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/1.jpg)
Wednesday, October 6
Correlation and Linear Regression
![Page 2: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/2.jpg)
RDG
MATH
SCI
CIV
CONCPT
20
40
60
80
20 40 60 80
40
60
80
40 60 80
20
40
60
80
20 40 60 80
20
40
60
80
20 40 60 80
-2
0
2
-2 0 2
Explain to a non-statistician what it means to say “reading and math scores are correlated r=.69 in this population”.
.69
![Page 3: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/3.jpg)
Explain to a non-statistician what it means to say “reading and math scores are correlated r=.69 in this population”.
• Using either the reading or math score, you can predict the other value by how much sd it is from its mean. Or r=.069 means both values are 0.69 sd from their means.
![Page 4: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/4.jpg)
Explain to a non-statistician what it means to say “reading and math scores are correlated r=.69 in this population”.
• There is a strong(ish) positive linear relationship between scores on a reading test and scores on a math test in this population. Thus, we have a fair amount of predictive power when it comes to guessing a math score for a particular individual if we have that individual’s reading score. Specifically, one standard deviation unit is equal to 0.69 standard deviation units.”
![Page 5: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/5.jpg)
Explain to a non-statistician what it means to say “reading and math scores are correlated r=.69 in this population”.
• .69 is a valuable number to help guess someone’s reading score given their math score. .69 suggests that the math score can help us guess more closely what the actual reading score is. With a 1.00 r value, we could guess with certainty that the reading score=the math score. With a 0 r value, we would guess with uncertainty that the reading score was the mean of reading scores. In this case it would not be helpful to know the math score. However, .69 suggests that the reading score helps us determine the math score.
![Page 6: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/6.jpg)
Explain to a non-statistician what it means to say “reading and math scores are correlated r=.69 in this population”.
• When reading and math scores are correlated at r=.69, this means that the relationship between the subjects is quite strong. If you imagine all of these math scores and reading scores plotted against each other, a PERFECT correlation would be a straight line. Conversely, zero correlation would have data points all over with no trend at all. Thus, if r=.69, this is quite close to 1, meaning that the relationship between the math and reading scores has a high correlation.
![Page 7: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/7.jpg)
Explain to a non-statistician what it means to say “reading and math scores are correlated r=.69 in this population”.
• A correlation of .69 between reading and math scores means that, on average, as a student’s reading score increases, so do his or her math scores and vice versa. The strength of the relationship is moderately strong, so the value of a math score is a moderately strong predictor for the reading score.
![Page 8: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/8.jpg)
Explain to a non-statistician what it means to say “reading and math scores are correlated r=.69 in this population”.
• The scores of reading and math are correlated, which means that you can take one score and predict more or less what the other may be. The correlation between the two is .69, which means that if you have figured out the average score, and know how far a student is away from the average on his reading test, you can calculate how far from the average he will be on his math. …
![Page 9: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/9.jpg)
You will not leave the room until…
• you have understood that a correlation is a systematic quantitative expression of the proportion of explained and unexplained co-variation of two variables.
![Page 10: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/10.jpg)
You will not leave the room until…
• you have understood that a correlation is a systematic quantitative expression of the proportion of explained and unexplained co-variation of two variables … and you love knowing this fact!
![Page 11: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/11.jpg)
zy = zx
When X and Y are perfectly correlated
![Page 12: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/12.jpg)
We can say that zx perfectly predicts zy
zy’ = zx
Or
zy = zx
^
![Page 13: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/13.jpg)
When they are imperfectly correlated, i.e., rxy ≠ 1 or -1
zy’ = rxyzx
r is the slope of the predicted line, with a zero-intercept of z’y=0
![Page 14: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/14.jpg)
-2-1
01
2
-2 -1 0 1 2z_meas
z_est Fitted values
Oh no, hands again…
![Page 15: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/15.jpg)
24
68
10
5 6 7 8 9Measured
Estimated Fitted values
What about in raw original units?
![Page 16: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/16.jpg)
When we want to express the prediction in terms of raw units:
zy’ = rxyzx
Y’ = bYXX + aYX
![Page 17: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/17.jpg)
24
68
10
5 6 7 8 9Measured
Estimated Fitted values
Let’s try it by eyeball…
![Page 18: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/18.jpg)
When we want to express the prediction in terms of raw units:
zy’ = rxyzx
Y’ = bYXX + aYX
bYX = rYX (σy / σx)
aYX = Y - bYXX
_ _
![Page 19: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/19.jpg)
Explained and unexplained variance
SStotal = SSexplained + SSunexplained
SStotal = SSexplained + SSunexplained
N N N
![Page 20: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/20.jpg)
-2-1
01
2
-2 -1 0 1 2z_meas
z_est Fitted values
Oh no, hands again…
![Page 21: Wednesday, October 6 Correlation and Linear Regression](https://reader035.vdocument.in/reader035/viewer/2022062712/56649c915503460f9494b90f/html5/thumbnails/21.jpg)
Explained and unexplained variance
r2XY = 1 -
σ2Y’ [ =unexplained]
σ2Y [ =total]
=
σ2Y - σ2
Y’
σ2Y
r2 is the proportion explained variance to the total variance.