psy 201: statistics in psychology - purduegfrancis/classes/psy392/lecture06.pdfrange highest score -...

29
PSY 201: Statistics in Psychology Lecture 06 Variability How to make IQ scores look good. Greg Francis Purdue University Fall 2019 Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 1 / 29

Upload: others

Post on 05-Oct-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

PSY 201: Statistics in PsychologyLecture 06Variability

How to make IQ scores look good.

Greg Francis

Purdue University

Fall 2019

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 1 / 29

Page 2: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

DESCRIPTION

central tendency gives an indication of where most, many, or average,scores are

also want some idea of how much variability exists in a distribution ofscores

I rangeI mean deviationI varianceI standard deviation

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 2 / 29

Page 3: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

RANGE

Highest score - lowest scoreName Sex Score

Greg Male 95

Ian Male 89

Aimee Female 94

Jim Male 92

95 - 89 = 6

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 3 / 29

Page 4: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

PROBLEM

range is very sensitive to “extreme” scoresName Sex Score

Greg Male 95

Ian Male 89

Aimee Female 94

Jim Male 92

Bob Male 32

95 - 32 = 63

one score makes a big difference!

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 4 / 29

Page 5: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

MEAN DEVIATION

we can decrease sensitivity to extreme scores by consideringdeviations from a measure of central tendency

a deviation score isxi = Xi − X

we define the mean deviation as:

MD =Σ|Xi − X |

n=

Σ|xi |n

where |xi | means: “absolute value of xi”

why do we take the absolute value instead of just summingdeviations?

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 5 / 29

Page 6: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

VARIANCE

mean deviation turns out to be mathematically messy

squaring also removes minus signs!

sum of squaresSS = Σ(Xi − X )2 = Σ(xi )

2

variance is the average sum of squares

calculation depends on whether scores are from a population or asample

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 6 / 29

Page 7: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

POPULATION

a population includes all members of a specified group

variance is defined as:

σ2 =SS

N=

Σ(Xi − µ)2

N=

Σ(xi )2

N

whereI µ is the mean of the populationI N is the number of scores in the population

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 7 / 29

Page 8: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

SAMPLE

a sample includes a subset of scores from a population

variance is defined as:

s2 =SS

n − 1=

Σ(Xi − X )2

n − 1=

Σ(xi )2

n − 1

where

I X is the mean of the sampleI n is the number of scores in the sample

why the differences? Don’t worry for now. Just know the calculations.

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 8 / 29

Page 9: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

SAMPLE VARIANCE

deviation formula:

s2 =Σ(xi )

2

n − 1

alternative (but equivalent) calculation is the raw score formula

s2 =SS

n − 1=

Σ(Xi )2 − [(ΣXi )

2 /n]

n − 1

use whichever formula is simpler!

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 9 / 29

Page 10: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

EXAMPLEName Sex Score

Greg Male 95Ian Male 89Aimee Female 94Jim Male 92

since we have the raw scores, we use the raw score formula (weassume a sample)

s2 =SS

n − 1=

Σ(Xi )2 − [(ΣXi )

2 /n]

n − 1

ΣX 2i = (95)2 + (89)2 + (94)2 + (92)2 = 34246

(ΣXi )2/n = (95 + 89 + 94 + 92)2/4 =

(370)2

4=

136900

4= 34225

so,

s2 =34246− 34225

3=

21

3= 7

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 10 / 29

Page 11: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

SUM OF SQUARES

earlier we calculated the squared deviation from the mean∑x2i =

∑(Xi − X )2

= (95− 92.5)2 + (89− 92.5)2 + (94− 92.5)2 + (92− 92.5)2

= (2.5)2 + (−3.5)2 + (1.5)2 + (−0.5)2 = 0

= 6.25 + 12.25 + 2.25 + 0.25 = 21.0

we can use that to calculate variance with the deviation score formula:

s2 =Σx2

i

n − 1=

21

3= 7

Same as before!

Note! variance cannot be negative

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 11 / 29

Page 12: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

STANDARD DEVIATION

variance is in squared units of measurement

I distance: squared metersI weight: squared kilogramsI temperature: squared degreesI ...

standard deviation is in the same units as the scores!

square root of variance

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 12 / 29

Page 13: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

STANDARD DEVIATION

deviation score formula:

s =√s2 =

√SS

(n − 1)=

√Σ(xi )2

(n − 1)

raw score formula:

s =√s2 =

√SS

(n − 1)=

√Σ(Xi )2 − [(ΣXi )

2 /n]

n − 1

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 13 / 29

Page 14: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

EXAMPLEName Sex Score

Greg Male 95Ian Male 89Aimee Female 94Jim Male 92

since we have the raw scores, we use the raw score formula tocalculate variance

s2 =SS

n − 1=

Σ(Xi )2 − [(ΣXi )

2 /n]

n − 1

we calculated earlier that the variance equals:

s2 =34246− 34225

3=

21

3= 7

and then the standard deviation equals:

s =√s2 =

√7 = 2.646

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 14 / 29

Page 15: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

WHY BOTHER?

the value of the standard deviation gives us an idea of how spread outscores are

larger standard deviations indicate that scores are more spread out

-5 -4 -3 -2 -1 0 1 2 3 4 5

Score

0

2

4

6

Fre

quen

cy

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 15 / 29

Page 16: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

WHY BOTHER?

we will use standard deviation to let us estimate how different a scoreis relative to the central tendency of the distribution

we can then compare (in a certain sense) across distributions!

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 16 / 29

Page 17: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

STANDARD SCORE

also called z-score

Standard score =raw score−mean

standard deviation

z =X − X

s

indicates the number of standard deviations a raw score is above orbelow the mean

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 17 / 29

Page 18: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

EXAMPLE

ifX = 26

ands = 4

and you have (among others) the scores X1 = 16, X2 = 32, X3 = 28

then

z1 =X1 − X

s=

16− 26

4= −2.5

z2 =X2 − X

s=

32− 26

4= 1.5

z3 =X3 − X

s=

28− 26

4= 0.5

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 18 / 29

Page 19: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

PROPERTIES

when a raw score is above the mean, its z-score is positive

when a raw score is below the mean, its z-score is negative

when a raw score equals the mean, its z-score is zero

absolute size of the z-score indicates how far from the mean a rawscore is

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 19 / 29

Page 20: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

UNITS

z-scores work in units of standard deviation

new numbers for same information!

just like converting units for other familiar measuresI length: feet into meters, miles into kilometersI weight: pounds into kilogramsI temperature: fahrenheit into celsiusI data: raw score units into standard deviation units

trick!: standard deviation units depend on your particular set of data!

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 20 / 29

Page 21: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

PROPERTIES

z-scores are data

we can find distributions, means, and standard deviations

special properties of z-score distributionsI The shape of the distribution of standard scores is identical to that of

the original distribution of raw scores.I The mean of a distribution of z-scores will always equal 0.I The variance (and standard deviation) of a distribution of z-scores

always equals 1.

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 21 / 29

Page 22: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

EXAMPLE

A simple data set to play withI when a raw score is above the

mean, its z-score is positiveI when a raw score is below the

mean, its z-score is negativeI when a raw score equals the

mean, its z-score is zeroI absolute size of the z-score

indicates how far from themean a raw score is

Subject Raw score z-score

1 10 1.262 9 0.943 3 -0.944 10 1.265 9 0.946 2 -1.267 2 -1.268 10 1.269 5 -0.3110 5 -0.3111 1 -1.5712 6 0.013 8 0.6314 6 0.015 6 0.016 1 -1.5717 3 -0.9418 6 0.019 10 1.2620 8 0.63

n = 20

X 6.0 0.0s 3.18 1.0

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 22 / 29

Page 23: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

EXAMPLE

compare distributions of raw scores and z-scores

shape is the same

2 4 6 8 10

Raw Score

0

1

2

3

4

Fre

quen

cy

-1.5 -1 -0.5 0 0.5 1

z - Score

0

1

2

3

4

Fre

quen

cy

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 23 / 29

Page 24: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

USES

suppose we want to compare the scores of a student in several classes

we know the student’s score, the mean score, the standard deviation,and the student’s z-score

Subject X X s z

Psychology 68 65 6 0.50

Mathematics 77 77 9 0.00

History 83 89 8 -0.75

comparison of raw scores suggests that student did best in history,mathematics, then psychology

comparison of z-scores suggests that student did best in psychology,mathematics, then history (relative to other students)

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 24 / 29

Page 25: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

TRANSFORMED SCORES

sometimes z-scores are unattractive

I zero meanI negative values

need to convert same information into a new distribution with a newmean and standard deviation

X ′ = (s ′)(z) + X′

whereI X ′ = new or transformed score for a particular individualI s ′ = desired standard deviation of the distributionI z = standard score for a particular individualI X

′= desired mean of the distribution

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 25 / 29

Page 26: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

TRANSFORMED SCORES

GOAL: make data understandable; IQ scores, personality tests,...

NOTE: you can change the mean and standard deviation all youwant, but it does not change the information in the data

shape remains the same!I conversion back to z-scores would produce the same z-scores!I a percentile maps to the corresponding transformed score

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 26 / 29

Page 27: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

TRANSFORMED SCORES

if we transform the scores from our earlier data set using

X ′ = 20X + 50

we get

20 30 40 50 60 70

Transformed Score

0

1

2

3

4

Fre

quen

cy

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 27 / 29

Page 28: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

CONCLUSIONS

variance

standard deviation

standard scores

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 28 / 29

Page 29: PSY 201: Statistics in Psychology - Purduegfrancis/Classes/PSY392/Lecture06.pdfRANGE Highest score - lowest score Name Sex Score Greg Male 95 Ian Male 89 Aime e Female 94 Jim Male

NEXT TIME

a very important distribution

normal distribution

Describing everyone’s height.

Greg Francis (Purdue University) PSY 201: Statistics in Psychology Fall 2019 29 / 29