inputting data for a single sample t

330
Single Sample t-Tests

Upload: byu-center-for-teaching-learning

Post on 20-Jun-2015

53 views

Category:

Education


4 download

DESCRIPTION

Inputting data for a single sample t

TRANSCRIPT

Page 1: Inputting data for a single sample t

Single Sample t-Tests

Page 2: Inputting data for a single sample t

Welcome to a presentation explaining the concepts behind the use of a single sample t-test

Page 3: Inputting data for a single sample t

Welcome to a presentation explaining the concepts behind the use of a single sample t-test in determining the probability that a sample and a population are similar to or different from one another statistically.

Page 4: Inputting data for a single sample t

We will follow an example where researchers attempt to determine if the sample they have collected is statistically significantly similar or different from a population.

Page 5: Inputting data for a single sample t

Their hope is that the sample and population are statistically similar to one another, so they can claim that results of experiments done to the sample are generalizable to the population.

Page 6: Inputting data for a single sample t

Let’s imagine that this is the population distribution for IQ scores in the country:

Page 7: Inputting data for a single sample t

Let’s imagine that this is the population distribution for IQ scores in the country:

Page 8: Inputting data for a single sample t

It has a population mean of 100

Page 9: Inputting data for a single sample t

m = 100

Page 10: Inputting data for a single sample t

m = 100

This Greek symbol represents the mean

of a population

Page 11: Inputting data for a single sample t

We decide to select a random sample to do experiments on.

m = 100

Page 12: Inputting data for a single sample t

So, we randomly select 20 persons

m = 100

Page 13: Inputting data for a single sample t

Let’s say that sample of 20 has an IQ score

mean of 70

m = 100

Page 14: Inputting data for a single sample t

m = 100 = 70

Let’s say that sample of 20 has an IQ score

mean of 70

Page 15: Inputting data for a single sample t

m = 100 = 70

Note, this x with a bar over it is the symbol for a sample mean.

Page 16: Inputting data for a single sample t

m = 100 = 70

Again this Greek symbol m is the

symbol for a population mean.

Page 17: Inputting data for a single sample t

m = 100 = 70

Along with a mean of 70 this sample has a

distribution that looks like this

Page 18: Inputting data for a single sample t

m = 100 = 70

Along with a mean of 70 this sample has a

distribution that looks like this

Page 19: Inputting data for a single sample t

m = 100 = 70

So, here’s the question:

Page 20: Inputting data for a single sample t

m = 100 = 70

Is this randomly selected sample of 20 IQ scores representative of the population?

Page 21: Inputting data for a single sample t

m = 100 = 70

The Single Sample t-test is a tool used to determine the probability that it is or is not.

Page 22: Inputting data for a single sample t

So, how do we determine if the sample is a good representative of the population?

Page 23: Inputting data for a single sample t

Let’s look at the population distribution of IQ scores first:

Page 24: Inputting data for a single sample t

Let’s look at the population distribution of IQ scores first:

Page 25: Inputting data for a single sample t

One thing we notice right off is that it

has a normal distribution

Page 26: Inputting data for a single sample t

Normal Distributions have some important

properties or attributes that

make it possible to consider rare or

common occurrences

Page 27: Inputting data for a single sample t

Also, Normal Distributions have

some constant percentages that

are true across all normal

distributions

Page 28: Inputting data for a single sample t

Attribute #1: 50% of all of the scores are above the mean and the other 50% of the scores are below the mean

Page 29: Inputting data for a single sample t

Attribute #1: 50% of all of the scores are above the mean and the other 50% of the scores are below the mean

The Mean

Page 30: Inputting data for a single sample t

Attribute #1: 50% of all of the scores are above the mean and the other 50% of the scores are below the mean

The Mean 50% of all scores

Page 31: Inputting data for a single sample t

Attribute #1: 50% of all of the scores are above the mean and the other 50% of the scores are below the mean

The Mean50% of all scores

Page 32: Inputting data for a single sample t

Before going on let’s take a brief time out

Page 33: Inputting data for a single sample t

The next section requires an understanding of the concept of standard deviation.

Page 34: Inputting data for a single sample t

If you are unfamiliar with this concept do a search for standard deviation in this software. After viewing it return to slide 36 of this presentation.

Page 35: Inputting data for a single sample t

Time in – let’s get back to the instruction

Page 36: Inputting data for a single sample t

The Mean

Attribute #2: 68% of all of the scores are between +1 standard deviation and -1 standard deviation

Page 37: Inputting data for a single sample t

The Mean

Attribute #2: 68% of all of the scores are between +1 standard deviation and -1 standard deviation from the mean

Page 38: Inputting data for a single sample t

The Mean

Attribute #2: 68% of all of the scores are between +1 standard deviation and -1 standard deviation from the mean

+1 sd

Page 39: Inputting data for a single sample t

The Mean

Attribute #2: 68% of all of the scores are between +1 standard deviation and -1 standard deviation from the mean

+1 sd-1 sd

Page 40: Inputting data for a single sample t

The Mean

Attribute #2: 68% of all of the scores are between +1 standard deviation and -1 standard deviation from the mean

+1 sd-1 sd

68% of all scores

Page 41: Inputting data for a single sample t

The Mean

So what this means is – if you were randomly selecting samples from this population you have a 68% chance or .68 probability of pulling that sample from this part of the distribution.

+1 sd-1 sd

68% of all scores

Page 42: Inputting data for a single sample t

The Mean

+1 sd-1 sd

68% of all scores

Let’s put some numbers to this idea.

Page 43: Inputting data for a single sample t

The Mean

+1 sd-1 sd

68% of all scores

The mean of IQ scores across the population is 100

Page 44: Inputting data for a single sample t

m = 100

+1 sd-1 sd

With a population standard deviation (s) of 15: +1 standard deviation would at an IQ score of 115 and -1 standard deviation would be at 85

68% of all scores

Page 45: Inputting data for a single sample t

m = 100

+1 sd

-1 sd

-1s=85

68% of all scores

With a population standard deviation (s) of 15: +1 standard deviation would at an IQ score of 115 and -1 standard deviation would be at 85

Page 46: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

+1s=115-1s=85

With a population standard deviation (s) of 15: +1 standard deviation would at an IQ score of 115 and -1 standard deviation would be at 85

Page 47: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores So, there is a 68% chance or .68 probability that a sample was collected between IQ scores of 85 and 115

+1s=115-1s=85

Page 48: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

Attribute 2: 2 standard deviation units above and below the mean constitute 95% of all scores.

+1s=115-1s=85

Page 49: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

Attribute 2: 2 standard deviation units above and below the mean constitute 95% of all scores.

+1s=115-1s=85

+2 sd+2 sd

Page 50: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

2 standard deviation units above the mean would be an IQ score of 130 or 100 + 2*15(sd))

+1s=115-1s=85

+2 sd+2 sd

Page 51: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

2 standard deviation units above the mean would be an IQ score of 130 or 100 + 2*15(sd))

+1s=115-1s=85

+2 sd

+1s=130

-2 sd

Page 52: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

2 standard deviation units below the mean would be an IQ score of 70 or 100 - 2*15(sd))

+1s=115-1s=85

+2 sd

+1s=115

-2 sd

Page 53: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

2 standard deviation units below the mean would be an IQ score of 70 or 100 - 2*15(sd))

+1s=115-1s=85

+2 sd

+1s=115-2s=70

-2 sd

Page 54: Inputting data for a single sample t

m = 100

+1 sd-1 sd

68% of all scores

Now, it just so happens in nature that 95% of all scores are between +2 and -2 standard deviations in a normal distribution.

+1s=115-1s=85

+2 sd

+1s=115-2s=70

-2 sd

Page 55: Inputting data for a single sample t

m = 100

+1 sd-1 sd

95% of all scores

+1s=115-1s=85

+2 sd

+1s=130-2s=70

-2 sd

Now, it just so happens in nature that 95% of all scores are between +2 and -2 standard deviations in a normal distribution.

Page 56: Inputting data for a single sample t

m = 100

+1 sd-1 sd

95% of all scores

This means that there is a .95 chance that a sample we select would come from between these two points.

+1s=115-1s=85

+2 sd

+1s=130-2s=70

-2 sd

Page 57: Inputting data for a single sample t

m = 100

+1 sd-1 sd

95% of all scores

+1s=115-1s=85

+2 sd

+1s=130-2s=70

-2 sd

Attribute #3: 99% of all scores are between +3 and -3 standard deviations.

Page 58: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+1s=115-1s=85

+2 sd

+1s=130-2s=70

-2 sd

-3s=55

-3 sd

+3s=55

+3 sd

Attribute #3: 99% of all scores are between +3 and -3 standard deviations.

Page 59: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+1s=115-1s=85

+2 sd

+1s=130-2s=70

-2 sd

-3s=55

-3 sd

-3s=55

+3 sd

These standard deviations are only approximates.

Page 60: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+1s=115-1s=85

+2 sd

+1s=130-2s=70

-2 sd

-3s=55 -3s=55

+3 sd-3 sd

Here are the actual values

Page 61: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+1s=115-1s=85

+2 sd

+1s=130-2s=70

-2 sd

-2.58 s=55

-2.58 sd

-3s=55

+3 sd

Here are the actual values

Page 62: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+1s=115-1s=85

+2 sd

+1s=130-1.96 s=70

-1.96 sd-2.58 sd

-3s=55

+3 sd

Here are the actual values

-2.58 s=55

Page 63: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+2 sd

+1s=130

-1.96 sd-2.58 sd

-3s=55

+3 sd

Here are the actual values

-1.96 s=70

-2.58 s=55

+1s=115-1 s=85

Page 64: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+2 sd

+1s=130

-1.96 sd-2.58 sd

-3s=55

+3 sd

Here are the actual values

-1.96 s=70

-2.58 s=55

+1 s=115

-1 s=85

Page 65: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+1.96 s=130

-1.96 sd-2.58 sd

-3s=55

+3 sd

Here are the actual values

+1.96 sd

-1.96 s=70

-2.58 s=55

+1 s=115

-1 s=85

Page 66: Inputting data for a single sample t

m = 100

+1 sd-1 sd

99% of all scores

+1 s=115

-1 s=85

+1.96 s=130

-1.96 sd-2.58 sd

+2.58 s=145

+2.58 sd

Here are the actual values

+1.96 sd

-1.96 s=70

-2.58 s=55

Page 67: Inputting data for a single sample t

+1 sd-1 sd

99% of all scores

+1.96 sd-1.96 sd-2.58 sd +2.58 sd

Based on the percentages of a normal distribution, we can insert the percentage of scores below each standard deviation

point.

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 68: Inputting data for a single sample t

+1 sd-1 sd

99% of all scores

+1.96 sd-1.96 sd-2.58 sd +2.58 sd

Attribute #4: percentages can be calculated below or above each standard deviation point in the distribution.

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 69: Inputting data for a single sample t

99% of all scores

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 70: Inputting data for a single sample t

99% of all scores

95% of all scores

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 71: Inputting data for a single sample t

99% of all scores

95% of all scores

68% of all scores

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 72: Inputting data for a single sample t

99% of all scores

95% of all scores

68% of all scores

With this information we can determine the probability that scores will fall into a number portions of the distribution.

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 73: Inputting data for a single sample t

For example:

Page 74: Inputting data for a single sample t

0.5%

There is a 0.5% chance that if you randomly

selected a person that their IQ

score would be below a 55 or a -

2.58 SD

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 75: Inputting data for a single sample t

2.5%

There is a 2.5% chance that if you randomly

selected a person that their IQ

score would be below a 70 or a -

1.96 SD

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 76: Inputting data for a single sample t

16.5%

There is a 16.5% chance that if you randomly

selected a person that their IQ

score would be below a 85 or a -

1 SD

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 77: Inputting data for a single sample t

50%

There is a 50% chance that if you randomly

selected a person that their IQ

score would be below a 100 or a

0 SD

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 78: Inputting data for a single sample t

There is a 50% chance that if you randomly

selected a person that their IQ

score would be above a 100 or a

0 SD

50%

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 79: Inputting data for a single sample t

There is a 16.5% chance that if you randomly

selected a person that their IQ

score would be above a 115 or a

+1 SD

16.5%

+1 sd-1 sd +1.96 sd-1.96 sd-2.58 sd +2.58 sd

m = 100

+1 s=115

-1 s=85

+1.96 s=130

+2.58 s=145

-1.96 s=70

-2.58 s=55

Page 80: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

There is a 2.5% chance that if you randomly

selected a person that their IQ

score would be above a 130 or a

+2.58 SD

2.5%

Page 81: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

There is a 0.5% chance that if you randomly

selected a person that their IQ

score would be above a 145 or a

+2.58 SD

.5%

Page 82: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

What you have just seen illustrated is the concept of probability density or the probability that a score or observation would be selected above,

below or between two points on a distribution.

Page 83: Inputting data for a single sample t

Back to our example again.

Page 84: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Page 85: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Here is the sample we randomly selected

= 70

Page 86: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

The sample mean is 30 units away from the

population mean (100 – 70 = 30).

= 70

Page 87: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

The sample mean is 30 units away from the

population mean (100 – 70 = 30).

= 7030

Page 88: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Is that far away enough to be

called statistically significantly

different than the population?

= 70

Page 89: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

How far is too far away?

= 70

Page 90: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Fortunately, statisticians have come up with a

couple of distances that are considered too far away to be a part of the population.

= 70

Page 91: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

These distances are measured in

z-scores

= 70

Page 92: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 70

Which are what these are:

z scores

Page 93: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 70

Let’s say statisticians determined that if the sample mean you collected is below a -1.96 z-score or above a +1.96 z-score that that’s

just too far away from the mean to be a part of the population.

Page 94: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 70

We know from previous slides that only 2.5% of the scores are below a -1.96

Page 95: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 70

We know from previous slides that only 2.5% of the scores are below a -1.96

2.5%

Page 96: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 70

Since anything at this point or below is considered to be too rare to be a part of this population, we would conclude that the population and the sample are statistically significantly

different from one another.

2.5%

Page 97: Inputting data for a single sample t

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 70

And that’s our answer!

2.5%

Page 98: Inputting data for a single sample t

What if the sample mean had been 105?

Page 99: Inputting data for a single sample t

Since our decision rule is to determine that the sample mean is statistically significantly different than the population mean if the sample mean lies outside of the top or bottom 2.5% of all scores,

Page 100: Inputting data for a single sample t

Since our decision rule is to determine that the sample mean is statistically significantly different than the population mean if the sample mean lies outside of the top or bottom 2.5% of all scores,

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

2.5% 2.5%

Page 101: Inputting data for a single sample t

Since our decision rule is to determine that the sample mean is statistically significantly different than the population mean if the sample mean lies outside of the top or bottom 2.5% of all scores,

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 105

2.5% 2.5%

Page 102: Inputting data for a single sample t

. . . and the sample mean (105) does not lie in these outer regions,

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 105

2.5% 2.5%

Page 103: Inputting data for a single sample t

Therefore, we would say that this is not a rare event and the probability that the sample is significantly similar to the population is high.

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 105

2.5% 2.5%

Page 104: Inputting data for a single sample t

By the way, how do we figure out the z-score for an IQ score of 105.

Page 105: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

Page 106: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

- mSD

Page 107: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

- mSDHere’s our

sample mean: 70

Page 108: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

70 - mSDHere’s our

sample mean: 70

Page 109: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

70 - mSD

Here’s our Population mean: 100

Page 110: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

70 - 100SD

Here’s our Population mean: 100

Page 111: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

70 - 100SD

Here’s our Standard

Deviation: 15

Page 112: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

70 - 10015

Here’s our Standard

Deviation: 15

Page 113: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

70 - 10015

Page 114: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

3015

Page 115: Inputting data for a single sample t

We use the following formula to compute z-scores across the normal distribution:

2.0

Page 116: Inputting data for a single sample t

A z score of 2 is located right here

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

= 70

2.5% 2.5%

Page 117: Inputting data for a single sample t

In some instances we may not know the population standard deviation s (in this case 15).

Page 118: Inputting data for a single sample t

Without the standard deviation of the population we cannot determine the z-scores or the probability that a sample mean is too far away to be apart of the population.

Page 119: Inputting data for a single sample t

Without the standard deviation of the population we cannot determine the z-scores or the probability that a sample mean is too far away to be apart of the population.

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Page 120: Inputting data for a single sample t

Without the standard deviation of the population we cannot determine the z-scores or the probability that a sample mean is too far away to be apart of the population.

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

50%

Page 121: Inputting data for a single sample t

Without the standard deviation of the population we cannot determine the z-scores or the probability that a sample mean is too far away to be apart of the population.

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

16.5%

Page 122: Inputting data for a single sample t

Without the standard deviation of the population we cannot determine the z-scores or the probability that a sample mean is too far away to be apart of the population.

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

2.5%

Page 123: Inputting data for a single sample t

Without the standard deviation of the population we cannot determine the z-scores or the probability that a sample mean is too far away to be apart of the population.

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Etc.

Page 124: Inputting data for a single sample t

Therefore these values below cannot be computed:

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Page 125: Inputting data for a single sample t

Therefore these values below cannot be computed:

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145+2.58 sd

Page 126: Inputting data for a single sample t

When we only know the population mean we use the Single Sample t-test.

Page 127: Inputting data for a single sample t

Actually whenever we are dealing with a population and a sample, we generally use a single-sample t-test.

Page 128: Inputting data for a single sample t

In the last example we relied on the population mean and standard deviation to determine if the sample mean was too far away from the population mean to be considered a part of the population.

Page 129: Inputting data for a single sample t

The single sample t-test relies on a concept called the estimated standard error

Page 130: Inputting data for a single sample t

The single sample t-test relies on a concept called the estimated standard error to compute something like a z-score to determine the probability distance between the population and the sample means.

Page 131: Inputting data for a single sample t

We call it estimated because as you will see it is not really feasible to compute.

Page 132: Inputting data for a single sample t

Standard error draws on two concepts:

Page 133: Inputting data for a single sample t

1. sampling distributions

Page 134: Inputting data for a single sample t

1. sampling distributions2. t-distributions

Page 135: Inputting data for a single sample t

Let’s begin with sampling distributions.

Page 136: Inputting data for a single sample t

Let’s begin with sampling distributions.

What you are about to see is purely theoretical, but it provides the justification for the formula we will use to run a single sample t-test.

Page 137: Inputting data for a single sample t

Let’s begin with sampling distributions.

What you are about to see is purely theoretical, but it provides the justification for the formula we will use to run a single sample t-test.

x̄� – μSEmean

Page 138: Inputting data for a single sample t

x̄� – μSEmean

the mean of a sample

Page 139: Inputting data for a single sample t

x̄� – μSEmean

the mean of a sample

the mean of a population

Page 140: Inputting data for a single sample t

x̄� – μSEmean

the mean of a sample

the mean of a population

the estimated standard error

Page 141: Inputting data for a single sample t

x̄� – μSEmean

In our example,

this is 70

Page 142: Inputting data for a single sample t

70 � – μSEmean

In our example, this is 70

Page 143: Inputting data for a single sample t

70 � – μSEmean

And this is 100

Page 144: Inputting data for a single sample t

70 � – 100SEmean

And this is 100

Page 145: Inputting data for a single sample t

70 � – 100SEmean

The numerator here is easy to compute

Page 146: Inputting data for a single sample t

-30SEmean

The numerator here is easy to compute

Page 147: Inputting data for a single sample t

-30SEmean

This value will help us know the distance between 70 and 100

in t-values

Page 148: Inputting data for a single sample t

-30SEmean

If the estimated standard error is

large, like 30, then the t value would be:

-30/30 = -1

Page 149: Inputting data for a single sample t

A -1.0 t-value is like a z-score as shown below:

Page 150: Inputting data for a single sample t

A -1.0 t-value is like a z-score as shown below:

m = 100

+1 sd-1 sd

+1 s=115

-1 s=85

+1.96 sd

+1.96 s=130

-1.96 s=70

-1.96 sd

-2.58 s=55

-2.58 sd

+2.58 s =145

+2.58 sd

Page 151: Inputting data for a single sample t

But since we don’t know the population standard deviation, we have to use the standard deviation of the sample (not the population as we did before) to determine the distance in standard error units (or t values).

Page 152: Inputting data for a single sample t

The focus of our theoretical justification is to explain our rationale for using information from the sample to compute the standard error or standard error of the mean.

Page 153: Inputting data for a single sample t

The focus of our theoretical justification is to explain our rationale for using information from the sample to compute the standard error or standard error of the mean.

– μx̄�SEmean

Page 154: Inputting data for a single sample t

Here comes the theory behind standard error:

Page 155: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population.

Page 156: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution.

Page 157: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution.

x̄� = 70

SD = 10

Page 158: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution. And then let’s say we took another sample of 20 with its mean and distribution,

x̄� = 70

Page 159: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution. And then let’s say we took another sample of 20 with its mean and distribution,

x̄� = 70 x̄� = 100

Page 160: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution. And then let’s say we took another sample of 20 with its mean and distribution, and another

x̄� = 70 x̄� = 100

Page 161: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution. And then let’s say we took another sample of 20 with its mean and distribution, and another

x̄� = 70 x̄� = 100 x̄� = 120

Page 162: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution. And then let’s say we took another sample of 20 with its mean and distribution, and another, and another

x̄� = 70 x̄� = 100 x̄� = 120 x̄� = 140

Page 163: Inputting data for a single sample t

Here comes the theory behind standard error:

Imagine we took a sample of 20 IQ scores from the population. This sample of 20 would have its own IQ mean, standard deviation and distribution. And then let’s say we took another sample of 20 with its mean and distribution, and another, and another, and so on…

x̄� = 70 x̄� = 100 x̄� = 120 x̄� = 140

Page 164: Inputting data for a single sample t

Let’s say, theoretically, that we do this one hundred times.

Page 165: Inputting data for a single sample t

Let’s say, theoretically, that we do this one hundred times.

We now have 100 samples of 20 person IQ scores:

Page 166: Inputting data for a single sample t

Let’s say, theoretically, that we do this one hundred times.

We now have 100 samples of 20 person IQ scores:

Page 167: Inputting data for a single sample t

Let’s say, theoretically, that we do this one hundred times.

We now have 100 samples of 20 person IQ scores: We take the mean of each of those samples:

x̄� = 110 x̄� = 102 x̄� = 120 x̄� = 90

x̄� = 114 x̄� = 100 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 100

Page 168: Inputting data for a single sample t

And we create a new distribution called the sampling distribution of the means

x̄� = 110 x̄� = 102 x̄� = 120 x̄� = 90

x̄� = 114 x̄� = 100 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 100

Page 169: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

Page 170: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 171: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 172: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 173: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 174: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 175: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 176: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 177: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 178: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 179: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 180: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 181: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 182: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 183: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 184: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

Page 185: Inputting data for a single sample t

And

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

70 75 70 85 90 95 100 105 110 115 120 125

etc. …

Page 186: Inputting data for a single sample t

And then we do something interesting. We take the standard deviation of this sampling distribution.

Page 187: Inputting data for a single sample t

And then we do something interesting. We take the standard deviation of this sampling distribution. If these sample means are close to one another then the standard deviation will be small.

Page 188: Inputting data for a single sample t

And then we do something interesting. We take the standard deviation of this sampling distribution. If these sample means are close to one another then the standard deviation will be small.

70 75 70 85 90 95 100 105 110 115 120 125

Page 189: Inputting data for a single sample t

And then we do something interesting. We take the standard deviation of this sampling distribution. If these sample means are far apart from one another then the standard deviation will be large.

70 75 70 85 90 95 100 105 110 115 120 125

Page 190: Inputting data for a single sample t

This standard deviation of the sampling distribution of the means has another name:

Page 191: Inputting data for a single sample t

This standard deviation of the sampling distribution of the means has another name: the standard error.

Page 192: Inputting data for a single sample t

This standard deviation of the sampling distribution of the means has another name: the standard error.

x̄� – μSEmean

the estimated standard error

Page 193: Inputting data for a single sample t

Standard error is a unit of measurement that makes it possible to determine if a raw score difference is really significant or not.

Page 194: Inputting data for a single sample t

Standard error is a unit of measurement that makes it possible to determine if a raw score difference is really significant or not.

Think of it this way. If you get a 92 on a 100 point test and the general population gets on average a 90, is there really a significant difference between you and the population at large? If you retook the test over and over again would you likely outperform or underperform their average of 90?

Page 195: Inputting data for a single sample t

Standard error is a unit of measurement that makes it possible to determine if a raw score difference is really significant or not.

Think of it this way. If you get a 92 on a 100 point test and the general population gets on average a 90, is there really a significant difference between you and the population at large? If you retook the test over and over again would you likely outperform or underperform their average of 90?

Standard error helps us understand the likelihood that those results would replicate the same way over and over again . . . or not.

Page 196: Inputting data for a single sample t

So let’s say we calculate the standard error to be 0.2. Using the formula below we will determine how many standard error units you are apart from each other.

Page 197: Inputting data for a single sample t

So let’s say we calculate the standard error to be 0.2. Using the formula below we will determine how many standard error units you are apart from each other.

x̄� – μSEmean

t =

Page 198: Inputting data for a single sample t

So let’s say we calculate the standard error to be 0.2. Using the formula below we will determine how many standard error units you are apart from each other.

92 – 900.2

t =standard error

your score

population average

Page 199: Inputting data for a single sample t

So let’s say we calculate the standard error to be 0.2. Using the formula below we will determine how many standard error units you are apart from each other.

20.2

t =

Page 200: Inputting data for a single sample t

So let’s say we calculate the standard error to be 0.2. Using the formula below we will determine how many standard error units you are apart from each other.

20.2

t =This is the raw

score difference

Page 201: Inputting data for a single sample t

So let’s say we calculate the standard error to be 0.2. Using the formula below we will determine how many standard error units you are apart from each other.

10.0t =

And this is the difference in

standard error units

Page 202: Inputting data for a single sample t

So let’s say we calculate the standard error to be 0.2. Using the formula below we will determine how many standard error units you are apart from each other.

10.0t =And this is the

difference in standard error units, otherwise known as a t statistic

or t value

Page 203: Inputting data for a single sample t

So, while your test score is two raw scores above the average for the population, you are 10.0 standard error units higher than the population score.

Page 204: Inputting data for a single sample t

So, while your test score is two raw scores above the average for the population, you are 10.0 standard error units higher than the population score.

While 2.0 raw scores do not seem like a lot, 10.0 standard error units constitute a big difference!

Page 205: Inputting data for a single sample t

So, while your test score is two raw scores above the average for the population, you are 10.0 standard error units higher than the population score.

While 2.0 raw scores do not seem like a lot, 10.0 standard error units constitute a big difference!

This means that this result is most likely to replicate and did not happen by chance.

Page 206: Inputting data for a single sample t

But what if the standard error were much bigger, say, 4.0?

Page 207: Inputting data for a single sample t

But what if the standard error were much bigger, say, 4.0?

x̄� – μSEmean

t =

Page 208: Inputting data for a single sample t

But what if the standard error were much bigger, say, 4.0?

92 – 904.0

t =standard error

your score

population average

Page 209: Inputting data for a single sample t

But what if the standard error were much bigger, say, 4.0?

24.0

t =

Page 210: Inputting data for a single sample t

But what if the standard error were much bigger, say, 4.0?

0.5t =

Page 211: Inputting data for a single sample t

Once again, your raw score difference is still 2.0 but you are only 0.5 standard error units apart. That distance is most likely too small to be statistically significantly different if replicated over a hundred times.

Page 212: Inputting data for a single sample t

Once again, your raw score difference is still 2.0 but you are only 0.5 standard error units apart. That distance is most likely too small to be statistically significantly different if replicated over a hundred times.

We will show you how to determine when the number of standard error units is significantly different or not.

Page 213: Inputting data for a single sample t

One more example:

Page 214: Inputting data for a single sample t

One more example:

Let’s say the population average on the test is 80 and you still received a 92. But the standard error is 36.0.

Page 215: Inputting data for a single sample t

One more example:

Let’s say the population average on the test is 80 and you still received a 92. But the standard error is 36.0. Let’s do the math again.

Page 216: Inputting data for a single sample t

One more example:

Let’s say the population average on the test is 80 and you still received a 92. But the standard error is 36.0. Let’s do the math again.

x̄� – μSEmean

t =

Page 217: Inputting data for a single sample t

One more example:

Let’s say the population average on the test is 80 and you still received a 92. But the standard error is 36.0. Let’s do the math again.

92 – 8036.0

t =standard error

your score

population average

Page 218: Inputting data for a single sample t

One more example:

Let’s say the population average on the test is 70 and you still received a 92. But the standard error is 36.0. Let’s do the math again.

1236.0

t =

Page 219: Inputting data for a single sample t

One more example:

Let’s say the population average on the test is 70 and you still received a 92. But the standard error is 36.0. Let’s do the math again.

0.3t =

Page 220: Inputting data for a single sample t

In this case, while your raw score is 12 points higher (a large amount), you are only 0.3 standard error units higher.

Page 221: Inputting data for a single sample t

In this case, while your raw score is 12 points higher (a large amount), you are only 0.3 standard error units higher. The standard error is so large (36.0) that if you were to take the test 1000 times with no growth in between it is most likely that your scores would vary greatly (92 on one day, 77 on another day, 87 on another day and so on and so forth.)

Page 222: Inputting data for a single sample t

In summary, the single sample test t value is the number of standard error units that separate the sample mean from the population mean:

Page 223: Inputting data for a single sample t

In summary, the single sample test t value is the number of standard error units that separate the sample mean from the population mean:

Let’s see this play out with our original example.

x̄� – μSEmean

t =

Page 224: Inputting data for a single sample t

Let’s say our of 20 has an average IQ score of 70.

Page 225: Inputting data for a single sample t

Let’s say our of 20 has an average IQ score of 70.

x̄� – μSEmean

t =

Page 226: Inputting data for a single sample t

Let’s say our of 20 has an average IQ score of 70.

70 – μSEmean

t =

Page 227: Inputting data for a single sample t

We already know that the population mean is 100.

70 – μSEmean

t =

Page 228: Inputting data for a single sample t

We already know that the population mean is 100.

70 – 100SEmean

t =

Page 229: Inputting data for a single sample t

Let’s say the Standard Error of the Sampling Distribution means is 5

70 – 100SEmean

t =

Page 230: Inputting data for a single sample t

Let’s say the Standard Error of the Sampling Distribution means is 5

70 – 1005

t =

Page 231: Inputting data for a single sample t

The t value would be:

70 – 1005

t =

Page 232: Inputting data for a single sample t

The t value would be:

-305

t =

Page 233: Inputting data for a single sample t

The t value would be:

-6t =

Page 234: Inputting data for a single sample t

So all of this begs the question: How do I know if a t value of -6 is rare or common?

Page 235: Inputting data for a single sample t

So all of this begs the question: How do I know if a t value of -6 is rare or common?

• If it is rare we can accept the null hypothesis and say that there is a difference between the sample and the population.

Page 236: Inputting data for a single sample t

So all of this begs the question: How do I know if a t value of -6 is rare or common?

• If it is rare we can accept the null hypothesis and say that there is a difference between the sample and the population.

• If it is common we can reject the null hypothesis and say there is not a significant difference between veggie eating IQ scores and the general population IQ scores (which in this unique case is what we want)

Page 237: Inputting data for a single sample t

So all of this begs the question: How do I know if a t value of 5 is rare or common?

• If it is rare we can accept the null hypothesis and say that there is a difference between the sample and the population.

• If it is common we can reject the null hypothesis and say there is not a significant difference between veggie eating IQ scores and the general population IQ scores (which in this unique case is what we want)

Here is what we do: We compare this value (6) with the critical t value.

Page 238: Inputting data for a single sample t

What is the critical t value?

Page 239: Inputting data for a single sample t

What is the critical t value?

The critical t value is a point in the distribution that arbitrarily separates the common from the rare occurrences.

Page 240: Inputting data for a single sample t

What is the critical t value?

The critical t value is a point in the distribution that arbitrarily separates the common from the rare occurrences.

Page 241: Inputting data for a single sample t

What is the critical t value?

The critical t value is a point in the distribution that arbitrarily separates the common from the rare occurrences.

rare occurrence

Page 242: Inputting data for a single sample t

What is the critical t value?

The critical t value is a point in the distribution that arbitrarily separates the common from the rare occurrences.

rare occurrence

rare occurrence

Page 243: Inputting data for a single sample t

What is the critical t value?

The critical t value is a point in the distribution that arbitrarily separates the common from the rare occurrences.

rare occurrence

common occurrence

rare occurrence

Page 244: Inputting data for a single sample t

What is the critical t value?

The critical t value is a point in the distribution that arbitrarily separates the common from the rare occurrences.

This represents the rare/common possibilities used to determine if the sample mean is similar the population mean).

rare occurrence

common occurrence

rare occurrence

Page 245: Inputting data for a single sample t

If this were a normal distribution the red line would have a z critical value of + or – 1.96

rare occurrence

common occurrence

rare occurrence

Page 246: Inputting data for a single sample t

If this were a normal distribution the red line would have a z critical value of + or – 1.96

rare occurrence

common occurrence

rare occurrence

Page 247: Inputting data for a single sample t

If this were a normal distribution the red line would have a z critical value of + or – 1.96

rare occurrence

common occurrence

rare occurrence

Page 248: Inputting data for a single sample t

If this were a normal distribution the red line would have a z critical value of + or – 1.96 (which is essentially a t critical value but for a normal distribution.)

rare occurrence

common occurrence

rare occurrence

Page 249: Inputting data for a single sample t

A t or z value of + or – 1.96 means that 95% of the scores are in the center of the distribution and 5% are to the left and right of it.

Page 250: Inputting data for a single sample t

A t or z value of + or – 1.96 means that 95% of the scores are in the center of the distribution and 5% are to the left and right of it.

common occurrence

rare occurrence

rare occurrence

Page 251: Inputting data for a single sample t

A t or z value of + or – 1.96 means that 95% of the scores are in the center of the distribution and 5% are to the left and right of it.

rare occurrence

common occurrence

rare occurrence

+1.96-1.96

Page 252: Inputting data for a single sample t

A t or z value of + or – 1.96 means that 95% of the scores are in the center of the distribution and 5% are to the left and right of it.

rare occurrence

common occurrence

rare occurrence

+1.96-1.96

Page 253: Inputting data for a single sample t

A t or z value of + or – 1.96 means that 95% of the scores are in the center of the distribution and 5% are to the left and right of it.

rare occurrence

common occurrence

rare occurrence

+1.96-1.96

Page 254: Inputting data for a single sample t

A t or z value of + or – 1.96 means that 95% of the scores are in the center of the distribution and 5% are to the left and right of it.

rare occurrence

common occurrence

rare occurrence

+1.96-1.96

Page 255: Inputting data for a single sample t

A t or z value of + or – 1.96 means that 95% of the scores are in the center of the distribution and 5% are to the left and right of it.

rare occurrence

common occurrence

rare occurrence

+1.96-1.96

Page 256: Inputting data for a single sample t

So if the t value computed from this equation:

Page 257: Inputting data for a single sample t

So if the t value computed from this equation:

x̄� – μSEmean

t =

Page 258: Inputting data for a single sample t

So if the t value computed from this equation:

… is between -1.96 and +1.96 we would say that that result is not rare and we would fail to reject the null hypothesis.

x̄� – μSEmean

t =

Page 259: Inputting data for a single sample t

So if the t value computed from this equation:

… is between -1.96 and +1.96 we would say that that result is not rare and we would fail to reject the null hypothesis.

– μx̄�SEmean

t =

rare occurrence

common occurrence

rare occurrence

+1.96-1.96

Page 260: Inputting data for a single sample t

However, if the t value is smaller than -1.96 or larger than +1.96 we would say that that result is rare and we would reject the null hypothesis.

Page 261: Inputting data for a single sample t

However, if the t value is smaller than -1.96 or larger than +1.96 we would say that that result is rare and we would reject the null hypothesis.

rare occurrence

common occurrence

rare occurrence

+1.96-1.96

Page 262: Inputting data for a single sample t

So if the t value computed from this equation:

Page 263: Inputting data for a single sample t

So if the t value computed from this equation:

x̄� – μSEmean

t =

Page 264: Inputting data for a single sample t

So if the t value computed from this equation:

… is between -1.96 and +1.96 we would say that that result is not rare and we would fail to reject the null hypothesis.

x̄� – μSEmean

t =

Page 265: Inputting data for a single sample t

As a review, let’s say the distribution is normal. When the distribution is normal and we want to locate the z critical for a two tailed test at a level of significance of .05 (which means that if we took 100 samples we are willing to be wrong 5 times and still reject the null hypothesis), the z critical would be -+1.96.

Page 266: Inputting data for a single sample t

As a review, let’s say the distribution is normal. When the distribution is normal and we want to locate the z critical for a two tailed test at a level of significance of .05 (which means that if we took 100 samples we are willing to be wrong 5 times and still reject the null hypothesis), the z critical would be -+1.96.

+ 1.96- 1.96

Page 267: Inputting data for a single sample t

Because we generally do not have the resources to take 100 IQ samples of those who eat veggies, we have to estimate the t value from one sample.

Page 268: Inputting data for a single sample t

Because we generally do not have the resources to take 100 IQ samples of those who eat veggies, we have to estimate the t value from one sample.

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

Page 269: Inputting data for a single sample t

Because we generally do not have the resources to take 100 IQ samples of those who eat veggies, we have to estimate the t value from one sample.

x̄� = 110 x̄� = 100 x̄� = 100 x̄� = 90

x̄� = 115 x̄� = 70 x̄� = 120 x̄� = 90 x̄� = 95

x̄� = 100 x̄� = 120 x̄� = 90

x̄� = 115 x̄� = 105

x̄� = 110

Page 270: Inputting data for a single sample t

Because we generally do not have the resources to take 100 IQ samples of those who eat veggies, we have to estimate the t value from one sample.

x̄� = 110

Page 271: Inputting data for a single sample t

Because we generally do not have the resources to take 100 IQ samples of those who eat veggies, we have to estimate the t value from one sample.

So let’s imagine we selected a sampleof 20 veggie eaters with an average IQ score of 70.

x̄� = 70

Page 272: Inputting data for a single sample t

Because we generally do not have the resources to take 100 IQ samples of those who eat veggies, we have to estimate the t value from one sample.

So let’s imagine we selected a sampleof 20 veggie eaters with an average IQ score of 70.

Because we did not take hundreds of samples of 20 veggie eaters each, average each sample’s IQ scores, and form a sampling distribution from which we could compute the standard error and then the t value, we have to figure out another way to compute an estimate of the standard error.

x̄� = 70

Page 273: Inputting data for a single sample t

Because we generally do not have the resources to take 100 IQ samples of those who eat veggies, we have to estimate the t value from one sample.

So let’s imagine we selected a sampleof 20 veggie eaters with an average IQ score of 70.

Because we did not take hundreds of samples of 20 veggie eaters each, average each sample’s IQ scores, and form a sampling distribution from which we could compute the standard error and then the t value, we have to figure out another way to compute an estimate of the standard error. There is another way.

x̄� = 70

Page 274: Inputting data for a single sample t

Since it is not practical to collect a hundreds of samples of 20 from the population, compute their mean score and calculate the standard error, we must estimate it using the following equation:

Sn

SEmean =

Page 275: Inputting data for a single sample t

Since it is not practical to collect a hundreds of samples of 20 from the population, compute their mean score and calculate the standard error, we must estimate it using the following equation:

Sn

SEmean =

Standard Deviation of the sample

Page 276: Inputting data for a single sample t

We estimate the standard error using the following equation:

Sn

SEmean =

Standard Deviation of the sample

Page 277: Inputting data for a single sample t

We estimate the standard error using the following equation:

We won’t go into the derivation of this formula, but just know that this acts as a good substitute in the place of taking hundreds of samples and computing the standard deviation to get the actual standard error.

SEmean =

Standard Deviation of the sample

Square root of the sample size

Sn

Page 278: Inputting data for a single sample t

So remember this point: The equation below is an estimate of the standard error, not the actual standard error, because the actual standard error is not feasible to compute.

Page 279: Inputting data for a single sample t

So remember this point: The equation below is an estimate of the standard error, not the actual standard error, because the actual standard error is not feasible to compute.

Sn

SEmean =

Page 280: Inputting data for a single sample t

So remember this point: The equation below is an estimate of the standard error, not the actual standard error, because the actual standard error is not feasible to compute.

However, just know that when researchers have taken hundreds of samples and computed their means and then taken the standard deviation of all of those means they come out pretty close to one another.

Sn

SEmean =

Page 281: Inputting data for a single sample t

Now comes another critical point dealing with t-distributions.

Page 282: Inputting data for a single sample t

Now comes another critical point dealing with t-distributions. Because we are working with a sample that is generally small, we do not use the same normal distribution to determine the critical z or t (-+1.96).

Page 283: Inputting data for a single sample t

Now comes another critical point dealing with t-distributions. Because we are working with a sample that is generally small, we do not use the same normal distribution to determine the critical z or t (-+1.96). Here is what the z distribution looks like for samples generally larger than 30:

Page 284: Inputting data for a single sample t

Now comes another critical point dealing with t-distributions. Because we are working with a sample that is generally small, we do not use the same normal distribution to determine the critical z or t (-+1.96).

Here is what the z distribution looks like for samples generally larger than 30:

95% of the scores

+ 1.96- 1.96

Sample Size 30+

Page 285: Inputting data for a single sample t

As the sample size decreases the critical values increase - making it harder to get significance.

Page 286: Inputting data for a single sample t

As the sample size decreases the critical values increase - making it harder to get significance.

Notice how the t distribution gets shorter and wider when the sample size is smaller. Notice also how the t critical values increase as well.

95% of the scores

+ 2.09- 2.09

Sample Size 20

Page 287: Inputting data for a single sample t

As the sample size decreases the critical values increase - making it harder to get significance.

Notice how the t distribution gets shorter and wider when the sample size is smaller. Notice also how the t critical values increase as well.

95% of the scores

+ 2.26- 2.26

Sample Size 10

Page 288: Inputting data for a single sample t

As the sample size decreases the critical values increase - making it harder to get significance.

Notice how the t distribution gets shorter and wider when the sample size is smaller. Notice also how the t critical values increase as well.

95% of the scores

+ 2.78- 2.78

Sample Size 5

Page 289: Inputting data for a single sample t

To determine the critical t we need two values:

Page 290: Inputting data for a single sample t

To determine the critical t we need two values:• the degrees of freedom (sample size minus one [20-1 = 19])

Page 291: Inputting data for a single sample t

To determine the critical t we need two values:• the degrees of freedom (sample size minus one [20-1 = 19])• the significance level (.05 or .025 for two tailed test)

Page 292: Inputting data for a single sample t

To determine the critical t we need two values:• the degrees of freedom (sample size minus one [20-1 = 19])• the significance level (.05 or .025 for two tailed test)

Using these two values we can

locate the critical t

Page 293: Inputting data for a single sample t

So, our t critical value that separates the common from the rare occurrences in this case is + or – 2.09

Page 294: Inputting data for a single sample t

So, our t critical value that separates the common from the rare occurrences in this case is + or – 2.09

95% of the scores

+ 2.09- 2.09

Sample Size 20

Page 295: Inputting data for a single sample t

What was our calculated t value again?

Page 296: Inputting data for a single sample t

What was our calculated t value again?

x̄� – μSEmean

t =

Page 297: Inputting data for a single sample t

What was our calculated t value again?

70 – μSEmean

t =

Page 298: Inputting data for a single sample t

We already know that the population mean is 100.

70 – μSEmean

t =

Page 299: Inputting data for a single sample t

We already know that the population mean is 100.

70 – 100SEmean

t =

Page 300: Inputting data for a single sample t

We already know that the population mean is 100.

To calculate the estimated standard error of the mean distribution we use the following equation:

70 – 100SEmean

t =

Page 301: Inputting data for a single sample t

We already know that the population mean is 100.

To calculate the estimated standard error of the mean distribution we use the following equation:

Sn

SEmean =

70 – 100SEmean

t =

Page 302: Inputting data for a single sample t

We already know that the population mean is 100.

The Standard Deviation for this sample is 26.82 and the sample size, of course, is 20

Sn

SEmean =

70 – 100SEmean

t =

Page 303: Inputting data for a single sample t

We already know that the population mean is 100.

Let’s plug in those values:

Sn

SEmean =

70 – 100SEmean

t =

Page 304: Inputting data for a single sample t

We already know that the population mean is 100.

Let’s plug in those values:

26.8220

SEmean =

70 – 100SEmean

t =

Page 305: Inputting data for a single sample t

26.824.47

We already know that the population mean is 100.

Let’s plug in those values:

SEmean =

70 – 100SEmean

t =

Page 306: Inputting data for a single sample t

6.0

We already know that the population mean is 100.

Let’s plug in those values:

SEmean =

70 – 100SEmean

t =

Page 307: Inputting data for a single sample t

6.0

We already know that the population mean is 100.

Let’s plug in those values:

SEmean =

70 – 1006.0

t =

Page 308: Inputting data for a single sample t

We already know that the population mean is 100.

70 – 1006.0

t =

Page 309: Inputting data for a single sample t

Now do the math:

70 – 1006.0

t =

Page 310: Inputting data for a single sample t

The t value would be:

70 – 1006.0

t =

Page 311: Inputting data for a single sample t

The t value would be:

-306.0

t =

Page 312: Inputting data for a single sample t

The t value would be:

-5t =

Page 313: Inputting data for a single sample t

With a t value of -5 we are ready to compare it to the critical t:

Page 314: Inputting data for a single sample t

With a t value of -5 we are ready to compare it to the critical t:

Page 315: Inputting data for a single sample t

With a t value of -5 we are ready to compare it to the critical t: 2.093

Page 316: Inputting data for a single sample t

With a t value of -5 we are ready to compare it to the critical t: 2.093 or 2.093 below the mean or -2.093.

Page 317: Inputting data for a single sample t

Since a t value of +5.0 is below the cutoff point (critical t) of -2.09, we would reject the null hypothesis

Page 318: Inputting data for a single sample t

Since a t value of +5.0 is below the cutoff point (critical t) of -2.09, we would reject the null hypothesis

95% of the scores

+ 2.09- 2.09

Page 319: Inputting data for a single sample t

Here is how we would state our results:

Page 320: Inputting data for a single sample t

Here is how we would state our results:

The randomly selected sample of twenty IQ scores with a sample mean of 70 is statistically significantly different then the population of IQ scores.

Page 321: Inputting data for a single sample t

Now, what if the standard error had been much larger, say, 15.0 instead of 6.0?

Page 322: Inputting data for a single sample t

Now, what if the standard error had been much larger, say, 15.0 instead of 6.0?

70 – 1006.0

t =

Page 323: Inputting data for a single sample t

Now, what if the standard error had been much larger, say, 15.0 instead of 6.0?

70 – 1006.0

t =70 – 100

15.0t =

Page 324: Inputting data for a single sample t

Now, what if the standard error had been much larger, say, 15.0 instead of 6.0?

70 – 1006.0

t =30

15.0t =

Page 325: Inputting data for a single sample t

Now, what if the standard error had been much larger, say, 10.0 instead of 2.0?

70 – 1006.0

t = 2.0t =

Page 326: Inputting data for a single sample t

A t value of -2.0 is not below the cutoff point (critical t) of -2.09 and would be considered a common rather than a rare outcome. We therefore would fail to reject the null hypothesis

Page 327: Inputting data for a single sample t

A t value of -2.0 is not below the cutoff point (critical t) of -2.09 and would be considered a common rather than a rare outcome. We therefore would fail to reject the null hypothesis

95% of the scores

+ 2.09- 2.09 - 2.0

Page 328: Inputting data for a single sample t

So in summary, the single sample t-test helps us determine the probability that the difference between a sample and a population did or did not occur by chance.

Page 329: Inputting data for a single sample t

So in summary, the single sample t-test helps us determine the probability that the difference between a sample and a population did or did not occur by chance.

It utilizes the concepts of standard error, common and rare occurrences, and t-distributions to justify its use.

Page 330: Inputting data for a single sample t

End of Presentation