Normal distribution

Post on 21-Dec-2015


Properties of the normal distribution

• The center of the curve represents the mean (and median and mode).

• The curve is symmetrical around the mean.

• The tails approach the x-axis asymptotically; they never actually touch it.

• The curve is bell-shaped.

• The area under the curve is 1 (by definition).

Central Limit Theorem

The mean of the means of different samples approaches the true mean. In addition, the means of different samples are normally distributed, even if the phenomenon under investigation is not normally distributed.

          X1  X2  X3  X4  M
Sample 1   6   2   5   6  4.75
Sample 2   2   3   1   6  3
Sample 3   1   1   4   6  3
Sample 4   6   2   2   1  2.75
Sample 5   1   5   1   3  2.5

Mean of the sample means

(4.75 + 3.0 + 3.0 + 2.75 + 2.5) / 5 = 3.2
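The arithmetic can be reproduced in a few lines. This is a minimal sketch using the five samples of four observations each from the Central Limit Theorem slides; the variable names are illustrative and not part of the original slides.

```python
# The five samples (X1..X4) from the Central Limit Theorem slides.
samples = [
    [6, 2, 5, 6],
    [2, 3, 1, 6],
    [1, 1, 4, 6],
    [6, 2, 2, 1],
    [1, 5, 1, 3],
]

# Mean of each individual sample (column M).
sample_means = [sum(s) / len(s) for s in samples]

# Mean of the sample means.
mean_of_means = sum(sample_means) / len(sample_means)

print(sample_means)
print(round(mean_of_means, 2))
```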

The sample means are approximately normally distributed if the samples are large enough (even if the phenomenon in the parent population is not normally distributed).

[Figure: histogram of the sample means. x-axis: case (2.50 to 5.00); y-axis: frequency (0 to 12). Mean = 3.352, Std. Dev. = 0.44802, N = 25]

[Diagram: population → one sample → mean of this sample → distribution of many sample means]

Are your data normally distributed?

This depends on:

• The distribution in the parent population (normal, slightly skewed, heavily skewed).

• The number of observations in the individual sample.

• The total number of individual samples.
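The influence of these factors can be explored with a small simulation. The sketch below is an illustration under stated assumptions, not part of the original slides: it assumes a heavily skewed parent population (an exponential distribution), draws 5000 samples at two different sample sizes, and uses skewness as a rough indicator of how far the distribution of sample means is from a symmetric bell shape. The function names are hypothetical.

```python
import random
from statistics import mean, pstdev


def skewness(values):
    """Mean cubed standardized deviation; 0 for a symmetric distribution."""
    m = mean(values)
    sd = pstdev(values)
    return mean([((v - m) / sd) ** 3 for v in values])


def simulate_sample_means(observations_per_sample, number_of_samples, rng):
    """Draw samples from a skewed (exponential) parent population
    and return the mean of each sample."""
    return [
        mean([rng.expovariate(1.0) for _ in range(observations_per_sample)])
        for _ in range(number_of_samples)
    ]


rng = random.Random(42)  # fixed seed so the run is repeatable

means_small = simulate_sample_means(2, 5000, rng)   # 2 observations per sample
means_large = simulate_sample_means(50, 5000, rng)  # 50 observations per sample

skew_small = skewness(means_small)
skew_large = skewness(means_large)

print(f"skewness with 2 observations per sample:  {skew_small:.2f}")
print(f"skewness with 50 observations per sample: {skew_large:.2f}")
```

With larger individual samples the skewness of the sample means should move closer to 0, which is what the Central Limit Theorem leads one to expect.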

Graphical representation of probability


z scores

z = (x1 – x̄) / SD
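As a worked illustration of the z-score formula, the following sketch converts a handful of values into z scores. The data values and the helper name `z_score` are assumptions chosen for the example; the SD is computed with the n – 1 denominator.

```python
from statistics import mean, stdev


def z_score(x, sample_mean, sd):
    """Distance of x from the mean, measured in standard deviations."""
    return (x - sample_mean) / sd


data = [2, 5, 6, 7, 10, 12]  # illustrative values
m = mean(data)
sd = stdev(data)  # sample standard deviation (n - 1 denominator)

z_values = [z_score(x, m, sd) for x in data]

for x, z in zip(data, z_values):
    print(f"x = {x:2d}  z = {z:+.2f}")
```

A value equal to the mean gets a z score of 0; values below the mean get negative z scores.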

Empirical rule

[Figure: normal curve; 95% of the values lie within 1.96 standard deviations of the mean]
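The shares behind the empirical rule can be checked numerically. This is a small sketch using `NormalDist` from Python's standard `statistics` module (Python 3.8 or later); the helper name `share_within` is hypothetical.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0.0, sigma=1.0)


def share_within(k):
    """Share of a normal distribution lying within k SDs of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3, 1.96):
    print(f"within {k} SD: {share_within(k):.4f}")

# z score that leaves 2.5% in each tail, i.e. encloses the central 95%.
z_95 = standard_normal.inv_cdf(0.975)
print(f"z for 95%: {z_95:.2f}")
```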

Parametric vs. non-parametric tests

You use non-parametric tests:

• If you have ordinal data.

• If you have interval data that is not normally distributed.

• If you have interval data but cannot be certain that it is normally distributed because you don’t have sufficient data (e.g. a small number of samples or small individual samples).

Confidence intervals

A confidence interval indicates a range within which the true population mean (or another parameter) lies with a given probability, based on the values of your sample.

The standard error is the equivalent of the standard deviation for the sampling distribution (i.e. the distribution of the sample means).

Confidence intervals

To calculate a confidence interval, you need:

• The mean of the sample means.

• The SD of the sample means, i.e. the standard error.

• The degree of confidence with which you want to state the estimate.

Standard error

Sample  Mean  Mean – mean of means  Squared
1       1.5   1.5 – 1.66 = –0.16    0.0256
2       1.8   1.8 – 1.66 = 0.14     0.0196
3       1.3   1.3 – 1.66 = –0.36    0.1296
4       2.0   2.0 – 1.66 = 0.34     0.1156
5       1.7   1.7 – 1.66 = 0.04     0.0016

Mean of means: 8.3 / 5 = 1.66

Sum of squared deviations: 0.292

Standard error = √(0.292 / (5 – 1)) ≈ 0.2701
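The steps of this calculation can be sketched in code. The example below takes the five sample means from the slides (1.5, 1.8, 1.3, 2.0, 1.7) and computes the standard error as the standard deviation of those means with an n – 1 denominator; the variable names are illustrative.

```python
from math import sqrt

# The five sample means from the slides.
sample_means = [1.5, 1.8, 1.3, 2.0, 1.7]

# Step 1: mean of the sample means.
mean_of_means = sum(sample_means) / len(sample_means)

# Step 2: deviation of each sample mean from the mean of means, squared.
squared_deviations = [(m - mean_of_means) ** 2 for m in sample_means]

# Step 3: sum of the squared deviations.
sum_of_squares = sum(squared_deviations)

# Step 4: divide by (number of samples - 1) and take the square root.
standard_error = sqrt(sum_of_squares / (len(sample_means) - 1))

print(f"mean of means:  {mean_of_means:.2f}")
print(f"sum of squares: {sum_of_squares:.3f}")
print(f"standard error: {standard_error:.2f}")
```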

Confidence interval

[degree of certainty] × [standard error] = x

[sample mean] +/– x = confidence interval

Confidence intervals

95% degree of certainty = 1.96 [z-score]

Confidence interval of the first sample (mean = 1.5):

1.96 × 0.2701 = 0.53

1.5 +/– 0.53 = 0.97–2.03

We can be 95% certain that the population mean is located in the range between 0.97 and 2.03.
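The same calculation as a short sketch. The function name `confidence_interval` and its default argument are assumptions for the example; the inputs (mean 1.5, standard error 0.2701, z score 1.96) come from the slides.

```python
def confidence_interval(sample_mean, standard_error, z=1.96):
    """Return (lower, upper) bounds: sample mean +/- z * standard error."""
    margin = z * standard_error
    return sample_mean - margin, sample_mean + margin


# First sample from the slides: mean = 1.5, standard error = 0.2701.
lower, upper = confidence_interval(1.5, 0.2701)

print(f"95% confidence interval: {lower:.2f} to {upper:.2f}")
```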

Standard error (alternative formula)

Standard error = SD / √N

Exercise

Mean: 7

SD: √( ((2–7)² + (5–7)² + (6–7)² + (7–7)² + (10–7)² + (12–7)²) / (6 – 1) ) = 3.58

Standard error: 3.58 / √6 = 1.46

Confidence interval: 1.46 × 1.96 = 2.86

7 +/– 2.86 = 4.14 – 9.86
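The exercise can be checked end to end with a few lines of code. This sketch assumes the six data values that appear in the SD formula of the exercise (2, 5, 6, 7, 10, 12) and uses the alternative formula, standard error = SD / √N; the variable names are illustrative.

```python
from math import sqrt
from statistics import mean, stdev

# Data values read off the SD formula in the exercise.
data = [2, 5, 6, 7, 10, 12]

m = mean(data)                 # sample mean
sd = stdev(data)               # sample SD with n - 1 denominator
se = sd / sqrt(len(data))      # standard error (alternative formula)
margin = 1.96 * se             # 95% degree of certainty
lower, upper = m - margin, m + margin

print(f"mean: {m}")
print(f"SD: {sd:.2f}")
print(f"standard error: {se:.2f}")
print(f"95% confidence interval: {lower:.2f} to {upper:.2f}")
```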