probability and statistics - montefiore...probability and statistics chapter 5: sampling to...

Post on 21-Mar-2021

1 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Probability and Statistics

Kristel Van Steen, PhD2

Montefiore Institute - Systems and Modeling

GIGA - Bioinformatics

ULg

kristel.vansteen@ulg.ac.be

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 1

CHAPTER 5: SAMPLING TO APPROXIMATE THE TRUE WORLD

1 Introduction

2 Generating data

2.1 Design of experiments

2.2 Sampling designs and towards inference

2.3 Ethics

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 2

3 Sample versus population

3.1 Introduction

3.2 Distribution of a sample

3.3 Statistics and sample moments

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 3

4 Sample mean

4.1 Mean and variance

4.2 Law of large numbers revisited

4.3 Central-limit theory revisited

4.4 Bernoulli and Poisson distribution

4.5 Exponential distribution

4.6 Uniform distribution

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 4

5 Sampling from the normal distribution

5.1 The role of normal distributions in statistics

5.2 Sample mean

5.3 The chi-square distribution

5.4 The F distribution

5.5 The Student’s t distribution

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 5

6 Future highlights

6.1 Estimating parameters

6.2 Order statistics

6.3 Sample size calculations

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 6

1 Introduction

Probability theory versus statistics

Probability and Statistics

K Van Steen

Inductive versus deductive inference

Probability and Statistics Chapter 5: Sampling to approximate the true world

Inductive versus deductive inference

Chapter 5: Sampling to approximate the true world

7

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

8

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 9

2 Generating data

2.1 Design of experiments

Obtaining data

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 10

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 11

Population versus sample

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 12

Observational studies versus experiments

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 13

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 14

Some terminology

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 15

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 16

Comparative experiments

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 17

Placebo effects

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 18

Caution about experimentation

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 19

Other ways to remove bias

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 20

Lack of realism

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 21

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 22

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 23

Randomization

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 24

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 25

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 26

Principles of experimental design

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 27

Completely randomized designs

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 28

Block designs

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 29

Matched pairs designs

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 30

Why experimental designs ?

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 31

2.2 Sampling designs and towards inference

Sampling methods

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 32

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 33

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 34

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 35

Simple random samples

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 36

Stratified samples

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 37

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 38

Caution about sampling surveys

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 39

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 40

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 41

2.3 Ethics

Institutional Review Boards

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 42

Informed Consent

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 43

Confidentiality

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 44

Clinical trials

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 45

Behavioral and social science experiments

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 46

3 Sample versus population

3.1 Introduction

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 47

Towards statistical inference

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 48

Sampling variability

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 49

3.2 Distribution of a sample

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 50

3.3 Statistics and sample moments

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 51

Practical note

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 52

Capture-recapture sampling

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 53

4 Sample mean

4.1 Mean and variance

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 54

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 55

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 56

For normally distributed populations

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 57

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 58

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 59

On a practical note

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 60

4.2 Law of large numbers revisited

• Question: using only a finite number of values of X (a random sample

of size n, say), can any reliable inference be made about E[X], “the

average of an infinite number of values of X”?

• Answer: YES!

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 61

• A positive integer n can be determined such that if a random sample

of size n or larger is taken from a population with density f(.) (with

E[X])=µ), the probability can be made as close to 1 as desired that the

sample mean X bar will deviate from µ by less than an arbitrarily

specified small quantity:

(cfr. weak law of large numbers)

• Proof: Use Chebyshev inequality

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 62

4.3 Central-limit theory revisited

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 63

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 64

How large a sample size ?

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 65

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 66

4.4 Bernoulli and Poisson distribution

See before

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 67

Binomial distributions for sample counts

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 68

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 69

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 70

Binomial distribution in statistical sampling

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 71

Reminder: sampling variability

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 72

Binomial mean and standard deviation

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 73

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 74

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 75

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 76

Sample proportions

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 77

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 78

Normal approximation

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 79

Sampling distribution of the sampling proportion

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 80

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 81

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 82

Normal approximation: continuity correction

Probability and Statistics Chapter 5: Sampling to approximate the true world

K Van Steen 83

Probability and Statistics

K Van Steen

4.5 Exponential distribution

Probability and Statistics Chapter 5: Sampling to approximate the true world

4.5 Exponential distribution

Chapter 5: Sampling to approximate the true world

84

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

85

Probability and Statistics

K Van Steen

4.6 Uniform distribution

Probability and Statistics Chapter 5: Sampling to approximate the true world

on

Chapter 5: Sampling to approximate the true world

86

Probability and Statistics

K Van Steen

5 Sampling from the normal distribution

5.1 The role of normal distributions in statistics

Probability and Statistics Chapter 5: Sampling to approximate the true world

5 Sampling from the normal distribution

5.1 The role of normal distributions in statistics

Chapter 5: Sampling to approximate the true world

87

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

88

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

89

Probability and Statistics

K Van Steen

5.2 Sample mean

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

90

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

91

Probability and Statistics

K Van Steen

5.3 The chi-square distribution

Probability and Statistics Chapter 5: Sampling to approximate the true world

square distribution

Chapter 5: Sampling to approximate the true world

92

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

93

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

94

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

95

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

96

Probability and Statistics

K Van Steen

• Recall: that if two moment generating functions both exi

equal (they agree), then the corresponding cumulative distribution

functions are the same (agree)

Probability and Statistics Chapter 5: Sampling to approximate the true world

Recall: that if two moment generating functions both exi

equal (they agree), then the corresponding cumulative distribution

functions are the same (agree)

Chapter 5: Sampling to approximate the true world

97

Recall: that if two moment generating functions both exist and are

equal (they agree), then the corresponding cumulative distribution

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

98

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

99

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

100

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

101

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

102

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

103

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

104

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

105

Probability and Statistics

K Van Steen

5.4 The F distribution

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

106

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

107

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

108

(**)

Probability and Statistics

K Van Steen

(**)

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

109

(**)

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

110

Probability and Statistics

K Van Steen

This theorem can be very useful in sampling

Probability and Statistics Chapter 5: Sampling to approximate the true world

This theorem can be very useful in sampling

Chapter 5: Sampling to approximate the true world

111

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

112

Probability and Statistics

K Van Steen

5.5 The Student’s t distribution

Probability and Statistics Chapter 5: Sampling to approximate the true world

dent’s t distribution

Chapter 5: Sampling to approximate the true world

113

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

114

(^^)

Probability and Statistics

K Van Steen

(^^)

Probability and Statistics Chapter 5: Sampling to approximate the true world

(^^)

Chapter 5: Sampling to approximate the true world

115

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

116

Probability and Statistics

K Van Steen

Probability and Statistics Chapter 5: Sampling to approximate the true world

Chapter 5: Sampling to approximate the true world

117

top related