Download - Statistics lecture 8 (chapter 7)
![Page 1: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/1.jpg)
1
![Page 2: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/2.jpg)
2
• A PARAMETER is a number that describes a population statistic
• A STATISTIC is a number that describes a characteristic in the sample data
![Page 3: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/3.jpg)
3
• Inferential statistics
– Draw conclusion from data
– Sample
• Describe data
– Use sample statistic to infer population parameter
• Estimation
• Hypothesis testing
![Page 4: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/4.jpg)
4
Data collection
Graphs
Measures • location • spread
Descriptive statistics
Statistical inference
Estimation
Hypothesis testing
Decision making
Raw data
Information
![Page 5: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/5.jpg)
5
• Estimation
– Numerical values assigned to a population parameter using a sample statistic
• Sample mean used to estimate population
mean μ
• Sample variance s2 used to estimate population
variance σ2
• Sample stand dev s used to estimate population
stand dev σ
• Sample proportion used to estimate population
proportion p
x
p̂
![Page 6: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/6.jpg)
6
• Steps in estimation
– Select sample
– Get required information from the sample
– Calculate sample statistic
– Assign values to population parameter
![Page 7: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/7.jpg)
7
• Read example 7.1 page 214
![Page 8: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/8.jpg)
8
• Sample statistic used to estimate a
population parameter is called an
ESTIMATOR
• An estimator is a rule that tells us how to
calculate the estimate and it is generally
expressed as a formula
![Page 9: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/9.jpg)
9
POPULATION
PAPARMETER
ESTIMATE
(VALUE OF
STATISTIC)
ESTIMATOR
(Formula)
MEAN µ
VARIANCE σ2 s2
PROPORTION p
![Page 10: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/10.jpg)
10
• Two types of estimate:-
–Point estimates
–Interval estimates
![Page 11: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/11.jpg)
11
• A single number that is calculated from
sample data
• Resulting number then used to estimate
the true value of the corresponding
population parameter
![Page 12: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/12.jpg)
12
• A random sample of 10 employees
reveals the following dental expenses in
rands for the preceding year:
660; 2172; 1476; 510; 3060; 1248; 1038;
2550; 1896 and 1074
Determine a point estimate for:-
1. The population mean
2. The population variance
![Page 13: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/13.jpg)
13
• Answer p215
![Page 14: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/14.jpg)
14
• If we take another random sample of 10
employees the mean obtained for this random
sample will almost certainly differ from the one
you have just calculated
• Point estimates do not provide information
about how close the point estimate is to the
population parameter
• Point estimates do not consider the sample
size or variability of the population from which
the sample was taken
![Page 15: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/15.jpg)
15
• Sample size and variability of population will
affect the accuracy of the estimate so a point
estimate is really not very useful
• This problem can be overcome by using
INTERVAL ESTIMATES
![Page 16: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/16.jpg)
16
• No 1 – 6 page 216
![Page 17: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/17.jpg)
17
Point Estimates
– A single sample statistic used to estimate
the population parameter
Population parameter
Sample distribution
Point estimator
Population distribution
![Page 18: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/18.jpg)
18
Confidence interval
– An interval is calculated around the sample
statistic
Confidence interval
Population parameter
included in interval
![Page 19: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/19.jpg)
19
Confidence interval
– An upper and lower limit within in which the
population parameter is expected to lie
– Limits will vary from sample to sample
– Specify the probability that the interval will
include the parameter
– Typical used 90%, 95%, 99%
– Probability denoted by
• (1 – α) known as the level of confidence
• α is the significance level
Example:
Meaning of a 90% confidence interval:
90% of all possible samples taken from
population will produce an interval that will
include the population parameter
![Page 20: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/20.jpg)
• An interval estimate consists of a range of
values with an upper & lower limit
• The population parameter is expected to lie
within this interval with a certain level of
confidence
• Limits of an interval vary from sample to sample
therefore we must also specify the probability
that an interval will contain the parameter
• Ideally probability should be as high as possible
20
![Page 21: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/21.jpg)
SO REMEMBER
•We can choose the probability
•Probability is denoted by (1-α)
•Typical values are 0.9 (90%); 0.95 (95%) and 0.99 (99%)
•The probability is known as the LEVEL OF CONFIDENCE
•α is known as the SIGNIFICANCE LEVEL
•α corresponds to an area under a curve
•Since we take the confidence level into account when we
estimate an interval, the interval is called CONFIDENCE
INTERVAL
21
![Page 22: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/22.jpg)
22
Confidence interval for Population Mean, n ≥ 30
- population need not be normally distributed
- sample will be approximately normal
2
2
1 1
1 1
( ) , if is known
( ) , if is not known
CI x Zn
sCI x Z
n
![Page 23: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/23.jpg)
Example :
90% confidence interval
1 – 0,90
0,10
0,100,05
2 2
23
2
2
1 1
1 1
( ) , if is known
( ) , if is not known
CI x Zn
sCI x Z
n
Lower conf limit Upper conf limit x
1 - α
2
2
Confidence level
= 1 - α
1
1 – α
= 0,90 0,052
0,05
2
90% of all sample
means fall in this area
These 2 areas added
together = α i.e. 10%
![Page 24: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/24.jpg)
24
![Page 25: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/25.jpg)
See handout
25
![Page 26: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/26.jpg)
A random sample of repair costs for 150
hotel rooms gave a mean repair cost of
R84.30 and a standard deviation of R37.20.
Construct a 95% confidence interval for the
mean repair cost for a population of 2000
hotel rooms
26
![Page 27: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/27.jpg)
Example 7.3 p218
27
![Page 28: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/28.jpg)
28
• Four commonly used confidence levels
1 - α α z
0,9 0,1 0,05 1,64
0,95 0,05 0,025 1,96
0,98 0,02 0,01 2,33
0,99 0,01 0,005 2,57
2
![Page 29: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/29.jpg)
29
12
12
12
91,64 1,48
1009
1,96 1,76100
9
92,57 2,3
0%
95%
9910
% 10
sx z x x
ns
x z x xns
x z x xn
• Confidence interval for Population Mean, n ≥ 30
• Example:
– Estimate the population mean with 90%, 95% and 99% confidence, if it is known that
– s = 9 and n = 100
– Solution: The confidence intervals are
![Page 30: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/30.jpg)
30
1, 48 Width of interval = 2 x 1,48 = 2,961,76 Width of interval = 2 x 1,76 = 3,522,31 Width of interval
90%95%
= 2 x 99% 2,31 = 4,62
xxx
90%
95%
99%
Confidence level influence width of interval
Margin of error becomes smaller if:
• z-value smaller
• σ smaller
• n larger
![Page 31: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/31.jpg)
31
• Example
– A survey was conducted amongst 85 children to determine the number of hours they spend in front of the TV every week.
– The results indicate that the mean for the sample was 24,5 hours with a standard deviation of 2,98 hours.
– Estimate with 95% confidence the population mean hours that children spend watching TV.
12
2,9824,5 1,96
85
24,5 0,634
23,866 ; 25,134
sx z
n
95% confident the mean hours
children spend watching TV is
between 23,866 and 25,134
hours per week
![Page 32: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/32.jpg)
32
• Confidence interval for Population
Mean, n < 30 – For a small sample from a normal population and σ is
known, the normal distribution can be used.
– If σ is unknown we use s to estimate σ
– We need to replace the normal distribution with the
t-distribution ▬ standard normal
▬ t-distribution 2
1 1;1( )
n
sCI x t
n
![Page 33: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/33.jpg)
t Distribution
• Refer to handout on how to read the
critical value t n-1; 1- 𝛼
2
33
![Page 34: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/34.jpg)
34
• Example – The manager of a small departmental store is concerned
about the decline of his weekly sales.
– He calculated the average and standard deviation of his sales for the past 12 weeks,
– Estimate with 99% confidence the population mean sales of the departmental store.
1;12
134612400 3,106
12
12400 1206,86
11193,14 ; 13606,86
n
sx t
n
= R12400 and s = R1346x
99% confident the mean weekly
sales will be between
R11 193,14 and R13 606,86
t11;0.995
![Page 35: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/35.jpg)
EXAMPLE 2
• A study of absenteeism among workers at
a local mine during the previous year was
carried out. A random sample of 25 miners
revealed a mean absenteeism of 9.7days
with a variance of 16 days. Construct a
confidence interval for the average
number of days of absence for miners for
last year. Assume the population is
normally distributed. 35
![Page 36: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/36.jpg)
EXAMPLE 2 - ANSWER
• Example 7.6, page 222 textbook
36
![Page 37: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/37.jpg)
CLASSWORK
• Do concept questions 7 – 19, page 223
textbook
37
![Page 38: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/38.jpg)
38
• Confidence interval for Population
proportion – Each element in the population can be classified as a
success or failure
– Proportion always between 0 and 1
– For large samples the sample proportion is
approximately normal
21 1
ˆ ˆ(1 )ˆ( )
p pCI p p z
n
number of successesˆSample proportion = =
sample size
xp
n
p̂
![Page 39: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/39.jpg)
39
• Example – A sales manager needs to determine the proportion of
defective radio returns that is made on a monthly basis.
– In December 65 new radios were sold and in January 13 were returned for rework.
– Estimate with 95% confidence the population proportion of returns for December.
12
13ˆ 0,2
65ˆ ˆ(1 ) 0,2(1 0,2)
ˆ 0,2 1,9665
0,2 0,097
0,103 ; 0,297
p
p pp z
n
95% confident the mean monthly
returns will be will be between
10,3% and 29,7%
![Page 40: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/40.jpg)
EXAMPLE 2
• A cellphone retailer is experiencing
problems with a high % of returns. The
quality control manager wants to estimate
the % of all sales that result in returns. A
sample of 40 sales showed that 8
cellphones were returned. Construct a
99% confidence interval for the % of all
sales that result in returns
40
![Page 41: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/41.jpg)
EXAMPLE 2
• Answer – example 7.9 page 225, textbook
41
![Page 42: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/42.jpg)
42
• Confidence interval for Population
Variance
– Population variance very often important
– Very often required for quality control
– Sample drawn from a normal population
– Sample variance is based on a random
sample of size n
– Distribution of s2 resulted from repeated
sampling is a χ2 (chi-square) distribution
![Page 43: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/43.jpg)
43
• Confidence interval for Population Variance
– χ2 (chi-square) distribution
• Skewed to the right distribution
• Shape varies in relation to the degrees of freedom
• Critical values from the χ2-table A4(read same way
as t distribution)
• Critical value of χ21 - α specifies an area to the left
• Critical value of χ2α specifies an area to the right
![Page 44: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/44.jpg)
44
• Confidence interval for Population Variance
2 2
2 22
1 2 2
1;1 1;
( 1) ( 1)( ) ;
n n
n s n sCI
![Page 45: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/45.jpg)
45
• Example – For a binding machine to work on its optimum capacity
the variation in the temperature of the room is vital.
– The temperature for 30 consecutive hours were measured and sample standard deviation were found to be 0,68 degrees.
– What will be a 90% confidence interval for σ2?
90% confident the variation in
temperature will be will be between
0,315 and 0,757 degrees
2 2
2 2 2 22
1 2 2 2 2
29;0,95 29;0,051;1 1;
2 2
( 1) ( 1) 29(0,68 ) 29(0,68 )( ) ; ;
29(0,68 ) 29(0,68 );
42,56 17,71
0,315;0,757
n n
n s n sCI
n= 30; s = 0.68; α = 0.1
![Page 46: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/46.jpg)
The total revenue for a sample of 10
hardware stores in a well-known chain was
recorded for a particular week. The results
(in R1000) were as follows: 129.78;130.11;
129.83;130.02;129.67;129.87;129.88;129.86
130.18 and 129.91. Construct a 90%
confidence interval for the standard
deviation of the total weekly revenue for all
hardware stores in this chain
46
![Page 47: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/47.jpg)
47
Answer example 2
![Page 48: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/48.jpg)
Answer example 2 contd
48
CI(2)0,9 =
n 1 s2
n1;1
2
2;
n 1 s2
n1;
2
2
=
9 0,0234 16,92
;9 0,0234
3,32
= [0,0124;0,0634] CI()0,9 = [
0,0124 ;
0,0634 ]
= [0,1114;0,2518]
![Page 49: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/49.jpg)
CONCEPT QUESTIONS
• Nos 20 – 28, page 228 textbook
49
![Page 50: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/50.jpg)
Where are we?
• So far we have looked at interval estimation
procedures for µ, p and σ2 for a SINGLE
POPULATION
• We are now going to look at interval estimation
procedures for:-
– The difference between two population means
– The difference between two population proportions
– The ratio of two population variances
50
![Page 51: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/51.jpg)
51
• Interval estimation for two populations – There is different procedures for the differences in
means, proportions and variances.
Population
1
Sample
1
Population
2
Sample
2
Mean μ1 μ
2
Variance σ21 s2
1 σ2
2 s2
2
Std dev σ1 s
1 σ
2 s
2
Size N1 n
1 N
2 n
2
Proportion P1 P
2
1x 2x
1p̂ 2p̂
![Page 52: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/52.jpg)
52
• Confidence interval difference in means
– Large independent samples
2
2
2 2
1 21 2 1 1 2 1
1 2
2 2
1 2
2 2
1 21 2 1 1 2 1
1 2
2 2
1 2
( )
if and is known
( )
if and not is known
CI x x Zn n
s sCI x x Z
n n
NOTE: If 0 is not included in the interval it means that
0 does not occur between the lower and upper
boundaries of the interval
![Page 53: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/53.jpg)
Example 1
Independent random samples of male and female
employees selected from a large industrial plant
yielded the following hourly wage results:-
Construct a 99% confidence interval for the
difference between the hourly wages for all males
and females and interpret the results
53
MALE FEMALE
n1 = 45 n2 = 32
𝑥 = 6.00 𝑥 = 5.75
s1 = 0.95 s1 = 0.75
![Page 54: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/54.jpg)
Example 1- Answer
54
1 0,99
0,01
1
21
0,01
2
0,995
Z0,995= 2,57
CI(1 – 2)0,99 =
x 1 x 2 Z1
2
s12
n1
s2
2
n2
=
6 5,75 2,570,95
2
45
0,75 2
32
= [–0,2486;0,7486]
![Page 55: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/55.jpg)
Example 1- answer
Interpretation:-
At a 99% level of confidence, the difference
between the hourly wages of males and females is
between -0.2486 and 0.7486 rand. The value 0 is
included in the interval which tells us that there is a
possibility that there is no difference between the
two population means. To make sure whether
there is a difference or not, a hypothesis test (next
chapter!!!!) has to be performed.
55
![Page 56: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/56.jpg)
56
• Confidence interval difference in means
– Small independent samples
– When sample sizes are small, n1 & n2< 30 we use the t distribution
NOTE: If both the limits of the confidence interval are
negative you should suspect that the mean of first
population is smaller than mean of second population
![Page 57: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/57.jpg)
Example A plant that operates two shifts per week would like to
consider the difference in productivity for the two shifts. The
number of units that each shift produces on each of the 5
working days is recorded in the following table:-
Assuming that the number of units produced by each shift
is normally distributed and that the population standard
deviations for the two shifts are equal construct a 99%
confidence interval for the difference in mean productivity
for the two shifts and comment on the result.
57
Monday Tuesday Wednesday Thursday Friday
Shift 1 263 288 290 275 255
Shift 2 265 278 277 268 244
![Page 58: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/58.jpg)
Example 1 - answer
58
x 1 x1
n1
1 371
5
274,2
x 2 x2
n2
1 332
5
266,4
s12
x12
1
n1
x1 2
n1 1
233,7
s22
x22
1
n2
x2 2
n2 1
188,3
sp =
n1 1 s12 n2 1 s2
2
n1 n2 2
=
51 (233,7) 51 (188,3)
5 5 2
= 14,5258
1 0,99
0,01
1
21
0,01
2
0,995
t8; 0,995 = 3,355
CI(1 – 2)0,99 =
x 1 x 2 tn1 n22;1
2
sp
1
n1
1
n2
= [(274,2 – 266,4) 3,355(14,5258)
1
5
1
5]
= [–23,0221;38,6221] At the 99% confidence level, because zero is included in the interval, it is possible that there is no significant difference between the two shifts with respect to productivity.
![Page 59: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/59.jpg)
CONCEPT QUESTIONS
• Nos 29 -39, p 235 – 237, textbook
59
![Page 60: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/60.jpg)
60
• Confidence interval difference in proportions
– Large independent samples
2
1 1 2 2
1 2 1 1 2 11 2
1 21 2
1 2
ˆ ˆ ˆ ˆ1 1ˆ ˆ( )
ˆ ˆwith and
p p p pCI p p p p z
n n
x xp p
n n
![Page 61: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/61.jpg)
Example 1
Two groups of males are polled concerning
their interest in a new electric razor that has
four cutting edges. A sample of 64 males
under the age of 40 indicated that only 12
were interested while in a sample of 36
males over the age of 40, only 8 indicated
an interest. Construct a 95% confidence
interval for the difference between age froup
populations
61
![Page 62: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/62.jpg)
Example 1 - answer
62
Under 40: n1 = 64 and
ˆ p 1 =
12
64 = 0,1875.
Over 40: n2 = 36 and
ˆ p 2 =
8
36 = 0,2222.
1 0,95
0,05
1
21
0,05
2
0,975
Z0,975 = 1,96
CI(
p1 p2 )0,9 =
ˆ p 1 ˆ p 2 Z1
2
ˆ p 1 1 ˆ p 1 n1
+ˆ p 2 1 ˆ p 2
n2
=
0,1875 0,2222 1,960,1875 0,8125
64
0,2222 0,7778 36
= [–0,2008;0,1314]
![Page 63: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/63.jpg)
63
• Confidence interval for the ratio of two population variances
• We use the f distribution, table A5. See handout
2 1 2
1 2 2
2 2 2
1 1 1
2 2 2 1; 1;2 2 21 1; 1;
1;
n n
n n
s sCI F
s F s
NOTE: If 1 does not lie in the confidence interval, there
is some evidence that the population variances are not
equal
![Page 64: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/64.jpg)
EXAMPLE 1
A criminologist is interested in comparing the
consistency of the lengths of sentences given to
people convicted of robbery by two judges. A
random sample of 17 people convicted of robbery
by judge 1 showed a standard deviation of 2.53
years, while a random sample of 21 people
convicted by judge 2 showed a standard deviation
of 1.34 years. Construct a 95% confidence interval
for the ratio of the two populations variances. Does
the data suggest that the variances of the lengths
of sentences by the two judges differ? Motivate
your answer. 64
![Page 65: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/65.jpg)
Example 1 - answer
65
Judge 1: n1 = 17 and s1 = 2,53. Judge 2: n2 = 21 and s2 = 1,34.
1 0,95
0,05
2
0,05
2
0,025
Fn11;n21;
2
=
F16; 20; 0,025
= 2,55
Fn21;n11;
2
=
F20; 16; 0,025
= 2,68
CI(
12
22)0,95 =
s12
s22
1
Fn11;n21;
2
;s1
2
s22
Fn21; n11;
2
=
2,53 2
1,34 2
1
2,55
;
2,53 2
1,34 2
2,68
= [1,3979;9,5536] Yes, at the 95% level of confidence it is possible that the variances differ because 1 is not included in the interval.
![Page 66: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/66.jpg)
CONCEPT QUESTIONS
• Concept questions 40 – 47, p 241,
textbook
66
![Page 67: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/67.jpg)
DETERMINING SAMPLE
SIZES FOR ESTIMATES • Everything we have done so far has assumed
that a sample has ALREADY been taken
• We often need to know how large a sample
should we take to construct the confidence
interval
• Many factors can affect sample size such as
budget, time and ease of selection
• We will now look at how to determine the proper
sample size (from a statistical perspective)
67
![Page 68: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/68.jpg)
68
• Sample size for estimating means
– Confidence level (1 – α)
– Accepted sampling error - e
– Need to know σ, else use s
2
1zne
NOTE: Sample size, n, is required to be a whole
number. Therefore always round UP to the next
largest integer
![Page 69: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/69.jpg)
EXAMPLE 1
A pharmaceutical company is considering a
request to pay for the continuing education
of its research scientists. It would like to
estimate the average amount spent by these
scientists for professional memberships.
Base on a pilot study the standard deviation
is estimated to be R35. If a 95% confidence
of being correct to within +/- R20 is desired,
what sample size is necessary?
69
![Page 70: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/70.jpg)
Example 1 - answer
70
= 35 e = 20
1 0,95
0,05
1
21
0,05
2
0,975
Z0,975 = 1,96
n =
Z1
2
e
2
=
1,96 35 20
2
= 11,7649 12 At least 12 scientists should be selected.
![Page 71: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/71.jpg)
71
• Sample size for estimating proportions – Confidence level (1 – α)
– Accepted sampling error - e
– Need to know p, else use
2
1 (1 )z
n p pe
p̂
![Page 72: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/72.jpg)
Example 1
An audit test to establish the % of
occurrence of failures to follow a specific
internal control procedure is to be
undertaken. The auditor decides that the
maximum tolerable error rate that is
permissible is 5%. What sample size is
necessary to achieve a sample precision of
+/- 0.02 with 99% confidence?
72
![Page 73: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/73.jpg)
Example 1 - answer
73
p = 0,05 e = 0,02
1 0,99
0,01
1
21
0,01
2
0,995
Z0,995 = 2,57
n =
Z1
2
e
2
p 1 p
=
2,57
0,02
2
0,05 0,95
= 784,3319 785 A sample size of at least 785 is required.
![Page 74: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/74.jpg)
Classwork
• Questions 48 – 52, pages 244 – 245 ,
textbook
• Self review test, p245, text book
• Izimvo Exchange 1 and 2
• Activity 1,2,3
• Revision Exercise 1,2,3 and 4
74
![Page 75: Statistics lecture 8 (chapter 7)](https://reader033.vdocument.in/reader033/viewer/2022042510/5487b5c8b47959ec0c8b5519/html5/thumbnails/75.jpg)
HOMEWORK
• Supplementary questions, p249 – 253,
textbook
75