chapter 10
DESCRIPTION
Chapter 10. Experimental Design and Analysis of Variance. Experimental Design and Analysis of Variance. 10.1 Basic Concepts of Experimental Design 10.2 One-Way Analysis of Variance 10.3 The Randomized Block Design 10.4 Two-Way Analysis of Variance. Experimental Design #1. L01. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/1.jpg)
Adapted by Peter Au, George Brown College
Adapted by Peter Au, George Brown College
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited.
Chapter 10Chapter 10
Experimental Design and Analysis of
Variance
Experimental Design and Analysis of
Variance
![Page 2: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/2.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Experimental Design and Analysisof Variance
10.1Basic Concepts of Experimental Design
10.2 One-Way Analysis of Variance10.3 The Randomized Block Design10.4 Two-Way Analysis of Variance
10-2
![Page 3: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/3.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Experimental Design #1• Up until now, we have considered only two ways of
collecting and comparing data:• Using independent random samples• Using paired (or matched) samples
• Often data is collected as the result of an experiment• To systematically study how one or more factors (the independent
variable or IV) influence the variable that is being studied (the response or DV)
10-3
L01
![Page 4: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/4.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Experimental Design #2• In an experiment, there is strict control over
the factors contributing to the experiment• The values or levels of the factors (IV) are called treatments
• For example, in testing a medical drug, the experimenters decide which participants in the test get the drug and which ones get the placebo, instead of leaving the choice to the subjects
• The term treatment comes from an early application of this type of analysis where an analysis of different fertilizer “treatments” produced different crop yields
• If we cannot control the factor(s) being studied, we say that the data obtained are observational
• If we can control the factors being studied, we say that the data are experimental
10-4
L01L02
![Page 5: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/5.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Experimental Design #3• The different treatments are assigned to
objects (the test subjects) called experimental units• When a treatment is applied to more than one experimental unit,
the treatment is being “replicated”
• A designed experiment is an experiment where the analyst controls which treatments used and how they are applied to the experimental units
10-5
L02
![Page 6: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/6.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Experimental Design #4• In a completely randomized experimental design,
independent random samples are assigned to each of the treatments• For example, suppose three experimental units are to be
assigned to five treatments• For completely randomized experimental design,
randomly pick three experimental units for one treatment, randomly pick three different experimental units from those remaining for the next treatment, and so on• This is an example of sampling without replacement
10-6
L02
![Page 7: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/7.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Experimental Design #5• Once the experimental units are assigned and the
experiment is performed, a value of the response variable is observed for each experimental unit• Obtain a sample of values for the response variable for each
treatment (group)
10-7
L02
![Page 8: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/8.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Experimental Design #6• In a completely randomized experimental design, it
is presumed that each sample is a random sample from the population of all possible values of the response variable• That could possibly be observed when using
the specific treatment• The samples are independent of each other
10-8
L02
![Page 9: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/9.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.1 Training Method Experiment Case
• Compare three training methods to package a camera kit and its effect on the hourly packaging efficiency by new employees at a camera company• The response variable is the number of camera boxes packaged per
hour• The training methods A (video), B (interactive) and C (standard) are
the treatments
10-9
![Page 10: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/10.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.1 Training Method Experiment Case
• Use a completely randomized experimental design• Have available a large pool of newly hired employees• Need samples of size five for each training type• Randomly select five people from the pool; assign these five to
training method A (Video Training)• Randomly select five people from the remaining new employees;
these five are assigned to training method B (Interactive)• Randomly select five people from the remaining employees; these
five are assigned to training method C (Standard-reading only)• Each randomly trainee is trained using the assigned method
and results of the average number of boxed packed per hour is recorded
10-10
![Page 11: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/11.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.1 Training Method Experiment Case
• The data is as shown below• Let xij denote the average number of boxes packed
by the jth employee (j = 1,2, … , 5) using training method i (i = A, B, or C)
• Examining the box plots shown next to the data, we see some evidence that the interactive training method (B) may result in the greatest efficiency in packing camera parts
10-11
![Page 12: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/12.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
One-Way Analysis of Variance• Want to study the effects of all p treatments on a
response variable• For each treatment, find the mean and standard
deviation of all possible values of the response variable when using that treatment
• For treatment i, find treatment (Group) mean mi
• One-way analysis of variance estimates and compares the effects of the different treatments on the response variable• By estimating and comparing the treatment means m1,
m2, …, mp
• One-way analysis of variance, or one-way ANOVA
10-12
L03
![Page 13: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/13.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.3 Training Method Experiment Case
• The mean of a sample is the point estimate for the corresponding treatment mean
xA = 34.92 boxes/hr estimates mA
xB = 36.56 boxes/hr estimates mB
xC = 33.98 boxes/hr estimates mC
10-13
L03
![Page 14: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/14.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.3 Training Method Experiment Case
• The standard deviation of a sample is the point estimate for the corresponding treatment standard estimates
sA = 0.7662 boxes/hr estimates sA
sB = 0.8503 boxes/hr estimates sB
sC = 0.8349 boxes/hr estimates sC
10-14
L03
![Page 15: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/15.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
ANOVA Notation• ni denotes the size of the sample randomly
selected for treatment i• xij is the jth value of the response variable using
treatment i• xi is average of the sample of ni values for
treatment i• xi is the point estimate of the treatment mean mi
• si is the standard deviation of the sample of ni values for treatment i• si is the point estimate for the treatment
(population) standard deviation si 10-15
L03
![Page 16: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/16.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
One-Way ANOVA Assumptions• Completely randomized experimental design
• Assume that a sample has been selected randomly for each of the p treatments on the response variable by using a completely randomized experimental design
• Constant variance• The p populations of values of the response variable (associated
with the p treatments) all have the same variance
• Normality• The p populations of values of the response variable all have
normal distributions
• Independence• The samples of experimental units are randomly selected,
independent samples
10-16
L05
![Page 17: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/17.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Notes on Assumptions• One-way ANOVA is not very sensitive to violations
of the equal variances assumption• Especially when all the samples are about the same size• All of the sample standard deviations should be reasonably equal to
each other• General rule, the one-way ANOVA results will be approximately
correct if the largest sample standard deviation is no more than twice the smallest sample standard
• Normality is not crucial• ANOVA results are approximately valid for mound-shaped
distributions• If the sample distributions are reasonably symmetric and if
there are no outliers, then ANOVA results are valid for even samples as small as 4 or 5
10-17
![Page 18: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/18.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Testing for Significant DifferencesBetween Treatment (Group) Means
• Are there any statistically significant differences between the sample (treatment) means?
• The null hypothesis is that the mean of all p treatments are the same• H0: m1 = m2 = … = mp
• The alternative is that at least two of the p treatments have different effects on the mean response• Ha: at least two of m1, m2 , …, mp differ
10-18
![Page 19: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/19.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Testing for Significant DifferencesBetween Treatment (Group) Means
• Compare the between-treatment variability to the within-treatment variability• Between-treatment variability is the variability of the sample
means from sample to sample• Within-treatment variability is the variability of the treatments
within each sample
10-19
![Page 20: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/20.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.3 Training Method Experiment Case
• In Figure 10.1(a), the between-treatment variability is not large compared to the within-treatment variability• The between-treatment variability could be the result of
sampling variability• Do not have enough evidence to reject H0: mA = mB =
mC
• In figure 10.1(b), between-treatment variability is large compared to the within-treatment variability• May have enough evidence to reject H0 in favor of
Ha: at least two of mA, mB, mC differ
10-20
![Page 21: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/21.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
To Compare the Between-Groups and Within-Group Variability
• Terminology • Sums of squares• Mean squares• n is the total number of experimental units used in the one-way
ANOVA• is the overall mean of the observed values of the response
variable
• Define • Between-groups sum of squares
• Error sum of squares
10-21
x
p
iii xxnSSB
1
2
pn
jppj
n
jj
n
jj xxxxxxSSE
1
2
1
222
1
211
21
L03
![Page 22: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/22.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
+=Squares of Squares of Squares of
SumError SumTreatment Sum Total
Partitioning the Total Variability in theResponse
10-22
+ SSE SSB = SST
yVariabilit Treatment Within yVariabilit Treatment Between yVariabilit Total
L03
![Page 23: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/23.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Note• The overall mean x is:
• where n = n1 + n2 + … + ni + …. np
• Also
10-23
p
i
n
jij
i
xn
x1 1
1
p
iix
px
1
1
L03
![Page 24: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/24.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Mean Squares• The treatment mean-squares is:
• The error mean-squares is
10-24
1
pSSB
MSB
pnSSE
MSE
L03
![Page 25: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/25.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
F Test for Difference BetweenGroup Means
• Suppose that we want to compare p treatment means
• The null hypothesis is that all treatment means are the same:• H0: m1 = m2 = … = mp
• The alternative hypothesis is that they are not all the same:• Ha: at least two of m1, m2 , …, mp differ
10-25
![Page 26: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/26.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
F Test for Difference BetweenGroup Means
• Define the F statistic:
• The p-value is the area under the F curve to the right of F, where the F curve has p – 1 numerator and n – p denominator degrees of freedom
10-26
pnSSE
pSSB
MSEMSB
F=
1
L03
![Page 27: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/27.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
F Test for Difference BetweenGroup Means
• Reject H0 in favor of Ha at the a level of significance if • F > Fa , or if • p-value < a
• F a is based on p – 1 numerator and n – p denominator degrees of freedom
10-27
![Page 28: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/28.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.3 Training Method Experiment Case
• For the p = 3 training methods and n = 15 trainees (with 5 trainees per method):
• The overall mean x is
• The treatment sum of squares is
10-28
153.3515
3.52715
9.340.350.34
x
0493.17
153.3598.335153.3556.365153.3592.345 222
222
,,
2
xxnxxnxxnxxnSSB CCBBAACBAi
ii
![Page 29: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/29.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.3 Training Method Experiment Case
• The error sum of squares is
• The total sum of squares is
SST = SSB + SSE = 17.0493 + 8.028 = 25.0773
10-29
028.8
98.339.3498.333.33
56.366.3756.363.35
92.348.3592.340.34
22
22
22
1
2
1
2
1
2
BBA n
jBBj
n
jBBj
n
jAAj xxxxxxSSE
![Page 30: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/30.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.3-10.4 Training Method Experiment Case
• The between-groups (treatment) mean squares is
• The error mean squares is
• The F statistic is
10-30
525.813
0493.171
pSSB
MSB
669.0315
023.8
pnSSE
MSE
74.12669.0525.8
MSEMSB
F
![Page 31: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/31.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.3-10.4 Training Method Experiment Case
• At a = 0.05 significance level, • F0.05 with p – 1 = 3 – 1 = 2 numerator and n – p = 15 – 3 = 12
denominator degrees of freedom
• From Table A.7, F0.05 = 3.89• F = 12.74 > F0.05 = 3.89
• Therefore reject H0 at the 0.05 significance level• There is strong evidence that at least one of the group means (μA,
μB, μC) is different• So at least one of the three different training methods (A, B, C)
have an effect on the average number of boxes packed per hour• But which ones?• Do pairwise comparisons (next topic)
10-31
![Page 32: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/32.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Table A.7: F0.05 DF1=2 and DF2 =12
Numerator df =2
Denominator df = 12
3.89
10-32
![Page 33: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/33.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.4: Calculations
10-33
L03
![Page 34: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/34.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.4: Excel ANOVA Output
10-34
![Page 35: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/35.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Pairwise Comparisons, IndividualIntervals
• Individual 100(1 - a)% confidence interval for mi – mh:
• ta/2 is based on n – p degrees of freedom
10-35
hiα/hi nn
MSEtxx11
2
L04
![Page 36: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/36.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Pairwise Comparisons, Simultaneous Intervals
• A Tukey simultaneous 100(1-α) percent confidence interval for μi – μh is
qa is the upper percentage point of the studentized range for p and (n – p), m denotes common sample size
• Tukey formula gives the most precise (shortest) simultaneous confidence interval
• Generally Tukey simultaneous confidence interval is longer than corresponding individual confidence interval• Penalty paid for simultaneous confidence by obtaining a longer
interval
10-36
A.10 Table from q
mMSE
qXX hi
L04
![Page 37: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/37.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.5 Training Method Experiment Case
• A versus B , = 0.05
10-37
Groups Count Average Variance MSEType A 5 34.92 0.587 0.669Type B 5 36.56 0.723Type C 5 33.98 0.697
0.513-2.7671,-
127.164.151
51
669.02.17936.5634.92
![Page 38: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/38.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.5 Training Method Experiment Case
• Tukey simultaneous confidence intervals for μA – μC is:
• For μA - μC and μB – μC
• Strong evidence that training method B yields the highest mean number of boxes packed
10-38
Click to see value lookup from table A.10
90.261,3.01
1.3791.64
5699.0
77.392.3456.3605.0
mMSE
qXX AB
319.2 ,439.0
379.198.3392.34 379.1
CA xx .9593 ,201.1
379.198.3356.36 379.1
CB xx
![Page 39: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/39.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Table A.10: q0.05p=3
15-3=12
3.77
Return to previous slide
10-39
![Page 40: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/40.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.5 Training Method Experiment Case
10-40
• 95% confidence interval for μB is:
357.37,763.35
797.056.365669.0
2.17936.56
Groups Count Average Variance MSEType A 5 34.92 0.587 0.669Type B 5 36.56 0.723Type C 5 33.98 0.697
![Page 41: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/41.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
The Randomized Block Design• A randomized block design compares p treatments
(for example, production methods) on each of b blocks (or experimental units or sets of units; for example, machine operators• Each block is used exactly once to measure the effect of each and
every treatment• The order in which each treatment is assigned to a block should be
random
• A generalization of the paired difference design, this design controls for variability in experimental units by comparing each treatment on the same (not independent) experimental units• Differences in the treatments are not hidden by differences in the
experimental units (the blocks)10-41
L05
![Page 42: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/42.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Randomized Block Design• Define:
• xij = the value of the response variable when block j uses IV (independent variable)
• = the mean of the b values of the response variable observed in group I• = the mean of the p values of the response variable when using block j• = the mean of the total of the bp values of the response variable
observed in the experiment
10-42
ix
jx
x
L05
![Page 43: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/43.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Randomized Block Design
Blocks1 2 3 … bG
roup
s
12...p
xij = response from treatment i and block j
px
xx
2
1
Group Means
Block Means bxxx 21
10-43
L05
![Page 44: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/44.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.6 The Defective Cardboard Box Case
• Investigate the effects of four production methods on the number of defective boxes produced in an hour
• Compare the methods; for each of the four production methods, the company would select several machine operators, train each operator to use the production method to which they have been assigned, have each operator produce boxes (in random order) for one hour, and record the number of defective boxes produced
• The randomized design would utilize a total of 12 machine operators
• The abilities of the machine operators could differ substantially, these differences might tend to conceal any real differences between the production methods
• To overcome this disadvantage, the company will employ a randomized block experimental design
10-44
![Page 45: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/45.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.6 The Defective Cardboard Box Case
• p = 4 groups (production methods)• b = 3 blocks (machine operators)• n = 12 observations
10-45
![Page 46: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/46.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
The ANOVA Table, RandomizedBlocks
10-46
SST = SSB + SSBL + SSE
L03
![Page 47: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/47.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Sum of Squares• SSB measures the amount of between-groups variability
• SSBL measures the amount of variability due to the blocks
• SST measures the total amount of variability
• SSE measures the amount of the variability due to error
SSE = SST – SSB – SSBL
10-47
p
ii xxbSSB
1
2
b
ij xxpSSBL
1
2
p
i
b
jij xxSST
1 1
2
L03
![Page 48: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/48.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.6 Sum of Squares• For p = 4 groups (production methods), b = 3 blocks
(machine operators), n = 12 observations• SSB = 3[(10.3333 - 7.5833)2 + (10.3333 - 7.5833)2 + (5.0 - 7.5833)2
+ (4.6667 - 7.5833)2] = 90.9167• SSBL = 4[(6.0 - 7.5833)2 + (7.75 - 7.5833)2 + (9.0 - 7.5833)2]
= 18.1667 • SST = (9 - 7.5833)2 + (10 - 7.5833)2 + (12 - 7.5833)2 + (8 - 7.5833)2
+ (11 - 7.5833)2 + (12 - 7.5833)2 + (3 - 7.5833)2 + (5 - 7.5833)2 + (7 - 7.5833)2 + (4 - 7.5833)2 + (5 - 7.5833)2 + (5 - 7.5833)2 = 112.9167
• SSE = 112.9167 - 90.9167 - 18.1667 = 3.8333
• MSB = SSB/(p-1) = 90.9167/3 = 30.3056• MSE = SSE/(p-1)(b-1)= 3.8333/(3)(2) = 0.6389• MSBL = SSBL/(b-1) = 18.1667/2 = 9.0834
10-48
![Page 49: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/49.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
MegaStat ANOVA • Locate SSB, SSBL, SST, SSE, MSB, MSE, MSBL on the
ANOVA output
10-49
B
BL
L03
![Page 50: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/50.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
F(groups) and F(blocks)• F(groups)
• F(blocks)
10-50
43.476389.03056.30
MSEMSB
groupsF
22.146389.00834.9
MSEMSBL
blocksF
![Page 51: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/51.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
F Test for Group Effects• Hypothesis Test
• H0: No difference between group effects• Ha: At least one group effects differ
• Reject H0 if • F > Fa or
• p-value < a• F a is based on p-1 numerator and (p-1)(b-1) denominator
degrees of freedom
10-51
11
1 :Statistic Test
bp-SSE/p-SSB/
MSEMSB
F=
![Page 52: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/52.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.6 Group Effects• Test at the a = 0.05 level of significance
• Reject H0 if F(groups) > F0.05 (based on p-1 numerator and (p-1)(b-1) denominator degrees of freedom
• F(groups) = MSB/MSE = 30.306/0.639 = 47.43• F0.05 based on p-1 = 3 numerator and (p-1)(b-1) = 6 denominator
degrees of freedom is 4.76 (Table A.7)
• Since F(groups) > F0.05 (47.43 > 4.76) we reject the null hypothesis and conclude there is enough evidence at α = 0.05 that one production method has a different effect on the mean number of boxes produced per hour
10-52
![Page 53: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/53.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Table A.7: F0.05 DF1=3 and DF2 =6
Numerator df =2
Denominator df = 6
4.76
10-53
Return
![Page 54: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/54.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
F Test for Block Effects• Hypothesis Test
• H0: No difference between block effects• Ha: At least one block effects differ
• Reject H0 if • F > Fa or
• p-value < a• F a is based on b-1 numerator and (p-1)(b-1) denominator
degrees of freedom
10-54
11
1 :Statistic Test
bp-SSE/b-SSBL/
MSEMSBL
F=
![Page 55: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/55.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.6 Block Effects• Test at the a = 0.05 level of significance
• Reject H0 if F(groups) > F0.05 (based on b-1 numerator and (p-1)(b-1) denominator degrees of freedom
• F(block) = MSBL/MSE = 9.083/0.639 = 14.22• F0.05 based on b-1 = 2 numerator and (p-1)(b-1) = 6 denominator
degrees of freedom is 5.14 (Table A.7)
• Since F(block) > F0.05 (14.22 > 5.14) we reject the null hypothesis and conclude there is enough evidence at α = 0.05 that one machine operator has a different effect on the mean number of boxes produced per hour
10-55
![Page 56: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/56.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Table A.7: F0.05 DF1=2 and DF2 =6
Numerator df =2
Denominator df = 6
5.14
10-56
Return
![Page 57: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/57.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Point Estimates and Confidence Intervals in a Randomized Block ANOVA
• Consider the difference between groups i and h on the mean value of the response variable• A point estimate of this difference is • Individual 100(1 - a)% confidence interval for this difference is
• ta/2 is based on (p-1)(b-1) degrees of freedom
• A Tukey simultaneous 100(1 2 A) percent confidence interval for this difference is
• q a is the upper percentage point of the studentized range for p and (p-1)(b-1) from Table A.10
10-57
hi xx
bs
2t)xx( /2hi
bs
q)xx( hi
![Page 58: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/58.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.7 The Defective Cardboard Box Case
• There is extremely strong evidence that at least one production methods has a different mean number of defective boxes produced per hour
• Group means are = 10.3333, = 10.3333,
= 5.0, and = 4.6667.• Since is the smallest mean, we will use Tukey
simultaneous 95 percent confidence intervals to compare the effect of production method 4 with the effects of production methods 1, 2, and 3
10-58
1x 2x
3x 4x
4x
![Page 59: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/59.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.7 The Defective Cardboard Box Case
• q0.05 = 4.90 is the entry in Table A.10 corresponding to p = 4 and ( p - 1)(b - 1) = 6
• MSE = 0.639 from ANOVA
10-59
794.0639.0 s
![Page 60: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/60.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.7 The Defective Cardboard Box Case
• Tukey simultaneous 95 percent confidence interval for the difference between the effects of production methods 4 and 1 on the mean number of defective boxes produced per hour is
• Note q0.05 = 4.90 for 4 and 6 degrees of freedom
10-60
3.40541]--7.9281,[
2615.26666.53
0.79944.90)10.3333(4.6667
![Page 61: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/61.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Table A.10: q0.05p=4
(p-1)(b-1)=6
4.90
Return to previous slide
10-61
![Page 62: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/62.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Two-Way Analysis of Variance• A two factor factorial design compares the mean
response for a levels of factor 1 (for example, display height) and each of b levels of factor 2 ( for example, display width)
• A treatment is a combination of a level of factor 1 and a level of factor 2
10-62
L06
![Page 63: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/63.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.8 The Shelf Display Case
• The Tastee Bakery Company supplies a bakery product to many supermarkets
• Study the effects of two factors—shelf display height and shelf display width—on monthly demand (measured in cases of 10 units each) for this product
• The factor “display height” is defined to have three levels: B (bottom), M (middle), and T (top)
• The factor “display width” is defined to have two levels: R (regular) and W (wide)
10-63
L06
![Page 64: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/64.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Example 10.8 The Shelf Display Case
10-64
![Page 65: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/65.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Graphical Analysis ofBakery Demand (Plotting the Treatment Means)
10-65
![Page 66: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/66.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Possible Treatment Effects inTwo-Way ANOVA
10-66
![Page 67: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/67.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Two-Way ANOVA Table
10-67
L06
![Page 68: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/68.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
MegaStat Output
10-68
Data SummaryHeight Reg Wide MeanB 55.9 55.7 55.8M 75.5 78.9 77.2T 51.0 52.0 51.5Mean 60.8 62.2 61.5
Width
![Page 69: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/69.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
F Tests for Treatment Effects• Test Statistics
• Main Effects
• Fα is based on a-1 and ab(m-1) degrees of freedom
• Fα is based on b-1 and ab(m-1) degrees of freedom
• Interaction
• Fα is based on (a-1)(b-1) and ab(m-1) degrees of freedom
• Reject H0 if F > Fa or p-value < a10-69
1)]-SSE/[(ab(m1)-SS(1)/(a
MSEMS(1)
=F(1)
1)]-SSE/[(ab(m1)-SS(2)/(b
MSEMS(2)
=F(2)
1)]-SSE/[(ab(m1)]-1)(b-aSS(int)/[(
MSEMS(int)
=F(int)
L06
![Page 70: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/70.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Test for Treatment Effects• Hypothesis
• H0 that no interaction exists between factors 1 and 2 versus the alternative hypothesis Ha that interaction does exist
• Reject H0 in favour of Ha at level of significance α if
10-70
1)]-SSE/[(ab(m1)]-1)(b-aSS(int)/[(
MSEMS(int)
=F(int)
82.06.125.04
MSEMS(int)
=F(int)
![Page 71: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/71.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Conclusion• F(Int) = 0.82 is less than F0.05 = 3.89• Cannot reject H0 at the 0.05 level of significance
• Conclude that little or no interaction exists between shelf display height and shelf display width
10-71
![Page 72: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/72.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Estimation of Treatment DifferencesUnder Two-Way ANOVA, Factor 1
• Individual 100(1 - a)% confidence interval for μi - μi’
• ta/2 is based on ab(m-1) degrees of freedom
• Tukey simultaneous 100(1 - a)% confidence interval for μi - μi’
• qα is the upper percentage point of the studentized range for a and ab(m-1) from Table A.10
10-72
bm
MSE2
t)xx( /2i'i
bm
MSE1
q)xx( i'i
![Page 73: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/73.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Estimation of Treatment DifferencesUnder Two-Way ANOVA, Factor 2
• Individual 100(1 - a)% confidence interval for μj - μj’
• tα/2 is based on ab(m-1) degrees of freedom
• Tukey simultaneous 100(1 - a)% confidence interval for mj - mj’
• q a is the upper percentage point of the studentized range for b and ab(m-1) from Table A.10
10-73
am
MSEjj2
t)xx( /2'
am
MSEjj
1q)xx( '
![Page 74: Chapter 10](https://reader035.vdocument.in/reader035/viewer/2022062803/5681479b550346895db4d007/html5/thumbnails/74.jpg)
Copyright © 2011 McGraw-Hill Ryerson Limited
Summary• The purpose of most experiments is to compare the effects
of various treatments on a response variable• Factors are set before the response variables are observed,
the different values of the factors are called treatments• To analyse experimental data we study one way analysis of
variance (one way ANOVA)• Differences in experimental units can conceal differences in
treatments. In such cases we can employ randomized experimental block design. Each block is used exactly once to measure the effects of each and every treatment
• In two way analysis of variance (two-way ANOVA) we can study the effects of two factors by carrying out a two factor experiment
10-74