completely randomised design (crd) 2.ย ยท 2018-02-23ย ยท all copyrights retained by author...
TRANSCRIPT
All copyrights retained by author
Completely randomised design (CRD)
Model ๐ฆ๐๐ = ๐๐ + ๐๐๐
i=1,2,โฆt, j= 1, 2,โฆr
Assumptions โข Data are independent (study
design)
โข The residuals are normally
distributed (check histogram,
normal probability plot).
โข The residuals have equal
variances (boxplots, test of equal
variances)
Estimated
parameters
Standard
error
๐ = โ๐๐๐ธ
95% CI
Contrasts
๐ถ = โ ๐๐๐๐๐ก๐=1 , โ ๐๐ = 0๐ก
๐
โข ki indicates the coefficient of the contrast.
๐ = โ ๐๐๏ฟฝฬ ๏ฟฝ๐.๐ก๐=1 Var (c) = ๐2 โ
๐๐2
๐๐๐
๐ก๐๐๐๐ก =๐โ๐ถ
โ๐๐๐ธรโ๐๐
2
๐๐๐
, v= N-t (df of SSE)
Sum of squares: ๐๐๐ถ =๐2
โ๐๐
2
๐๐๐
โ ๐๐๐ถ = ๐๐๐๐๐ก๐๐๐๐ก ๐๐
Orthogonal contrast
Contrasts that convey independent information
If ๐ถ1 = ๐1๐1 + ๐2๐2 + โฏ + ๐๐ก๐๐ก
๐ถ2 = ๐1๐1 + ๐2๐2 + โฏ + ๐๐ก๐๐ก
C & D are orthogonal if
Complete set of orthogonal contrasts
โข t- 1 mutually orthogonal contrast
โข Each pair of contrasts is orthogonal
Multiple comparisons
1. Bonferroni
โข suitable for ad hoc comparisons
โข small number of tests
๐ผ๐ = ๐๐ฃ๐๐๐๐๐ ๐ ๐๐๐๐๐๐๐๐๐๐๐ ๐๐๐ฃ๐๐
๐ผ๐ = ๐๐๐๐๐ฃ๐๐๐ข๐๐ =๐ผ๐
๐, ๐ = ๐ก(๐ก โ 1)= no. of comparisons
95% CI:
๐ ๐๐๐๐๐๐2 =
(๐1 โ 1)๐ 12 + (๐2 โ 1)๐ 2
2
๐1 + ๐2 โ 2
2. Tukey
โข suitable for balanced study
โข less conservative
๐ป0: ๐1 = ๐2 = โฏ = ๐๐
k=t treatment, v = df of SSE
To find individual differences between means
โข Find threshold value ๐๐ผ,๐,๐ฃ ร โ๐๐๐ธ/๐
โข If |๐ฆ๐.ฬ โ ๐ฆ๐.ฬ ฬ ฬ | exceeds threshold value, then reject H0
3. Scheffe
โข suitable for post hoc comparisons
โข many tests
โข can only conduct two-tailed tests as we are using F-
values.
๐ = โ(๐ก โ 1)๐น๐ผ,๐กโ1,๐โ๐ก
๐ ๐ = โ๐๐๐ธ ร โ๐๐
2
๐๐๐
CI: cยฑS ร sc
If data is non-normal
โข Transformation: log or sqrt
โข Non-parametric test (Kruskal-Wallis)
Test the different between treatment medians
Assumption: all groups have similar shape
If variances are not equal
โข Use Leveneโs test (F distribution) or Barlettโs test
(chi-square distribution) to see if variances are equal
Randomised complete block design (RBD)
โข Suitable when we have homogenous groups
Model ๐ฆ๐๐ = ๐ + ๐๐ + ๐๐ + ๐๐๐
i=1,2,โฆt, j= 1, 2,โฆr
Assumption
Parameter
constraint
โข ๐๐๐~๐๐ผ๐ท(0, ๐2)๐๐๐๐๐๐๐๐ก๐๐ฆ
โข
Variance
Relative
efficiency ๐ ๐ธ =
(๐1+1)(๐2+3)๐ 22
(๐1+3)(๐2+1)๐ 12=
๐ถ๐ ๐ท
๐ ๐ต๐ท
If RE=2 => RBD is twice efficient. CRD
need a sample size 2 times greater to
achieve the same precision.
If assumptions fail:
โข Transformation
โข Non-parametric (Friedmanโs test)
Assumptions:
- Each block contains t random variables or
ranking
- The blocks are independent
- Within each block, observations can be arranged
in increasing order (not too many ties)
H0: Each ranking of the random variables within a
block is equally likely
H1: At least one treatment has larger observed values.
All copyrights retained by author
R(yij): rank from 1 to t assigned to yij within block j
Friedman test statistics S (chi-square distribution)
Factorial experiment
โข Examine multiple factors at the same time
โข Examine interaction first
(1) Significant: Need to carry out multiple
comparisons on the levels of one factor at each
level of the other factor
(2) Insignificant: Remove interaction term.
Model ๐ฆ๐๐๐ = ยต + ๐ผ๐ + ๐ฝ๐ + (๐ผ๐ฝ)๐๐ + ๐๐๐๐
i=1,2,โฆa, j= 1, 2,โฆb, k= 1, 2,..r
Assumptions โข ๐๐๐~๐๐ผ๐ท(0, ๐2)๐๐๐๐๐๐๐๐ก๐๐ฆ
โข
โข Main effect
Main effect of A
Two-factor
interaction effect
(a2b2 -a1b2) โ (a2b1 -a1b1)
Estimated
parameters
NOTE: Var(nX)=n2Var(X)
Var (X+Y)= Var(X) + Var(Y)
2n Factorial Design
๐๐๐ (๏ฟฝฬ๏ฟฝ)=๐2
๐2๐โ2 SST= ๐(.)2
2๐๐
Yatesโ Algorithm
Partial confounding: Confounding different effects in each rep
Fractional replication: (1) find identity relation, (2) find the
effect subgroup, (3) Decide fractional replicate and its aliases
Random effects model
โข Treatments which are drawn at random from a
population of treatments
Model ๐ฆ๐๐ = ยต + ๐ผ๐ + ๐๐๐
i=1,2,โฆt, j= 1, 2,โฆr
Assumption
โข ๐๐๐~๐๐ผ๐ท(0, ๐2)๐๐๐๐๐๐๐๐ก๐๐ฆ
โข ๐ผ๐~๐๐ผ๐ท(0, ๐2)๐๐๐๐๐๐๐๐ก๐๐ฆ
Hypothesis ๐ป0: ฯa2 = 0 (no treatment effects)
๐ป1: ฯa2 > 0 (there is treatment difference)
Estimated
parameters
The variance among X accounts for x% of
the variation and the variance within X
accounts for the other (1-x)%.
CI
Intraclass
correlation
coefficient
The analysis of covariance
โข Examine one factor, but also take into account
extraneous (continuous) variables
โข We can only measure covariate during the
experiment
โข The influence of covariate on the response is
unknown
Model
๐ฆ๐๐:(๐)๐๐๐ ๐๐กโ ๐ ๐ข๐๐๐๐๐ก ๐๐ ๐๐กโ ๐ก๐๐๐๐ก๐๐๐๐ก
ยต: ๐๐ฃ๐๐๐๐๐ ๐๐๐๐ (๐)
๐๐: ๐๐๐๐๐๐ก ๐๐ ๐กโ๐ ๐๐กโ ๐ก๐๐๐๐ก๐๐๐๐ก ๐๐ (๐)
ฮฒ: coefficient for the linear regression of yij on
xij
xij: covariate for jth subject in ith treatment
๐ฅ..ฬ : overall covariate mean
Assumption
Parameter
constraint
โข ๐๐๐~๐๐ผ๐ท(0, ๐2)๐๐๐๐๐๐๐๐ก๐๐ฆ
โข โข Common slope ฮฒ (not significant)
All copyrights retained by author
Adjusted
mean of Y ๐ฆ๐๐๐๐ฬ ฬ ฬ ฬ ฬ ฬ = ๐ฆ๐.ฬ ฬ ฬ + ๏ฟฝฬ๏ฟฝ(๐ฅ..ฬ โ ๐ฅ๐.ฬ ฬ ฬ )
All copyrights retained by author
Survey design
Sampling frame: a list of sampling units
Sampling unit: non-overlapping units for sampling
Unit: A group of elements
Element: an object on which measures are taken
NOTE: unit can be element.
Probability concepts
๐๐๐(๐) = โ (๐ฆ๐ โ ๐)2
๐๐๐๐=1 = E(Y2)-[E(Y)]2
Simple random sampling
โข Sampling is done without replacement.
โข Simpliest, appropriate with no prior information.
Sampling size
To estimate pop. total within D of true value
To estimate pop. proportion with a specified margin of error B
at siginifcance level ฮฑ
If no p is given, use p=0.5 (conversative + large sample size)
Stratified random sampling
โข Use to reduce variance
โข Obtain best results when within-stratum differences
is small, and large differences between stratum
means.
Sample size
Design effect
All copyrights retained by author
If deff <<1, then stratified random sampling is better than
SRS.
Cluster sampling
โข Easy to implement
โข Clusters are generally geographical entites
โข Useful when there is a large within-cluster variation
but small between-cluster variation.
Systematic sampling
โข Simple, save time and effort
โข Useful when we do not have a list of population
โข If the population is period, DO NOT USE.
Method A: When N/k is an integer, choose a unit at random
from the first kth unit. Take every kth unit from the starting
unit.
Method B: Choose a unit at random from the population.
Starting point depends on remainder.
When N/k is not an integer, use Method B to ensure an
unbiased estimator of ยต
Ratio estimator
All copyrights retained by author
Regression estimator
MSE=