completely randomised design (crd) 2.ย ยท 2018-02-23ย ยท all copyrights retained by author...

6
All copyrights retained by author Completely randomised design (CRD) Model = + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr Assumptions โ€ข Data are independent (study design) โ€ข The residuals are normally distributed (check histogram, normal probability plot). โ€ข The residuals have equal variances (boxplots, test of equal variances) Estimated parameters Standard error = โˆš 95% CI Contrasts =โˆ‘ =1 , โˆ‘ =0 โ€ข ki indicates the coefficient of the contrast. = โˆ‘ . =1 Var (c) = 2 โˆ‘ 2 = โˆ’ โˆš ร—โˆ‘ 2 , v= N-t (df of SSE) Sum of squares: = 2 โˆ‘ 2 โˆ‘ = Orthogonal contrast Contrasts that convey independent information If 1 = 1 1 + 2 2 +โ‹ฏ+ 2 = 1 1 + 2 2 +โ‹ฏ+ C & D are orthogonal if Complete set of orthogonal contrasts โ€ข t- 1 mutually orthogonal contrast โ€ข Each pair of contrasts is orthogonal Multiple comparisons 1. Bonferroni โ€ข suitable for ad hoc comparisons โ€ข small number of tests = = = , = ( โˆ’ 1)= no. of comparisons 95% CI: 2 = ( 1 โˆ’ 1) 1 2 + ( 2 โˆ’ 1) 2 2 1 + 2 โˆ’2 2. Tukey โ€ข suitable for balanced study โ€ข less conservative 0 : 1 = 2 =โ‹ฏ= k=t treatment, v = df of SSE To find individual differences between means โ€ข Find threshold value ,, ร— โˆš/ โ€ข If | . โˆ’ . | exceeds threshold value, then reject H0 3. Scheffe โ€ข suitable for post hoc comparisons โ€ข many tests โ€ข can only conduct two-tailed tests as we are using F- values. = โˆš( โˆ’ 1) ,โˆ’1,โˆ’ = โˆš ร— โˆ‘ 2 CI: cยฑS ร— sc If data is non-normal โ€ข Transformation: log or sqrt โ€ข Non-parametric test (Kruskal-Wallis) Test the different between treatment medians Assumption: all groups have similar shape If variances are not equal โ€ข Use Leveneโ€™s test (F distribution) or Barlettโ€™s test (chi-square distribution) to see if variances are equal Randomised complete block design (RBD) โ€ข Suitable when we have homogenous groups Model = + + + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr Assumption Parameter constraint โ€ข ~(0, 2 ) โ€ข Variance Relative efficiency = ( 1 +1)( 2 +3) 2 2 ( 1 +3)( 2 +1) 1 2 = If RE=2 => RBD is twice efficient. CRD need a sample size 2 times greater to achieve the same precision. If assumptions fail: โ€ข Transformation โ€ข Non-parametric (Friedmanโ€™s test) Assumptions: - Each block contains t random variables or ranking - The blocks are independent - Within each block, observations can be arranged in increasing order (not too many ties) H0: Each ranking of the random variables within a block is equally likely H1: At least one treatment has larger observed values.

Upload: others

Post on 09-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Completely randomised design (CRD) 2.ย ยท 2018-02-23ย ยท All copyrights retained by author Completely randomised design (CRD) Model = ๐œ‡ + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr โ€ข Assumptions

All copyrights retained by author

Completely randomised design (CRD)

Model ๐‘ฆ๐‘–๐‘— = ๐œ‡๐‘– + ๐‘’๐‘–๐‘—

i=1,2,โ€ฆt, j= 1, 2,โ€ฆr

Assumptions โ€ข Data are independent (study

design)

โ€ข The residuals are normally

distributed (check histogram,

normal probability plot).

โ€ข The residuals have equal

variances (boxplots, test of equal

variances)

Estimated

parameters

Standard

error

๐‘  = โˆš๐‘€๐‘†๐ธ

95% CI

Contrasts

๐ถ = โˆ‘ ๐‘˜๐‘–๐œ‡๐‘–๐‘ก๐‘–=1 , โˆ‘ ๐‘˜๐‘– = 0๐‘ก

๐‘–

โ€ข ki indicates the coefficient of the contrast.

๐‘ = โˆ‘ ๐‘˜๐‘–๏ฟฝฬ…๏ฟฝ๐‘–.๐‘ก๐‘–=1 Var (c) = ๐œŽ2 โˆ‘

๐‘˜๐‘–2

๐‘Ÿ๐‘–๐‘–

๐‘ก๐‘๐‘Ÿ๐‘–๐‘ก =๐‘โˆ’๐ถ

โˆš๐‘€๐‘†๐ธร—โˆ‘๐‘˜๐‘–

2

๐‘Ÿ๐‘–๐‘–

, v= N-t (df of SSE)

Sum of squares: ๐‘†๐‘†๐ถ =๐‘2

โˆ‘๐‘˜๐‘–

2

๐‘Ÿ๐‘–๐‘–

โˆ‘ ๐‘†๐‘†๐ถ = ๐‘‡๐‘Ÿ๐‘’๐‘Ž๐‘ก๐‘š๐‘’๐‘›๐‘ก ๐‘†๐‘†

Orthogonal contrast

Contrasts that convey independent information

If ๐ถ1 = ๐‘˜1๐œ‡1 + ๐‘˜2๐œ‡2 + โ‹ฏ + ๐‘˜๐‘ก๐œ‡๐‘ก

๐ถ2 = ๐‘™1๐œ‡1 + ๐‘™2๐œ‡2 + โ‹ฏ + ๐‘™๐‘ก๐œ‡๐‘ก

C & D are orthogonal if

Complete set of orthogonal contrasts

โ€ข t- 1 mutually orthogonal contrast

โ€ข Each pair of contrasts is orthogonal

Multiple comparisons

1. Bonferroni

โ€ข suitable for ad hoc comparisons

โ€ข small number of tests

๐›ผ๐‘ = ๐‘œ๐‘ฃ๐‘’๐‘Ÿ๐‘Ž๐‘™๐‘™ ๐‘ ๐‘–๐‘”๐‘›๐‘–๐‘“๐‘–๐‘๐‘Ž๐‘›๐‘๐‘’ ๐‘™๐‘’๐‘ฃ๐‘’๐‘™

๐›ผ๐‘’ = ๐‘–๐‘›๐‘‘๐‘–๐‘ฃ๐‘–๐‘‘๐‘ข๐‘Ž๐‘™ =๐›ผ๐‘

๐‘˜, ๐‘˜ = ๐‘ก(๐‘ก โˆ’ 1)= no. of comparisons

95% CI:

๐‘ ๐‘๐‘œ๐‘œ๐‘™๐‘’๐‘‘2 =

(๐‘›1 โˆ’ 1)๐‘ 12 + (๐‘›2 โˆ’ 1)๐‘ 2

2

๐‘›1 + ๐‘›2 โˆ’ 2

2. Tukey

โ€ข suitable for balanced study

โ€ข less conservative

๐ป0: ๐œ‡1 = ๐œ‡2 = โ‹ฏ = ๐œ‡๐‘˜

k=t treatment, v = df of SSE

To find individual differences between means

โ€ข Find threshold value ๐‘ž๐›ผ,๐‘˜,๐‘ฃ ร— โˆš๐‘€๐‘†๐ธ/๐‘Ÿ

โ€ข If |๐‘ฆ๐‘–.ฬ… โˆ’ ๐‘ฆ๐‘—.ฬ…ฬ… ฬ…| exceeds threshold value, then reject H0

3. Scheffe

โ€ข suitable for post hoc comparisons

โ€ข many tests

โ€ข can only conduct two-tailed tests as we are using F-

values.

๐‘† = โˆš(๐‘ก โˆ’ 1)๐น๐›ผ,๐‘กโˆ’1,๐‘โˆ’๐‘ก

๐‘ ๐‘ = โˆš๐‘€๐‘†๐ธ ร— โˆ‘๐‘˜๐‘–

2

๐‘Ÿ๐‘–๐‘–

CI: cยฑS ร— sc

If data is non-normal

โ€ข Transformation: log or sqrt

โ€ข Non-parametric test (Kruskal-Wallis)

Test the different between treatment medians

Assumption: all groups have similar shape

If variances are not equal

โ€ข Use Leveneโ€™s test (F distribution) or Barlettโ€™s test

(chi-square distribution) to see if variances are equal

Randomised complete block design (RBD)

โ€ข Suitable when we have homogenous groups

Model ๐‘ฆ๐‘–๐‘— = ๐œ‡ + ๐œ๐‘– + ๐‘๐‘— + ๐‘’๐‘–๐‘—

i=1,2,โ€ฆt, j= 1, 2,โ€ฆr

Assumption

Parameter

constraint

โ€ข ๐‘’๐‘–๐‘—~๐‘๐ผ๐ท(0, ๐œŽ2)๐‘–๐‘›๐‘‘๐‘’๐‘๐‘’๐‘›๐‘ก๐‘™๐‘ฆ

โ€ข

Variance

Relative

efficiency ๐‘…๐ธ =

(๐‘“1+1)(๐‘“2+3)๐‘ 22

(๐‘“1+3)(๐‘“2+1)๐‘ 12=

๐ถ๐‘…๐ท

๐‘…๐ต๐ท

If RE=2 => RBD is twice efficient. CRD

need a sample size 2 times greater to

achieve the same precision.

If assumptions fail:

โ€ข Transformation

โ€ข Non-parametric (Friedmanโ€™s test)

Assumptions:

- Each block contains t random variables or

ranking

- The blocks are independent

- Within each block, observations can be arranged

in increasing order (not too many ties)

H0: Each ranking of the random variables within a

block is equally likely

H1: At least one treatment has larger observed values.

Page 2: Completely randomised design (CRD) 2.ย ยท 2018-02-23ย ยท All copyrights retained by author Completely randomised design (CRD) Model = ๐œ‡ + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr โ€ข Assumptions

All copyrights retained by author

R(yij): rank from 1 to t assigned to yij within block j

Friedman test statistics S (chi-square distribution)

Factorial experiment

โ€ข Examine multiple factors at the same time

โ€ข Examine interaction first

(1) Significant: Need to carry out multiple

comparisons on the levels of one factor at each

level of the other factor

(2) Insignificant: Remove interaction term.

Model ๐‘ฆ๐‘–๐‘—๐‘˜ = ยต + ๐›ผ๐‘– + ๐›ฝ๐‘— + (๐›ผ๐›ฝ)๐‘–๐‘— + ๐‘’๐‘–๐‘—๐‘˜

i=1,2,โ€ฆa, j= 1, 2,โ€ฆb, k= 1, 2,..r

Assumptions โ€ข ๐‘’๐‘–๐‘—~๐‘๐ผ๐ท(0, ๐œŽ2)๐‘–๐‘›๐‘‘๐‘’๐‘๐‘’๐‘›๐‘ก๐‘™๐‘ฆ

โ€ข

โ€ข Main effect

Main effect of A

Two-factor

interaction effect

(a2b2 -a1b2) โ€“ (a2b1 -a1b1)

Estimated

parameters

NOTE: Var(nX)=n2Var(X)

Var (X+Y)= Var(X) + Var(Y)

2n Factorial Design

๐‘‰๐‘Ž๐‘Ÿ (๏ฟฝฬ‚๏ฟฝ)=๐œŽ2

๐‘Ÿ2๐‘›โˆ’2 SST= ๐‘Œ(.)2

2๐‘›๐‘Ÿ

Yatesโ€™ Algorithm

Partial confounding: Confounding different effects in each rep

Fractional replication: (1) find identity relation, (2) find the

effect subgroup, (3) Decide fractional replicate and its aliases

Random effects model

โ€ข Treatments which are drawn at random from a

population of treatments

Model ๐‘ฆ๐‘–๐‘— = ยต + ๐›ผ๐‘– + ๐‘’๐‘–๐‘—

i=1,2,โ€ฆt, j= 1, 2,โ€ฆr

Assumption

โ€ข ๐‘’๐‘–๐‘—~๐‘๐ผ๐ท(0, ๐œŽ2)๐‘–๐‘›๐‘‘๐‘’๐‘๐‘’๐‘›๐‘ก๐‘™๐‘ฆ

โ€ข ๐›ผ๐‘–~๐‘๐ผ๐ท(0, ๐œŽ2)๐‘–๐‘›๐‘‘๐‘’๐‘๐‘’๐‘›๐‘ก๐‘™๐‘ฆ

Hypothesis ๐ป0: ฯƒa2 = 0 (no treatment effects)

๐ป1: ฯƒa2 > 0 (there is treatment difference)

Estimated

parameters

The variance among X accounts for x% of

the variation and the variance within X

accounts for the other (1-x)%.

CI

Intraclass

correlation

coefficient

The analysis of covariance

โ€ข Examine one factor, but also take into account

extraneous (continuous) variables

โ€ข We can only measure covariate during the

experiment

โ€ข The influence of covariate on the response is

unknown

Model

๐‘ฆ๐‘–๐‘—:(๐‘‹)๐‘“๐‘œ๐‘Ÿ ๐‘—๐‘กโ„Ž ๐‘ ๐‘ข๐‘๐‘—๐‘’๐‘๐‘ก ๐‘–๐‘› ๐‘–๐‘กโ„Ž ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘ก๐‘š๐‘’๐‘›๐‘ก

ยต: ๐‘œ๐‘ฃ๐‘’๐‘Ÿ๐‘Ž๐‘™๐‘™ ๐‘š๐‘’๐‘Ž๐‘› (๐‘‹)

๐œ๐‘–: ๐‘’๐‘“๐‘“๐‘’๐‘๐‘ก ๐‘œ๐‘“ ๐‘กโ„Ž๐‘’ ๐‘–๐‘กโ„Ž ๐‘ก๐‘Ÿ๐‘’๐‘Ž๐‘ก๐‘š๐‘’๐‘›๐‘ก ๐‘œ๐‘› (๐‘‹)

ฮฒ: coefficient for the linear regression of yij on

xij

xij: covariate for jth subject in ith treatment

๐‘ฅ..ฬ…: overall covariate mean

Assumption

Parameter

constraint

โ€ข ๐‘’๐‘–๐‘—~๐‘๐ผ๐ท(0, ๐œŽ2)๐‘–๐‘›๐‘‘๐‘’๐‘๐‘’๐‘›๐‘ก๐‘™๐‘ฆ

โ€ข โ€ข Common slope ฮฒ (not significant)

Page 3: Completely randomised design (CRD) 2.ย ยท 2018-02-23ย ยท All copyrights retained by author Completely randomised design (CRD) Model = ๐œ‡ + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr โ€ข Assumptions

All copyrights retained by author

Adjusted

mean of Y ๐‘ฆ๐‘–๐‘Ž๐‘‘๐‘—ฬ…ฬ… ฬ…ฬ… ฬ…ฬ… = ๐‘ฆ๐‘–.ฬ…ฬ… ฬ… + ๏ฟฝฬ‚๏ฟฝ(๐‘ฅ..ฬ… โˆ’ ๐‘ฅ๐‘–.ฬ…ฬ… ฬ…)

Page 4: Completely randomised design (CRD) 2.ย ยท 2018-02-23ย ยท All copyrights retained by author Completely randomised design (CRD) Model = ๐œ‡ + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr โ€ข Assumptions

All copyrights retained by author

Survey design

Sampling frame: a list of sampling units

Sampling unit: non-overlapping units for sampling

Unit: A group of elements

Element: an object on which measures are taken

NOTE: unit can be element.

Probability concepts

๐‘‰๐‘Ž๐‘Ÿ(๐‘Œ) = โˆ‘ (๐‘ฆ๐‘— โˆ’ ๐œ‡)2

๐‘๐‘—๐‘˜๐‘—=1 = E(Y2)-[E(Y)]2

Simple random sampling

โ€ข Sampling is done without replacement.

โ€ข Simpliest, appropriate with no prior information.

Sampling size

To estimate pop. total within D of true value

To estimate pop. proportion with a specified margin of error B

at siginifcance level ฮฑ

If no p is given, use p=0.5 (conversative + large sample size)

Stratified random sampling

โ€ข Use to reduce variance

โ€ข Obtain best results when within-stratum differences

is small, and large differences between stratum

means.

Sample size

Design effect

Page 5: Completely randomised design (CRD) 2.ย ยท 2018-02-23ย ยท All copyrights retained by author Completely randomised design (CRD) Model = ๐œ‡ + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr โ€ข Assumptions

All copyrights retained by author

If deff <<1, then stratified random sampling is better than

SRS.

Cluster sampling

โ€ข Easy to implement

โ€ข Clusters are generally geographical entites

โ€ข Useful when there is a large within-cluster variation

but small between-cluster variation.

Systematic sampling

โ€ข Simple, save time and effort

โ€ข Useful when we do not have a list of population

โ€ข If the population is period, DO NOT USE.

Method A: When N/k is an integer, choose a unit at random

from the first kth unit. Take every kth unit from the starting

unit.

Method B: Choose a unit at random from the population.

Starting point depends on remainder.

When N/k is not an integer, use Method B to ensure an

unbiased estimator of ยต

Ratio estimator

Page 6: Completely randomised design (CRD) 2.ย ยท 2018-02-23ย ยท All copyrights retained by author Completely randomised design (CRD) Model = ๐œ‡ + i=1,2,โ€ฆt, j= 1, 2,โ€ฆr โ€ข Assumptions

All copyrights retained by author

Regression estimator

MSE=