1 c. the t test with multiple samples till now we have considered replicate measurements of the same...

22
1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average difference is calculated and individual deviation from a mean difference is calculated and used to calculate a difference standard deviation, S d which is used in a successive step to calculate t. + t = D N 1/2 /S d S d = [ ( D i – D ) 2 / (N-1)] 1/2

Upload: emily-tate

Post on 25-Dec-2015

221 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

1

c.       The t Test with Multiple Samples

Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average difference is calculated and individual deviation from a mean difference is calculated and used to calculate a difference standard deviation, Sd which is used in a successive step to calculate t.

 + t = DN1/2/Sd

  Sd = [ ( Di – D )2 / (N-1)]1/2

Page 2: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

2

Sd is the standard deviation of the difference, D i is the difference between a result obtained by the standard method subtracted from that obtained by the proposed method for the same sample. D is the average of all differences.

 Example  Mercury in multiple samples was determined

using a standard method and a new suggested method. Six different samples were analyzed using the two procedures giving the following results in ppm:

Page 3: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

3

Sample No. New Method Standard method

1. 10.3 10.52. 12.7 11.93. 8.6 8.74. 17.5 16.95. 11.2 10.96. 11.5 11.1 Find the standard deviation of the difference. If the two

methods have comparable precisions, find whether there is any significant difference between the results of the two methods at the 95% confidence level. The tabulated t value for five degrees of freedom at 95% confidence level is 2.571.

Page 4: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

4

 

Sample No. New Method Standard method Di 1. 10.3     10.5 -0.22. 12.7 11.9 +0.83. 8.6 8.7 -0.14. 17.5 16.9 +0.65. 11.2 10.9 +0.36. 11.5 11.1 +0.4_______________________________________________ 

Di = 1.8 D = 1.80/6 = 0.30

Page 5: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

5

( Di – D )2 = { (-0.2-0.3)2 + (+0.8-0.3)2 + (-0.1-0.3)2 + (+0.6-0.3)2 + (+0.3-0.3)2 + (+0.4-0.3)2 } = {0.25+0.25+0.16+0.09+0+0.01}

( Di – D )2 = 0.76 

Sd = ( ( Di – D )2 / (N-1) )1/2

 

Sd = (0.76/5)1/2 = 0.39 

+ t = 0.30x61/2/0.39 =1.88

 The calculated t value is less than the tabulated t value

which means that there is no significant difference between the results of the two methods.

+ t = DN1/2/sd

Page 6: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

6

The Q Test

In several occasions, when replicate experiments are done one of the data point may look odd or faulty. The analyst is confused whether to keep it or reject it. The Q test provides a means to judge if it should be retained or rejected. This can be done by applying the Q test equation:

 Q = a/w Where a is the difference between the suspected result

and the result nearest to it in value, w is the difference between highest and lowest results.

Page 7: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

7

Once again, if the calculated Q value is less than the tabulated value, then the suspected data point should be retained. In contrast to F and t tests the statistical value of Q depends on the number of data points rather than the number of degrees of freedom.

 

Page 8: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

8

Page 9: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

9

ExampleIn the replicate determination of gold you got the

following results: 96, 99, 97, 94, 100, 95, and 72%. Check whether any point should be excluded at the 95% confidence level. Tabulated Q95% = 0.568 for 7 observations

Arrange results: 72, 94, 95, 96, 97, 99, 100

Q = a/w

Qcalc = (94-72)/(100 - 72) = 0.79

Qcalc > Qtab

The point 72% should be rejected.

Page 10: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

10

ExampleIn the replicate determination of gold you got the

following results: 96, 99, 97, 94, 100, 95, and 88%. Check whether any point should be excluded at the 95% confidence level. Tabulated Q95% = 0.568 for 7 observations.

 Arrange results: 88, 94, 95, 96, 97, 99, 100Solution Q = a/w

Qcalc = (94-88)/(100-88) = 0.50

Qcalc < Qtab

The point 88% should be retained.

Page 11: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

11

Linear Least Squares

Frequently, an analyst constructs a calibration curve using several standards and draws a straight line among the data points in the graph. In many cases, the line does not cross all points and the analyst starts judging where the straight line should pass. Human judgment is not perfect and, unfortunately, may be biased. The method of linear least squares is a mathematical method that help us choose the best path of the straight line.

Page 12: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

12

A least-squares plot gives the best straight line through experimental points. Excel will do this for you.

A least-squares plot gives the best straight line through experimental points. Excel will do this for you.

Residual = yi – yl

Page 13: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

13

It is well known that the equation of a straight line is mathematically represented by

 y = mx + bWhere m is the line and b is the line intercept, x and y are

variables. The slope, m, can be calculated from the relationship 

m = {xiyi – [(xiyi)/n]}/{ xi2 – [(xi)2/n]}

 b = y – mx

x, y are average values of xi and yi.

Page 14: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

14

The standard deviation of any of the yi points (Sy) is given by the relation

 

Sy = {([yi2 – (( yi)2/n)] – m2 [xi

2 – (( xi)2/n)])/(n-2)}1/2

 The uncertainty in slope can then be

calculated from Sy as follows 

Sm = {Sy2/ [xi

2 – (( xi)2/n)]}1/2

Page 15: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

15

Example

Using the following data and without plotting, if the fluorescence of a riboflavin sample was 15.4 find its concentration.

 

 

Page 16: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

16

)xi(2 = 2.250x = (xi)/n = 1.500/5 = 0.300y = (yi)/n = 83.6/5 = 16.72

 m = {xiyi – [(xiyi)/n]}/{ xi

2 – [(xi)2/n]} Substitution in the equation above gives m = {46.6 – [(1.500*83.6)/5]}/ {0.850 –[(2.250/5]} m = 53.75

Page 17: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

17

This Excel plot gives the same results for slope and intercept as calculated in the example.

This Excel plot gives the same results for slope and intercept as calculated in the example.

Page 18: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

18

To calculate b we use the equation

b = y – mx

b = 16.72 – 53.75*0.300 = 0.60

Now we are ready to calculate the sample concentration

 

y = mx + b

 

15.4 = 53.75*x + 0.60

x = 0.275 ng/L

Page 19: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

19

Correlation Coefficient (r)

When the points that are supposed to be on a straight line are scattered around that line then one should estimate the correlation between the two variables. The correlation coefficient serves as a measure for the correlation of these two variables. This can be very important if correlation between results obtained by a new method and a standard method is required.

r = {n xiyi – (xiyi)}/ {[nxi2 – (xi)2][nyi

2 – (yi)2]}1/2

Page 20: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

20

ExampleCalculate the correlation

coefficient of the data :

Solution

First we find yi2 and

(yi)2 from the table in previous example

yi2 = 2554.66

(yi)2 = 6988.96

Page 21: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

21

Substituting in the correlation coefficient equation above:

r = {5*46.6-(1.500*83.6)} / {[5*0.850 – 2.250][5*2554.66-6988.96]}1/2

r = 1.00

The correlation coefficient occurs between + 1. As the correlation coefficient approaches unity, correlation increases and exact correlation occurs when r = 1. An r value less than 0.90 is considered bad while that exceeding 0.99 is considered excellent.

Page 22: 1 c. The t Test with Multiple Samples Till now we have considered replicate measurements of the same sample. When multiple samples are present, an average

22

Currently, many scientists prefer to use the square of the correlation coefficient, r2

rather than r, to express correlation. Evidently, the use of r2 is a more strict criterion as a smaller value is always obtained when fractions are squared.