introduction statistics are increasingly prevalent in medical practice, and for those doing...
TRANSCRIPT
![Page 1: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/1.jpg)
Introduction
• Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely important therefore, to understand basic statistical ideas relating to research design and data analysis, and to be familiar with the most commonly used methods of analysis.
![Page 2: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/2.jpg)
• Although data analysis is certainly an important part of the statistical process, there is an equally vital role to be played in the design of the research project. Without a properly designed study, the subsequent analysis may be unsafe, and/or a complete waste of time and resources.
![Page 3: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/3.jpg)
• Types of data • Descriptive statistics• Data distributions• Comparative statistics• Non-parametric tests• Paired data• Comparison of several means• Comparing proportions• Exploring the relationship between 2
variables• Correlation• Linear regression• Survival analysis
![Page 4: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/4.jpg)
Pro
po
rtio
n o
f to
tal
Platelet count
0 100 400 1000 1500
0
.05
.1
.15
900
![Page 5: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/5.jpg)
Types of Data
• Categorical– binary or dichotomous e.g.
diabetic/non-diabetic, smoker/non-smoker
– nominal e.g. AB/B/AB/O, short-sighted/long-sighted/normal
– ordered categorical (ordinal) e.g. stage 1/2/3/4, mild/moderate/severe
![Page 6: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/6.jpg)
• Discrete numerical - e.g. number of children - 0/1/2/3/4/5+
• Continuous - e.g. Blood pressure, age • Other types of data
– ranks, e.g. preference between treatments– percentages, e.g. % oxygen uptake– rates or ratios, e.g. numbers of infant
deaths/1000– scores, e.g. Apgar score for evaluating new-
born babies– visual analogue scales, e.g. perception of pain– survival data – two components, outcome and
time to outcome
![Page 7: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/7.jpg)
Descriptive Statistics • For continuous variables there are a
number of useful descriptive statistics– Mean - equal to the sum of the observations
divided by the number of observations, also known as the arithmetic mean
– Median - the value that comes half-way when the data are ranked in order
– Mode - the most common value observed – Standard Deviation - is a measure of the
average deviation (or distance) of the observations from the mean
– Standard Error of the mean - is measure of the uncertainty of a single sample mean as an estimate of the population mean
![Page 8: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/8.jpg)
Data Distributions
• Frequency distribution – If there are more than about 20
observations, a useful first step in summarizing quantitative data is to form a frequency distribution. This is a table showing the number of observations at different values or within certain ranges. If this is then plotted as a bar diagram a frequency distribution is obtained.
![Page 9: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/9.jpg)
PAGE
65.0 - 66.061.0 - 62.057.0 - 58.053.0 - 54.049.0 - 50.045.0 - 46.041.0 - 42.037.0 - 38.033.0 - 34.029.0 - 30.025.0 - 26.021.0 - 22.017.0 - 18.0
Histogram of patient ages for HD
Frequency
80
70
60
50
40
30
20
10
0
Std. Dev = 11.43
Mean = 34.3
N = 1712.00
![Page 10: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/10.jpg)
The Normal Distribution • In practice it is found that a reasonable
description of many variables is provided by the normal distribution (Gaussian distribution). The curve of the normal distribution is symmetrical about the mean and bell-shaped. The bell is tall and narrow for small standard deviations, and short and wide for large ones.
![Page 11: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/11.jpg)
PARAP
90.0
85.0
80.0
75.0
70.0
65.0
60.0
55.0
50.0
45.0
40.0
35.0
30.0
25.0
20.0
15.0
10.0
5.0
0.0
paraprotein in myeloma
Frequency
20
18
16
14
12
10
8
6
4
2
0
Std. Dev = 18.81
Mean = 33.0
N = 103.00
![Page 12: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/12.jpg)
PARAP
90.0
85.0
80.0
75.0
70.0
65.0
60.0
55.0
50.0
45.0
40.0
35.0
30.0
25.0
20.0
15.0
10.0
5.0
0.0
20
18
16
14
12
10
8
6
4
2
0
Std. Dev = 18.81
Mean = 33.0
N = 103.00
![Page 13: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/13.jpg)
Duration of disease (y)
32.030.028.026.024.022.020.018.016.014.012.010.08.06.04.02.00.0
Duration of disease pre ASCT for HD
Frequency
700
600
500
400
300
200
100
0
Std. Dev = 3.90
Mean = 3.2
N = 1676.00
![Page 14: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/14.jpg)
Descriptives
3.1813 9.515E-02
2.9946
3.3679
2.5920
1.8000
15.174
3.8954
.10
33.40
33.30
2.3000
3.115 .060
12.507 .119
Mean
Lower Bound
Upper Bound
95% ConfidenceInterval for Mean
5% Trimmed Mean
Median
Variance
Std. Deviation
Minimum
Maximum
Range
Interquartile Range
Skewness
Kurtosis
DODStatistic Std. Error
![Page 15: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/15.jpg)
Comparative statistics • When there are two or more sets of
observations from a study there are two types of design that must be distinguished: independent or paired. The design will determine the method of statistical analysis
• If the observations are from different groups of individuals, e.g. ages of males and females, or spectacle use in diabetics/non-diabetics, then the data is independent. The sample size may vary from group to group
![Page 16: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/16.jpg)
• If each set of observations is made on the same group of individuals, e.g. WBC count pre- and post- treatment, then the data is said to be paired. This indicates that the observations are on the same individuals rather than from independent samples, and so we have the same number of observations in each set of data
![Page 17: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/17.jpg)
Independent data
• With independent continuous data, we are interested in the mean difference between the groups, but the variability between subjects becomes important. This is because the two sample t test (the most common test used), is based on the assumption that each set of observations is sampled from a population with a Normal Distribution, and that the variances of the two populations are the same.
![Page 18: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/18.jpg)
Non-parametric test
• If the continuous data is not normally distributed, or the standard deviations are very different, a non-parametric alternative to the t test known as the Mann-Whitney test can be utilised (another derivation of the same test is due to Wilcoxon)
![Page 19: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/19.jpg)
1967314N =
MMSTAGE
4.003.002.001.00
BET2MG
30
25
20
15
10
5
0
93
94107109
111752
![Page 20: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/20.jpg)
![Page 21: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/21.jpg)
Group Statistics
67 3.6006 1.45738 .17805
19 7.5179 5.14869 1.18119
MMSTAGE3.00
4.00
BET2MGN Mean Std. Deviation
Std. ErrorMean
Independent Samples Test
11.739 .001 -5.559 84 .000 -3.9173 .70463 -5.31852 -2.51607
-3.279 18.825 .004 -3.9173 1.19453 -6.41906 -1.41553
Equal variancesassumed
Equal variancesnot assumed
BET2MGF Sig.
Levene's Test forEquality of Variances
t df Sig. (2-tailed)Mean
DifferenceStd. ErrorDifference Lower Upper
95% ConfidenceInterval of the
Difference
t-test for Equality of Means
T-test
![Page 22: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/22.jpg)
Mann-Whitney Test
Ranks
67 36.78 2464.50
19 67.18 1276.50
86
MMSTAGE3.00
4.00
Total
BET2MGN Mean Rank Sum of Ranks
Test Statisticsa
186.500
2464.500
-4.685
<0.0001
Mann-Whitney U
Wilcoxon W
Z
Asymp. Sig. (2-tailed)
BET2MG
Grouping Variable: MMSTAGEa.
![Page 23: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/23.jpg)
Group Statistics
4 2.1050 .40673 .20337
31 2.7852 .96692 .17366
MMSTAGE1.00
2.00
BET2MGN Mean Std. Deviation
Std. ErrorMean
Independent Samples Test
.964 .333 -1.377 33 .178 -.6802 .49411 -1.68544 .32512
-2.543 8.518 .033 -.6802 .26743 -1.29039 -.06993
Equal variancesassumed
Equal variancesnot assumed
BET2MGF Sig.
Levene's Test forEquality of Variances
t df Sig. (2-tailed)Mean
DifferenceStd. ErrorDifference Lower Upper
95% ConfidenceInterval of the
Difference
t-test for Equality of Means
Test Statisticsb
27.000
37.000
-1.816
.069
.073a
Mann-Whitney U
Wilcoxon W
Z
Asymp. Sig. (2-tailed)
Exact Sig. [2*(1-tailedSig.)]
BET2MG
Not corrected for ties.a.
Grouping Variable: MMSTAGEb.
![Page 24: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/24.jpg)
122141N =
TBIDOSE
14.412
NEUTS
60
50
40
30
20
10
0
327
83416
444
577
743
3
![Page 25: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/25.jpg)
Neutrophil engraftment following allogeneic SCT for CML
141 88.1% 19 11.9% 160 100.0%
122 95.3% 6 4.7% 128 100.0%
TBIDOSE12
14.4
NEUTSN Percent N Percent N Percent
Valid Missing Total
Cases
![Page 26: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/26.jpg)
Descriptives
22.9787 .5816
21.8289
24.1286
22.5816
22.0000
47.692
6.9060
11.00
56.00
45.00
9.0000
1.162 .204
3.184 .406
26.6148 .5544
25.5172
27.7123
26.1184
26.0000
37.495
6.1233
15.00
53.00
38.00
7.2500
1.453 .219
4.157 .435
Mean
Lower Bound
Upper Bound
95% ConfidenceInterval for Mean
5% Trimmed Mean
Median
Variance
Std. Deviation
Minimum
Maximum
Range
Interquartile Range
Skewness
Kurtosis
Mean
Lower Bound
Upper Bound
95% ConfidenceInterval for Mean
5% Trimmed Mean
Median
Variance
Std. Deviation
Minimum
Maximum
Range
Interquartile Range
Skewness
Kurtosis
TBIDOSE12
14.4
NEUTSStatistic Std. Error
![Page 27: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/27.jpg)
NEUTS
55.050.045.040.035.030.025.020.015.010.0
50
40
30
20
10
0
NEUTS
55.050.045.040.035.030.025.020.015.0
50
40
30
20
10
0
![Page 28: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/28.jpg)
98128N =
TBIDOSE
14.412
PLATES
200
100
0
62302
135
561462190
66294
444371416
87327
2631084349287
43
3
![Page 29: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/29.jpg)
Platelet engraftment following allogeneic SCT for CML
128 80.0% 32 20.0% 160 100.0%
98 76.6% 30 23.4% 128 100.0%
TBIDOSE12
14.4
PLATESN Percent N Percent N Percent
Valid Missing Total
Cases
![Page 30: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/30.jpg)
Descriptives
32.7891 1.6694
29.4857
36.0924
30.5556
29.5000
356.703
18.8866
14.00
186.00
172.00
11.7500
5.244 .214
37.479 .425
42.8776 3.9481
35.0417
50.7134
37.1973
27.0000
1527.572
39.0842
14.00
185.00
171.00
18.0000
2.368 .244
4.780 .483
Mean
Lower Bound
Upper Bound
95% ConfidenceInterval for Mean
5% Trimmed Mean
Median
Variance
Std. Deviation
Minimum
Maximum
Range
Interquartile Range
Skewness
Kurtosis
Mean
Lower Bound
Upper Bound
95% ConfidenceInterval for Mean
5% Trimmed Mean
Median
Variance
Std. Deviation
Minimum
Maximum
Range
Interquartile Range
Skewness
Kurtosis
TBIDOSE12
14.4
PLATESStatistic Std. Error
![Page 31: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/31.jpg)
PLATES
190.0
180.0
170.0
160.0
150.0
140.0
130.0
120.0
110.0
100.0
90.0
80.0
70.0
60.0
50.0
40.0
30.0
20.0
10.0
40
30
20
10
0
PLATES
195.0
185.0
175.0
165.0
155.0
145.0
135.0
125.0
115.0
105.0
95.0
85.0
75.0
65.0
55.0
45.0
35.0
25.0
15.0
5.0
60
50
40
30
20
10
0
![Page 32: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/32.jpg)
Group Statistics
141 22.9787 6.9060 .5816
122 26.6148 6.1233 .5544
128 32.7891 18.8866 1.6694
98 42.8776 39.0842 3.9481
TBIDOSE12
14.4
12
14.4
NEUTS
PLATES
N Mean Std. DeviationStd. Error
Mean
Independent Samples Test
2.291 .131 -4.486 261 .000 -3.6360 .8105 -5.2320 -2.0401
-4.525 260.837 .000 -3.6360 .8035 -5.2182 -2.0539
28.139 .000 -2.557 224 .011 -10.0885 3.9448 -17.8622 -2.3148
-2.354 131.572 .020 -10.0885 4.2865 -18.5679 -1.6091
Equal variancesassumed
Equal variancesnot assumed
Equal variancesassumed
Equal variancesnot assumed
NEUTS
PLATES
F Sig.
Levene's Test forEquality of Variances
t df Sig. (2-tailed)Mean
DifferenceStd. ErrorDifference Lower Upper
95% ConfidenceInterval of the
Difference
t-test for Equality of Means
![Page 33: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/33.jpg)
Test Statistics
6172.500 5543.500
11023.500 15554.500
-.204 -4.977
0.83 0.0006
Mann-Whitney U
Wilcoxon W
Z
P-value (2-tailed)
PLATES NEUTS
![Page 34: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/34.jpg)
Describing continuous data
• If the data is normally distributed– Mean and standard deviation
• If the data is skewed or non-normally distributed or is from a small sample (N<20)– Median and range
![Page 35: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/35.jpg)
Comparison of several means
• Data sets comprising more than two groups are common, and their analysis often involves the comparison of the means for the component subgroups. It is obviously possible to compare each pair of groups using t tests, but this is not a good approach. It is far better to use a single analysis that enables us to look at all the data in one go, and the method of choice is called analysis of variance
• If the data are not normally distributed or have different variances, a non-parametric equivalent to the analysis of variance can be used, and is known as the Kruskal-Wallis test
![Page 36: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/36.jpg)
Paired data• When we have more than one group of
observations it is vital to distinguish the case where the data are paired from that where the groups are independent. Paired data arise when the same individuals are studied more than once, usually in different circumstances. Also, when we have two different groups of subjects who have been individually matched, for example in a matched pair case-control study, then we should treat the data as paired.
![Page 37: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/37.jpg)
• A one sample t test is used to examine the data. The value t is calculated from– t = sample mean - hypothesised mean
standard error of sample mean
• In a paired analysis where one set of observations are subtracted from the other set, the hypothesised mean is zero. Thus the calculation of the t statistic reduces to – t = sample mean / standard error of sample
mean
• The non-parametric equivalent to this test is the Wilcoxon matched pairs signed rank sum test
![Page 38: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/38.jpg)
Ratio AZU1/GAPDHSamples CD34 MNC
A1 2273 0.0379 0.4328A2 3667 0.0007 0.1021A3 2943 0.0003 0.0007A5 2334 0.014 0.0226A6 1759 0.0696 0.5604A8 2164 0.0349 0.3249
A10 3022a 0.159 0.2487
I1 1503 0.6225 0.9253I2 1684 0.0647 1.4268I5 1615 0.2571 0.2516
![Page 39: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/39.jpg)
Wilcoxon Signed Ranks Test
Test Statisticsb
-2.599a
.009
Z
Asymp. Sig. (2-tailed)
MNC - CD34
Based on negative ranks.a.
Wilcoxon Signed Ranks Testb.
![Page 40: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/40.jpg)
Telomere length in Dyskeratosis Congenitaparent child-4.7163 -6.6238-4.7163 -5.9629-4.7163 -6.1392-3.8798 -0.6173-1.4062 -2.2264-5.1662 -5.4028-2.2144 -3.6383-4.439 -2.0056-4.2654 -6.7593-4.9991 -3.8157-0.5679 -2.5027-5.3408 -6.4229-1.2779 -5.1118-1.2779 -3.9373-4.1954 -5.9093-0.1936 0.42620.1764 -1.5093-1.0408 2.45081.0755 0.38980.1737 1.4716-2.1199 -0.4074-2.3117 1.34261.2593 1.65270.5821 1.37010.5821 -1.4691.3701 4.27721.3701 -1.2765-1.469 -1.4892-1.469 0.8735
![Page 41: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/41.jpg)
Paired Samples Test
-.1303 2.09654 .38932 -.9278 .6672 -.335 28 .740CHILD - PARENTPair 1Mean Std. Deviation
Std. ErrorMean Lower Upper
95% ConfidenceInterval of the
Difference
Paired Differences
t df Sig. (2-tailed)
Paired Samples Statistics
-2.0335 29 3.13806 .58272
-1.9032 29 2.27896 .42319
CHILD
PARENT
Pair1
Mean N Std. DeviationStd. Error
Mean
![Page 42: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/42.jpg)
Comparison of groups : continuous data
• Paired on non-paired?
• If non-paired and normally distributed with similar variances : T-test
• If non-paired non-normally distributed or with non-similar variances or very small numbers : Mann-Whitney test
• Paired data – paired t-test or Wilcoxon Signed Ranks Test
![Page 43: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/43.jpg)
Comparing Proportions• Qualitative or categorical data is best
presented in the form of table, such that one variable defines the rows, and the categories for the other variable define the columns. Thus in a European study of ASCT for HD, patient gender was compared between the UK and Europe
• The data are arranged in a contingency table • Individuals are assigned to the appropriate
cell of the contingency table according to their values for the two variables
![Page 44: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/44.jpg)
COUNTRYG * PSEX Crosstabulation
Count
16 610 828 1454
100 160 260
16 710 988 1714
europe
uk
COUNTRYG
Total
Female Male
PSEX
Total
![Page 45: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/45.jpg)
COUNTRYG * PSEXG Crosstabulation
828 610 1438
57.6% 42.4% 100.0%
83.8% 85.9% 84.7%
160 100 260
61.5% 38.5% 100.0%
16.2% 14.1% 15.3%
988 710 1698
58.2% 41.8% 100.0%
100.0% 100.0% 100.0%
Count
% within COUNTRYG
% within PSEXG
Count
% within COUNTRYG
% within PSEXG
Count
% within COUNTRYG
% within PSEXG
europe
uk
COUNTRYG
Total
1.00 2.00
PSEXG
Total
![Page 46: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/46.jpg)
Chi-squared test (2)
• A chi-squared test (2) is used to test whether there is an association between the row variable and the column variable. When the table has only two rows or two columns this is equivalent to the comparison of proportions.
![Page 47: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/47.jpg)
• The first step in interpreting contingency table data is to calculate appropriate proportions or percentages. The chi-squared test compares the observed numbers in each of the four categories and compares them with the numbers expected if there were no difference between the distribution of patient gender
• The greater the differences between the observed and expected numbers, the larger the value of 2 and the less likely it is that the difference is due to chance
![Page 48: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/48.jpg)
COUNTRYG * PSEXG Crosstabulation
828 610 1438
57.6% 42.4% 100.0%
83.8% 85.9% 84.7%
160 100 260
61.5% 38.5% 100.0%
16.2% 14.1% 15.3%
988 710 1698
58.2% 41.8% 100.0%
100.0% 100.0% 100.0%
Count
% within COUNTRYG
% within PSEXG
Count
% within COUNTRYG
% within PSEXG
Count
% within COUNTRYG
% within PSEXG
europe
uk
COUNTRYG
Total
1.00 2.00
PSEXG
Total
![Page 49: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/49.jpg)
Chi-Square Tests
1.418b 1 .234
1.260 1 .262
1.428 1 .232
.246 .131
1.417 1 .234
1698
Pearson Chi-Square
Continuity Correction a
Likelihood Ratio
Fisher's Exact Test
Linear-by-LinearAssociation
N of Valid Cases
Value dfAsymp. Sig.(2-sided)
Exact Sig.(2-sided)
Exact Sig.(1-sided)
Computed only for a 2x2 tablea.
0 cells (.0%) have expected count less than 5. The minimum expected count is108.72.
b.
![Page 50: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/50.jpg)
Fisher’s Exact Test
• When the overall total of the table is less than 20, or if it is between 20 and 40 with the smallest of the four expected values is less than 5, then Fisher’s Exact Test should be used.
![Page 51: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/51.jpg)
Crosstab
15 15
100.0% 100.0%
88.2% 83.3%
2 1 3
66.7% 33.3% 100.0%
11.8% 100.0% 16.7%
17 1 18
94.4% 5.6% 100.0%
100.0% 100.0% 100.0%
Count
% within SURV
% within TRMV
Count
% within SURV
% within TRMV
Count
% within SURV
% within TRMV
.00
1.00
SURV
Total
DISG3.00
.00 1.00
TRMV
Total
![Page 52: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/52.jpg)
Chi-Square Tests
5.294b 1 .021
.847 1 .357
3.905 1 .048
.167 .167
5.000 1 .025
18
Pearson Chi-Square
Continuity Correction a
Likelihood Ratio
Fisher's Exact Test
Linear-by-LinearAssociation
N of Valid Cases
DISG3.00
Value dfAsymp. Sig.(2-sided)
Exact Sig.(2-sided)
Exact Sig.(1-sided)
Computed only for a 2x2 tablea.
3 cells (75.0%) have expected count less than 5. The minimum expected count is .17.b.
![Page 53: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/53.jpg)
• The chi-squared test can also be applied to larger tables, generally called r x c tables, where r denotes the number of rows in the table, and c the number of columns.
• The standard chi-squared test for a 2 x c table is a general test to assess whether there are differences among the c proportions. When the categories in the columns have a natural order, however, a more sensitive test is to look for an increasing (or decreasing) trend in the proportions over the columns. This trend can be tested using the chi-squared test for trend.
![Page 54: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/54.jpg)
Cesarean section
Shoe Size
<4 4 4.5 5 5.5 6 Total
Yes 5 7 6 7 8 10 43
No 17 28 36 41 46 140 308
• In the table below the relation between frequency of Cesarean section and maternal foot size is presented
![Page 55: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/55.jpg)
• The standard chi-squared test of this 2 x 6 table gives and a 2 value of 9.29, with 5 d.f., for which P=0.098. Analysis of the data for trend gives a 2
trend = 8.02, with 1 d.f. (P=0.005). Thus there is strong evidence of a linear trend in the proportion of women giving birth by Cesarean section in relation to shoe size. This relation is not causal, but reflects that shoe size is a convenient indicator of small pelvic size
![Page 56: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/56.jpg)
Categorical data – comparing proportions
• Studies where there are 2 groups and the total number of patients > 40 : Chi-squared test
• Studies where there are 2 groups and the total number of patients < 40 or if more than 40 and a single cell has less than 5 : Fisher’s Exact Test
• Studies where there are more than 2 groups but not ordered : - Chi-squared test
• Studies where there are more than 2 groups which are ordered : - Chi-squared test for trend
![Page 57: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/57.jpg)
Exploring the relationship between two variables
• Three possible purposes :– a.) assess association e.g. body
weight and blood pressure – b.) prediction e.g. height and weight
– c.) assess agreement e.g. blood pressure measurement
![Page 58: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/58.jpg)
Correlation• Method for investigating the linear
association between two continuous variables
• The association is measured by the correlation coefficient
• A correlation between two variables shows that they are associated but does not necessarily imply a ‘cause and effect’ relationship
![Page 59: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/59.jpg)
• A t test is used to test whether the correlation coefficient obtained is significantly different from zero, or in other words whether the observed correlation could simply be due to chance
• The significance level is a function of both the size of the correlation coefficient and the number of observations. A weak correlation may therefore be statistically significant if based on a large number of observations, while a strong correlation may fail to achieve significance if there are only a few observations
![Page 60: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/60.jpg)
Correlations
1 -.393** .620** .395**
. .000 .000 .000
121 121 121 121
-.393** 1 -.220* -.465**
.000 . .015 .000
121 121 121 121
.620** -.220* 1 .152
.000 .015 . .097
121 121 121 121
.395** -.465** .152 1
.000 .000 .097
121 121 121 121
Pearson Correlation
Sig. (2-tailed)
N
Pearson Correlation
Sig. (2-tailed)
N
Pearson Correlation
Sig. (2-tailed)
N
Pearson Correlation
Sig. (2-tailed)
N
BET2MG
OPG
CRP
NTX
BET2MG OPG CRP NTX
Correlation is significant at the 0.01 level (2-tailed).**.
Correlation is significant at the 0.05 level (2-tailed).*.
![Page 61: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/61.jpg)
OPG
20100
CRP
300
200
100
0
P=0.015
![Page 62: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/62.jpg)
BET2MG
3020100
CRP
210
180
150
120
90
60
30
0
P=<0.0001
![Page 63: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/63.jpg)
Problems with correlation analyses
• Biological systems are multifactoral so a simple two-way correlation may not be a true reflection of what is being observed
• Spurious correlations
![Page 64: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/64.jpg)
FOOT SIZE
12108642
READING ABILITY
110
100
90
80
70
60
50
40
30
![Page 65: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/65.jpg)
Assessing agreement
• Neither correlation nor linear regression are appropriate
• There may be a very high correlation, but one method gives a systematically higher/lower reading
• Linear regression, the data is not independent
• The only appropriate way is to subtract one observation from the other, and plot against an index variable
![Page 66: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/66.jpg)
Correlation between PCR and TAQman for measuring MRD
1.000
107
.739 1.000
0.0006
107 107
Pearson Correlation
Sig. (2-tailed)
N
Pearson Correlation
Sig. (2-tailed)
N
PCR
TAQ
PCR TAQ
![Page 67: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/67.jpg)
TAQ
.7.6.5.4.3.2.10.0
PCR
.7
.6
.5
.4
.3
.2
.1
0.0
![Page 68: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/68.jpg)
SAMPLE
120100806040200
DIFFER
.2
.1
0.0
-.1
-.2
-.3
-.4
-.5
![Page 69: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/69.jpg)
Paired Samples Test
2.830E-02 8.117E-02 7.847E-03 1.274E-02 4.386E-02 3.606 106 0.0002PCR - TAQMean Std. Deviation
Std. ErrorMean Lower Upper
95% ConfidenceInterval of the
Difference
Paired Differences
t df Sig. (2-tailed)
![Page 70: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/70.jpg)
Linear regression
• Linear regression gives the equation of the straight line that describes how the y variable increases (or decreases) with an increase in the x variable. y is commonly called the dependent variable, and x the independent, or explanatory variable
• A t test is used to test whether the gradient b differs significantly from a specified value (usually zero)
![Page 71: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/71.jpg)
• Assumptions– for any value of x, y must be normally
distributed– the magnitude of the scatter of the
points about the regression line is the same throughout the length of the line
– the relation between the two variables should be linear
![Page 72: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/72.jpg)
AGE
706050403020100
TLENGTH
22
20
18
16
14
12
![Page 73: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/73.jpg)
AGE
706050403020100
Telomer length
22
20
18
16
14
12
![Page 74: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/74.jpg)
Coefficientsa
17.893 .317 56.390 .000
-.049 .010 -.462 -4.809 .000
(Constant)
AGE
Model1
B Std. Error
UnstandardizedCoefficients
Beta
StandardizedCoefficients
t Sig.
Dependent Variable: TLENGTHa.
![Page 75: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/75.jpg)
AGE
706050403020100
Unstandardized Residual
4
3
2
1
0
-1
-2
-3
-4
![Page 76: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/76.jpg)
Practical application
• Y = mx + c
• Telomere length = age * -0.049 + 17.89
• Substituting in the above equation for ages of 30 and 60
• 16.42 = 30*-0.049 +17.89
• 14.95 = 60*-0.049 +17.89
![Page 77: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/77.jpg)
AGE
706050403020100
Telomer length
22
20
18
16
14
12
![Page 78: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/78.jpg)
Survival data
• Has 2 components
• The event of interest and the time to the event
• Special statistical methods are required – it is not appropriate to use tests for categorical data
![Page 79: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/79.jpg)
Life Table Analysis
• Survival data are usually summarised as survival or Kaplan-Meier curves
• Based on a series of conditional probabilities
• For example, the probability of a patient surviving 10 days after a transplant, is the probability of surviving nine days, multiplied by the probability of surviving the 10th day given that the patient survived the first nine days.
![Page 80: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/80.jpg)
0 40 80 120 160 200 240 280 320
Days post BMT
0123456789
101112131415
Pat
ien
t n
um
ber
Alive
Dead
![Page 81: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/81.jpg)
Table 1. Life table for fifteen patients who received an allogeneic stem cell transplant Time (days) Status Number at risk Probability of
survivalStandard error
16* 0 15 1.00
26 1 14 0.93 0.069
66 1 13 0.86 0.094
69* 0 12
74 1 11 0.78 0.113
82* 0 10
88 1 9 0.69 0.129
89* 0 8
117* 0 7
133* 0 6
144* 0 5
172* 0 4
252* 0 3
291* 0 2
305* 0 1
![Page 82: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/82.jpg)
0 50 100 150 200 250 300 3500
20
40
60
80
100
Days post BMT
Pro
bab
ility
%
![Page 83: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/83.jpg)
Outcomes suitable for Kaplan-Meier analyses
• Survival (event of interest is death, patients alive are censored)
• Disease-free survival (events of interest are either death or disease relapse, patients alive and in remission are censored)
• Primary graft failure• Acute graft versus host disease
![Page 84: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/84.jpg)
0 1 2 3 4 5
Years post BMT
0
20
40
60
80
100
Pro
ba
bil
ity
(%
)Overall and leukaemia-free survival for 111 patients with CML
67%
LFS
OS
45%
in CP allografted with stem cells from HLA-identical sibling donors
HH/ICSM May 2003
![Page 85: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/85.jpg)
0 60 120 180 240 300 360
Days post BMT
0
5
10
15
20
Pro
bab
ility
of
gra
ft f
ailu
re (
%)
Graft failure following BMT for 1stCP CML with a VUD
ICSM/HH April 2003
13.2 Gy (n=57)9%
14.4 Gy (n=44)0%
![Page 86: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/86.jpg)
0 28 56 84 112 140 168 196 224 252 280 308 336 364
Days post BMT
0
20
40
60
80
100
Pro
ba
bili
ty o
f C
MV
re
ac
tiv
ati
on
(%
)CMV reactivation following BMT with a VUD
ICSM/HH May 2003
35%
43%
effect of ganciclovir treatment
prophylactic treatment (n=49)
post infection treatment (n=72)
![Page 87: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/87.jpg)
Use of computers for data collection/analysis
• Decide what data needs collecting (for statistical purposes) and then try if appropriate design a form (this is best done in a database, eg Microsoft ACCESS)
• Get the computer to do as much of the work as possible. ie calculation of ages, surface area etc
• Think ahead to what format the spreadsheet/stats package requires the data to be in
![Page 88: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/88.jpg)
• For analysis purposes, its much easier to work with numbers and codes, as opposed to descriptions ie instead of male/female or m/f, use 1 or 2
• Use a ‘code’ to identify missing data, eg 999 or something ‘unlikely’
• Check the data before analysis, get ‘descriptive statistics’
• Use appropriate statistical methods • Statistical packages - SPSS, BMDP,
STATA, Statgraphics, MINITAB, STATXACT, GENSTAT, SAS
![Page 89: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/89.jpg)
Presentation of results• Where possible give actual P values rather than
ranges– ie P=0.041 rather than P<0.05
• If a P value is not significant give the actual value and not just NS– ie P=0.15 rather than P=NS
• When presenting data it may be more useful to present confidence intervals rather than a P-value– ie lens A was more durable than lens B by 2.4 days
(P=0.03), it might be more informative to write - lens A was more durable than lens B by 2.4 days (95%CI 0.3-4.5days)
![Page 90: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/90.jpg)
• It is not necessary to give test results– ie t=33.5, 28 dof, P=0.0001
• If a continuous variable is normally distributed present, as a description of the data, the mean and standard deviation, if not normally distributed, a median and range
• Don’t quote more significant figures than necessary – ie mean patient age 34.2550 (std dev
11.4337), 34.3 (std dev 11.4) will suffice
![Page 91: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/91.jpg)
0.000
0.200
0.400
0.600
0.800
1.000
1.200
32D xl-1 xl-2 xl-3 xl-4 xl-5 xl-6 32DP210
Cell Line
Adhesion to FN (A
490
absorbance units)
![Page 92: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/92.jpg)
3636N =
GROUP
XL2
.8
.6
.4
.2
0.0
8
![Page 93: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/93.jpg)
Writing the statistics section in a paper
• If power calculations were used to calculate the sample sizes, details should be given– eg Based on samples sizes of x in each arm, we
should have been able to detect a difference of y given 80% power at a significance level of 0.05.
• State which statistical tests were used (reference obscure ones). – eg in order to investigate the differences between
the groups, a t-test was used for continuous data, and a chi-squared test for categorical data
![Page 94: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/94.jpg)
• If applicable, state whether standard deviations or standard errors are quoted
• State whether p-values are from one or two-tailed tests– eg all quoted p-values are two-tailed
• Not necessary to quote which stats package was used
![Page 95: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/95.jpg)
Suggested Reading Material
• Essentials of Medical Statistics – Betty Kirkwood
• Practical Statistics for Medical Research– Doug Altman
• Statistical Methods in Medical Research– Armitage and Berry
![Page 96: Introduction Statistics are increasingly prevalent in medical practice, and for those doing research, statistical issues are fundamental. It is extremely](https://reader035.vdocument.in/reader035/viewer/2022070306/55180bc855034696438b45e8/html5/thumbnails/96.jpg)
Summary
• If at all possible - consult a statistician before starting your study
• Get a feel of your data by plotting results - don’t rely on descriptive statistics alone
• Use appropriate statistical tests, not those that give the ‘best’ results