statistical methods in research - oluwadiyaoluwadiya.com/documents/workshops/advanced statistical...

88
Effective Use of Advanced Statistical Methods in Research OLUWADIYA KS Professor of Surgery Ekiti State University, Ado-Ekiti www.oluwadiya.com

Upload: others

Post on 24-Jul-2020

6 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Effective Use of Advanced

Statistical Methods in Research

OLUWADIYA KS

Professor of Surgery

Ekiti State University, Ado-Ekiti

www.oluwadiya.com

Page 2: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Objectives

1. Data transformation

2. Limitations of P-value

3. Statistics for comparing 2 or more groups with continuous data

4. Regression and Correlation

5. Risk Ratios and Odds Ratios

6. Survival Analysis

7. Sensitivity, Specificity and ROC Curves

8. Finding the right test for specific data

9. Introduction to Endnote

2

Page 3: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Why Transform Data?

The assumptions of most parametric methods:

Homogeneity of variance (Homoscedasticity)

Normality

Linearity

Data transformation is used to make your data

conform to the assumptions of the statistical

methods

Page 4: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Skewed data

Page 5: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Homoscedasticity and Normality

The data deviates from both homoscedasticity and normality.

Page 6: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Homoscedasticity and Normality

This data are both normal and have equal variance

Won’t it be nice if we would make the previous data look this way?

Page 7: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

• We should always check the assumptions

that data follow a normal distribution with

uniform variance:

i. If the data meet the assumptions we can

analyze the raw data as described.

ii. If they are not met, we have two

possible strategies:

When to transform data?

Page 8: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

1-We can use a method which does not

require these assumptions, such as a rank-

based (non-parametric) method.

2-We can transform the data mathematically

to make them fit the assumptions more

closely before analysis.

Page 9: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Methods of data transformation

In healthcare research, there are three commonly

used transformations for quantitative data:

1. Logarithmic transformation,

2. Square root transformation

3. Inverse (reciprocal) transformation.

Page 10: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Determining normality of data

Graphical method

Histograms and Normality plots

Boxplots

Normal Q-Q plots and detrended Q-Q plots

Statistical method

Skewness and kurtosis

Smolgrov-Smirnov statistics

10

Page 11: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Determining normality of data

11

Which of the two variables

has a normal distribution?

Page 12: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Determining normality of data

Age PCV

12

Page 13: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Determining normality of data

AGE PCV

13

Page 14: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Determining normality of data

AGE PCV

14

Page 15: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Determining normality of data

15

Page 16: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Normalizing data

We know that Age is

not normally

distributed

Log transformation:

use SPSS Compute

Sub menu (transform-

compute):

16

Page 17: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Normalize Age

The transformed variable

(log_age) which we asked

SPSS to create has been

created

17

Page 18: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Normalize Age: Result

Original Data (Age) Transformed data

18

Page 19: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Normalize PCV: Result

Log_Age Age

19

Page 20: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

The problem with P

P values provide less information than confidence intervals.

Statistical significance tells us whether there is a difference and not how much difference there is.

A P value provides only a probability that an estimate is due to chance

A P value could be statistically significant but of limited clinical significance.

o A very large study might find that a difference of 0.5mmHg in BP between 2 rx groups is statistically significant but is this clinically relevant?

“A large study dooms you to statistical significance”Anonymous Statistician

20

Page 21: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Statistical tests in inferential statistics are designed to answer the

question ‘‘how likely is the difference found in a sample due to chance

(when actually no such difference exists in the population, the null-

hypothesis)?’’.

This is the only purpose they serve—the calculation of a

probability value.

They do not indicate clinical significance!

21

The problem with P

Page 22: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

The clinical significance of a research finding – the extent

to which it may influence clinical practice – depends on

many factors.

I. Many of these factors are related to study design.

II. are the adverse effects of a treatment also studied in addition to

benefits?

III. Are the outcome measures clinically relevant (e.g., improvement

in symptoms and functioning rather than cognitive distortions)?

IV. Are the effects lasting?

V. Is the cost of treatment worth the effect and

VI. Can the study findings be generalized to patients across social

and clinical settings?

22

Clinical significance

Page 23: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Effects size (Cohen’s d)

Odds ratio

Absolute Risk Reduction (ARR)

Numbers needed to treat (NNT): this is the

reciprocal of ARR

23

Magnitude and clinical significance

Page 24: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Magnitude and clinical significance

No

response

response Total

Placebo A(60) B(40) A+B(100)

Drug C(40) D(60) A+B(100)

24

Risk of response on

antidepressant=0.6

Risk of response on

placebo = 0.4

Absolute risk reduction =

0.2

Number Needed to Treat

(NNT) = 1/0.2 = 5

Page 25: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Statistical Tests

Parametric tests

Continuous data; normally distributed

Non-parametric tests

Continuous data; not normally distributed

(Categorical or Ordinal data)

25

Page 26: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Comparison of two sample mean

Student’s T test

Assumes normally distributed continuous data.

No need to do the math, commonly generated by most statistics software

But…

Understand the underlying theory and assumption

26

Page 27: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Paired T-Test

Whereas T-test assumes there is independence of observations

Related samples i.e.

Paired T-test is meant for “before” and “after” studies

27

Page 28: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Used to determine if two or more samples are

from the same population i.e. no significant

difference between their means

Requires that….

Dependent variable is continuous data

Independent variable is categorical data

Independent variable = Grouping variable = Factor

The variable will consist of a number of categories or levels

There will be 2 or more of such categories or levels.

If there are only 2 categories, then the result will be identical to t-test

. 28

ANalysis Of VAriance (ANOVA)

Page 29: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

ANalysis Of VAriance

An example

You measure the PCV of 50 patients who has sustained fractures.

Does the number of bones fractured affect the PCV of the patients ?

number of bone fractured is the independent variable or factor

0 bone fractured is one level of the factor

1 bone fractured is one level of the factor

2 bones fractured is one level of the factor

3 bones fractured is one level of the factor

PCV is the dependent variable

Page 30: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

ANOVA in SPSS

Page 31: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Significant result…now what?

Yes oo! & there is a

significant

difference among

the means. I’m so

happy

Don’t

know

Na wa o

Ok

then

There are

more than 2

means

But is it

among all

means, or

just two?

Better do a

post hoc test

my friend.

What is that?

Better do it

now

Page 32: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Post Hoc tests

After the Fact comparisons of means used

to identify which specific pairs of means are

significantly different

Designed to reduce errors regardless of

how many pairs of means are compared

Page 33: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Post Hoc tests

Also called follow-up tests

Should be computed only after a significant ANOVA

They are like a collection of little t-tests

But they control overall type 1 error comparatively well

They do not have as much power as the omnibus test (the ANOVA) – so you might get a significant ANOVA & no sig. Follow-up

Purpose is to identify the locus of the effect (what means are different, exactly?)

Page 34: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Post Hoc tests

Page 35: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Post Hoc tests: Homogeneous subset

Page 36: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Correlation

Assesses the linear relationship between two variables Example: height and weight

Strength of the association is described by a correlation coefficient- r

r = 0 - .2 low, probably meaningless

r = .2 - .4 low, possible importance

r = .4 - .6 moderate correlation

r = .6 - .8 high correlation

r = .8 - 1 very high correlation

Can be positive or negative

Pearson’s or Spearman’s correlation coefficient

Tells nothing about causation

36

Page 37: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Correlation

Positive Correlation Negative Correlation

37

Page 38: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Regression

Based on fitting a line to data

Provides a regression coefficient, which is the slope of the line

o For example; y = 0 + 1x (simple linear regression)

Use to predict a dependent variable’s value based on the value of an independent variable.

E.g. In analysis of height and weight, for a known height, one can predict weight.

Much more useful than correlation

Allows prediction of values of Y rather than just whether there is a relationship between two variable.

38

Page 39: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Regression

Types of regression

Linear and Multiple - uses continuous data to predict

continuous data outcome

Logistic- uses continuous data to predict probability of

a dichotomous outcome

Poisson regression- time between rare events.

Cox proportional hazards regression- survival analysis.

39

Page 40: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

40

• The simple linear regression equation is:

y = 0 + 1x

• Graph of the regression equation is a straight line.

• 0 is the intercept of the regression line on the yaxis.

• 1 is the slope of the regression line.

• y is the expected value of y for a given x value.

Simple Linear Regression Equation

Page 41: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

41

Simple Linear Regression Equation

Positive Linear Relationship

y

x

Slope 1

is positive

Regression line

Intercept0

Page 42: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

42

Simple Linear Regression Equation

Negative Linear Relationship

Slope 1

is negative

E(y)

x

Regression line

Intercept0

Slope 1

is negative

Page 43: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

43

Simple Linear Regression Equation

No Relationship

x

E(y)

Slope 1

is 0

Regression line

Intercept0

Page 44: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Risk Ratios

Risk is the probability that an event will happen.

Number of events divided by the number of people at risk.

Risks are compared by creating a ratio

Example: risk of colon cancer in those exposed to a factor vs. those unexposed

Page 45: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Risk Ratios

Typically used in cohort studies

Prospective observational studies comparing groups with various exposures.

Allows exploration of the probability that certain factors are associated with outcomes of interest

For example: association of smoking with lung cancer

Usually require large and long-term studies to determine risks and risk ratios.

45

Page 46: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Interpreting Risk Ratios

A risk ratio of 1 equals no increased risk

A risk ratio of greater than 1 indicates increased risk

A risk ratio of less than 1 indicates decreased risk

95% confidence intervals are usually presented

Must not include 1 for the estimate to be statistically significant.

Example: Risk ratio of 3.1 (95% CI 0.97- 9.41) includes 1, thus would not be statistically significant.

46

Page 47: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Odds Ratios

Odds of an event occurring divided by the

odds of the event not occurring.

Odds are calculated by the number of times an

event happens by the number of times it does

not happen.

o Odds of heads vs. the odds of tails is 1:1 or 1.

47

Page 48: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Odds Ratios

Are calculated from case control studies

Case control: patients with a condition (often rare) are compared to a group of selected controls for exposure to one or more potential etiologic factors.

Cannot calculate risk from these studies as that requires the observation of the natural occurrence of an event over time in exposed and unexposed patients (prospective cohort study).

Instead we can calculate the odds for each group.

48

Page 49: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Comparing Risk and Odds Ratios

For rare events, ratios very similar If 5 of 100 people have a complication:

o The odds are 5/95 or .0526.

o The risk is 5/100 or .05.

If more common events, ratios begin to differ If 30 of 100 people have a complication:

The odds are 30/70 or .43

The risk is 30/100 or .30

Very common events, ratios very different Male versus female births

The odds are .5/.5 or 1

The risk is .5/1 or .5

49

Page 50: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Risk reduction

Absolute risk reduction: amount by which risk is reduced.

Relative risk reduction: proportion or percentage reduction.

Example:

Death rate without treatment: 10 per 1000

Death rate with treatment: 5 per 1000

ARR = 5 per 1000

RRR = 50%

50

Page 51: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

What is survival analysis?

Survival analysis is form of regression technique which models time to an event (death, recurrence, recover).

Unlike linear regression, survival analysis has a dichotomous (binary) outcome

Unlike logistic regression, survival analysis analyzes the time to an event

Able to account for censoring

Can compare survival between groups

Assessses relationship between covariates (variables) and survival time

Page 52: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

defined event of interest (such as death, recurrence, new primary)

specify start and end time of study’s observation period

determine time to event or censoring for each subject

Survival analysis requires

Page 53: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

† death from specific cancer = event

? lost to follow-up = censor

alive at last visit = censor

Survival Data

?

End of follow upStart

Page 54: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

What is Censored data?

Patients who do not reach the event by the end of

the study or who are lost to follow-up.

Types of censored data include:

Event did not occur by the end of the study

Subject died from an unrelated cause

Subject lost to follow-up

Subject withdrew from study

54

Page 55: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Regression vs. Survival Analysis

Technique Mathematical

model

Yields

Linear

Regression

Y=B1X + Bo

(linear)

Linear changes

Logistic

Regression

Ln(P/1-P)=B1X+Bo

(sigmoidal prob.) Odds ratios

Survival

Analyses

h(t) =

ho(t)exp(B1X+Bo)

Hazard rates

Page 56: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

When to use survival analysis

Estimate time-to-event for a group of individuals,

such as time until second heart attack for a group of

MI patients.

To compare time-to-event between two or more

groups, such as treated vs. placebo patients in a

randomized controlled trial.

To assess the relationship of co-variables to time-

to-event, such as: does weight, smoking, or

cholesterol influence survival time of MI patients?

Page 57: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Kaplan-Meier survival curves

Accounts for censoring

Provides a graphical means of comparing the

outcomes of two groups that vary by intervention

or other factor.

Survival rates can be measured directly from curve.

Difference between curves can be tested for

statistical significance.

Does not account for confounding or effect

modification by other covariates

Page 58: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Days

Surv

ival P

robabili

ty

Males

Female

Survival after Surgery (Kaplan-Meier curves)

Ticks = censor

Step = event

Page 59: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Cox Regression Model

Also called Proportional Hazards Survival Model.

Used to investigate relationship between an event (death,

recurrence) occurring over time and possible explanatory

factors (Covariates).

Reported result: Hazard ratio (HR).

Ratio of the hazard in one group divided by the hazard in another.

Interpreted same as risk ratios and odds ratios

HR 1 = no effect

HR > 1 increased risk

HR < 1 decreased risk

59

Page 60: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Cox Regression Model

Common use in long-term studies where

various factors called covariates might

predispose to an event.

Example: after radiotherapy, which factors

(age, race, tumor staging, etc.) might make

recurrence more likely.

60

Page 61: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Sensitivity

61

Sensitivity = A/(A+C)

Ability of a test to identify

correctly, individuals who are

affected

Proportion of people testing

positive

among affected individuals

Page 62: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Specificity

62

Sensitivity = D/(D+B)

Ability of a test to identify

correctly, individuals who are

not affected

Proportion of people testing

negative among non-affected

individuals

Page 63: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

TN

Sp =

TN + FP

TP

Se =

TP + FN

Disease

Test

FP

TN

TP

Performance of a test

FN

NoYes

+

-

63

Page 64: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

0 5 10 15 20

Quantitative result of the test

Distribution of quantitative test results among

affected and non-affected people (ideal case)

TN

Non affected:

Affected:

TP

Nu

mb

er o

f p

eople

tes

ted

Threshold for

positive result

64

Page 65: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

0 5 10 15 20

TN TP

FN FP

Distribution of quantitative results among affected

and non-affected people (real life)

Non-affected:Threshold for

positive result

Quantitative result of the test

Nu

mb

er o

f p

eop

le t

este

d

Affected:

65

Page 66: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

TNTP

FN

FP

Non affected:

Affected:Threshold for

positive result

Effect of Decreasing the Threshold

Nu

mb

er o

f p

eop

le t

este

d

Quantitative result of the test

0 5 10 15 20

66

Page 67: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

TP

Se =

TP + FN

TN

Sp =

TN + FP

Effect of decreasing the Threshold

Disease

Test

FP

TN

TP

FN

NoYes

+

-

67

Page 68: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

0 5 10 15 20

TNTP

FN

FP

Non-affected:

Affected:Threshold for

positive result

Nu

mb

er o

f p

eop

le t

este

d

Quantitative result of the test

Effect of Increasing the Threshold

68

Page 69: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

TP

Se =

TP + FN

TN

Sp =

TN + FP

Effect of Increasing the Threshold

Disease

Test

FP

TN

TP

FN

NoYes

+

-

69

Page 70: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Performance of a Test and Threshold

Sensitivity and specificity vary in opposite

directions when changing the threshold

The choice of a threshold is a compromise

to best reach the objectives of the test What are the consequences of having false positives?

What are the consequences of having false negatives?

70

Page 71: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

When false diagnosis (FP)

is worse than missed diagnosis (FN)

Example: Screening for congenital

toxoplasmosis

One should minimise false positives

Prioritise SPECIFICITY

71

Page 72: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

When missed diagnosis (FN)

is worse than false diagnosis (FP)

Example: Testing for Helicobacter pylori

infection

One should minimise the false negatives

Prioritise SENSITIVITY

72

Page 73: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Receiver Operating Characteristics curve

(ROC curve)

Representation of relationship

between sensitivity and specificity for a test

Simple tool to:

define best cut-off value of a test

compare performance of two tests

73

Page 74: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

ROC Curve 1

0 20 40 60 80 100

100

80

60

40

20

0

100-Specificity

Sensitiv

ity gcs

KTS

RTS

Triage RTS

74

Page 75: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

ROC Curve

Score AUC 95% C. I Optimal cutoff

point

Cutoff sensitive/specific

Sensitivity

%

Specificity %

GCS 0.880 0.825 to 0.

923

9 100 68.4

tRTS 0.881 0.825 to 0.

924

9 100 54

RTS 0.883 0.827 to 0.

925

5.7 83.3 83.3

KTS 0.914 0.864 to 0.

950

12 100 70.7

75

Page 76: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

76

Determining the Test (I)

What kind of variables are they?

1. Numerical variable

2. Ordinal variable

3. Categorical variable (Nominal)

How many groups are there?

T-test vs ANOVA

Page 77: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

77

Determining the Test (II)

Are they “normal distribution”?

Parametric vs. nonparametric methods.

T-test vs. Mann-Whitney U test

ANOVA vs. Kruskal-Wallis test

Page 78: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

78

Determining the Test (III)

Measurements are taken from the same

patient for more than one time (before

and after treatment); you should use

Paired t-test

Repeat-measures ANOVA

Page 79: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

79

Determining the Test (IV)

Usually, data are analyzed after they are

completed (all the measurements are

finished); but there are some studies that

data input are still ongoing while the

researcher have started their analysis.

For these data; do survival analysis

Page 80: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Selecting the appropriate procedure among

the common statistical procedures

80

Independent Variable

Dependent Variable

Categorical Continuous

Categorical Chi Square Logistic regression

t-tests One-way ANOVA

Continuous Logistic regression Correlation Linear regression

For a more complete table, please see the book:

Getting to know SPSS. Second Edition

by Oluwadiya Kehinde

Page 81: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Further studies?

Getting to know SPSS.

Second Edition by

Oluwadiya Kehinde

[email protected]

www.oluwadiya.sitesled.com

81

Page 82: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Final thoughts…….

82

When reading a journal article…….

Page 83: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Keep In Mind That

No study is perfect

All data is dirty is some way or another;

research is what you do with that dirty data

Measurement involves making choices

Page 84: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Be Critical About Numbers

Every statistic is a way of summarizing complex

information into relatively simple numbers.

How did the researchers arrive at these numbers?

Who produced the numbers and what is their bias?

How were key terms be defined & in how many

different ways?

Page 85: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Be Critical About Numbers

How was the choice for the measurement made?

What type of sample was gathered & how does that affect result?

Is the statistical result interpreted correctly?

If comparisons are made, were they appropriate?

Are there competing statistics?

Page 86: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Be Critical About Numbers

With one foot in a

bucket of ice water,

and one foot in a

bucket of boiling

water, you are, on

the average,

comfortable.

Page 87: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Be critical about numbers:

Bias and Error

Page 88: Statistical Methods in Research - Oluwadiyaoluwadiya.com/Documents/Workshops/Advanced Statistical Method… · Poisson regression- time between rare events. Cox proportional hazards

Thanks for your attention

88