8.processing

43
Processing & Analysis of data D.A. Asir John Samuel, MPT (Neuro Paed), Lecturer, Alva’s college of Physiotherapy, Moodbidri Dr.Asir John Samuel (PT), Lecturer, ACP

Upload: asir-john-samuel

Post on 23-Jun-2015

531 views

Category:

Education


1 download

DESCRIPTION

Contains Research methodology might be useful to medical and paramedical UG and PG students pursuing Research

TRANSCRIPT

Page 1: 8.processing

Processing &

Analysis of data D.A. Asir John Samuel, MPT (Neuro Paed),

Lecturer, Alva’s college of Physiotherapy,

Moodbidri

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 2: 8.processing

Processing operations

• Editing

• Coding

• Classification

• Tabulation

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 3: 8.processing

Editing

• Process of examining the collected raw data

• Editing is done to assure that data are

accurate, consistent with other facts gathered,

uniformly entered, as complete as possible

• Field editing

• Central editing Dr.Asir John Samuel (PT), Lecturer, ACP

Page 4: 8.processing

Field editing

• Review of reporting forms by the investigator

for completing, translating or rewriting

• Individual writing styles

• On the very next day or on the next day

• Not correct errors of omission by simply

guessing Dr.Asir John Samuel (PT), Lecturer, ACP

Page 5: 8.processing

Central editing

• Take place when all forms or schedules have

been completed and returned to fitness

• Correct errors such as an entry in wrong place,

wrong month, and the like

• Respondent can be contacted for clarification

• No bias

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 6: 8.processing

Coding

• Process of assigning numerals or other

symbols to answers

• Should be appropriate to research problem

under consideration

• Necessary for effective analysis

• Extraction of data

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 7: 8.processing

Classification

• Large volume of raw data is reduced into

homogeneous group

• Arranging data in groups or classes on basis of

common characteristics

• Classification according to attributes

• Classification according to class-intervals

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 8: 8.processing

Tabulation

• Arranging in concise and logical order

• Summarising raw data and displaying in

compact form

• Orderly arrangement of data in columns and

rows

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 9: 8.processing

Tabulation is essential because of

• Conserves space and reduces explanatory and

descriptive statement to a minimum

• Facilitates process of comparison

• Facilitates summation of items and detection

of errors and omissions

• Basis for various statistical computations Dr.Asir John Samuel (PT), Lecturer, ACP

Page 10: 8.processing

Problems in processing

• Problem concerning “Don’t Know” responses

• Use of percentages

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 11: 8.processing

Problem concerning “Don’t Know” responses

• When DK group is small, it is of little significance

• In big group, it becomes mater of concern

• Actually may not know the answer or

• Researcher may fail in obtaining appropriate

information (failure of questioning process)

• Keep as a separate category in tabulation Dr.Asir John Samuel (PT), Lecturer, ACP

Page 12: 8.processing

Use of percentages

• 2/more percentages must not be averaged

unless each is weighted by group size

• Too large percentages should be avoided

because difficult to understand and confuse

• Hide base value

• Real differences may not be correctly read

• Can never exceed 100 percent and for decrease

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 13: 8.processing

Statistics in Medical Research

• Documentation of medical history of disease,

their progression, variability b/w patient,

association with age, gender, etc.

• Efficacy of various types of therapy

• Definition of normal range

• Epidemiological studies

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 14: 8.processing

Statistics in Medical Research

• Study the effect of environment, socio-

economic and seasonal factors

• Provide assessment of state health in

common, met and unmet needs

• Success/failure of specific health programme

• Promote health legislation

• Evaluate total health programme of action Dr.Asir John Samuel (PT), Lecturer, ACP

Page 15: 8.processing

Statistics in Medical Research - Limitation

• Does not deal with individual fact

• Conclusion are not exact

• Can be misused

• Common men cannot handle properly

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 16: 8.processing

Normal distribution

• Represented by a family of infinite curves

defined uniquely by 2 parameter the mean

and the SD of the population

• The curve are always symmetrically bell

shaped. The width of the curve is defined by

population, SD

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 17: 8.processing

Normal distribution

• Mean, median and mode coincide

• It extends from - ∞ to + ∞

• Symmetrically about the mean

• Approx 68% of distribution is within 1SD of

mean (68.27%)

- 95% - 2SD (1.96 SD)

- 99% - 3SD (2.58 SD) Dr.Asir John Samuel (PT), Lecturer, ACP

Page 18: 8.processing

Normal distribution

• The total area under the curve is 1

• The value of measure of skewness is zero. It is

not skewed

• The curve is asymptotic. It approaches but

never touches baseline at extremes

• The curve extends on the both sides -3σ

distance on left to +3σ distance on the right

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 19: 8.processing

Normal distribution - Uses

• Construct confidence interval

• Many statistical techniques makes an

underlying assumption of normality

• Distribution of sample means is normal

• Normality is important in statistical inference

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 20: 8.processing

Skewness

• Measure of lack of symmetry in a distribution

• Positive skewed

- Right tail is longer

- Mass of distribution is concentrated on left

side

- Distribution is said to be right skewed

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 21: 8.processing

Negative skewed

• Left tail is longer

• Mass of distribution concentration on right

side

• Distribution is said to be left skewed

• Value of skewness is 0 for normal distribution

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 22: 8.processing

Kurtosis

• Measure of degree of peakness in distribution

• For normal distribution, value of kurtosis is 3

• Leptokurtic – High peakness

• Mesokurtic – normal

• Platykurtic – Low peakness

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 23: 8.processing

Descriptive statistics

• Measures of location

- Central tendency

- Mean, median and mode

• Measures of variation

- Dispersion

- Range, quartile, IQR, variance and SD Dr.Asir John Samuel (PT), Lecturer, ACP

Page 24: 8.processing

Mean

• Sum of all observation divided by total no. of

observation

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 25: 8.processing

Mean - merits

• Well understood by most people

• Computation of mean is easy

• More stable

• All items in a series are taken into account

• Used in further statistical calculation

• Good basis for comparison Dr.Asir John Samuel (PT), Lecturer, ACP

Page 26: 8.processing

Mean - Demerits

• Affected by extreme values

• Cannot be computed by mere observation

• Not suitable for skewed distribution

• May not be an actual item

• Not in qualitative data

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 27: 8.processing

Median

• Middle most observation when data is

arranged in ascending/descending order of

magnitude

• Divides number into 2 halves such that no.of

items below it is same as no.of items above

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 28: 8.processing

Median

Odd = n+1/2

Even = n/2 + (n+1)/2

2

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 29: 8.processing

Median - Merits

• Widely used measures of CD

• Not influenced by extreme values

• Can be determined if extremes are not known

• Not a typical representation of series

• Useful for skewed distribution

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 30: 8.processing

Median - Demerits

• When no. of items are small, median may not

be representative

• It is effected by frequency of neighboring

items

• Not a typical representation of series

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 31: 8.processing

Mode

• Most frequently occurring observation in data

• If all values are different then no mode

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 32: 8.processing

Mode - Merits

• Can be computed by mere observation

• Simple

• Precise

• Less time consuming

• Less strain

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 33: 8.processing

Mode - Demerits

• Not an amenable to further algebraic

treatment

• Not rigidly defined

• Affected by no. of frequency of items

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 34: 8.processing

Measures of Dispersion (variation)

• Range

• Interquartile range

• Variance

• Standard Deviation

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 35: 8.processing

Range

• Difference between largest and smallest value

Range = Largest no. – Smallest no.

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 36: 8.processing

Quartile

• Value that divide data into 4 equal parts when

data is arranged in ascending order

Q1 = (n+1/4)th ordered observation

Q1 = [2(n+1)/4]th ordered observation

Q3 = [3(n+1)/4]th ordered observation

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 37: 8.processing

Interquartile range

• Provides range which covers middlemost 50%

of observation

• Good measures of dispersion if there are

extreme values

IQR = Q3 – Q1

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 38: 8.processing

Variance

• Sum of squares of difference of each

observation from mean, divided by n-1

Variance = 𝜀 𝑥−𝑥 2

𝑛−1

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 39: 8.processing

Variance - Merits

• Easy to calculate

• Indicate the variability clearly

• Most informative

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 40: 8.processing

Variance - Demerits

• Units of expression of variance is not the same

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 41: 8.processing

Standard Deviation (SD)

• Square root of variance

SD = √𝜀 𝑥−𝑥 2

𝑛−1

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 42: 8.processing

Standard Deviation - Merits

• Most widely used

• Used in calculating standard error

Dr.Asir John Samuel (PT), Lecturer, ACP

Page 43: 8.processing

Standard Deviation -Demerits

• Lengthy process

• Gives weightage to only extreme valves

Dr.Asir John Samuel (PT), Lecturer, ACP