Processing of Data
Data Processing
Data processing means putting data into a proper format so that they become information that can be understood easily.
Put simply, data processing is the process of converting data into information.
Data collected during research are processed with a view to reducing them to manageable proportions.
Careful and systematic processing highlights the important characteristics of the data, facilitates comparisons and renders the data suitable for further statistical analysis and interpretation.
1) Editing the Primary Data
After the data have been collected by the primary method, the next step is to edit them.
   1) Editing for completeness
   2) Editing for accuracy
   3) Editing for uniformity
   4) Editing for deciphering (unreadable entries)
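The first three editing checks can be partly automated. The sketch below is only an illustration: the field names (name, age, income) and the plausible age range are assumptions, not part of the original slides. Deciphering unreadable entries still needs a person reading the questionnaire.

```python
# Hypothetical questionnaire fields; a real survey defines its own.
REQUIRED_FIELDS = ("name", "age", "income")


def edit_record(record):
    """Return a list of problems found in one questionnaire record."""
    problems = []

    # Editing for completeness: every required answer must be present.
    for field in REQUIRED_FIELDS:
        if record.get(field) in (None, ""):
            problems.append(f"incomplete: {field} is missing")

    # Editing for accuracy: flag values that cannot be right
    # (the 0-120 range is an assumed plausibility limit).
    age = record.get("age")
    if isinstance(age, int) and not 0 <= age <= 120:
        problems.append("inaccurate: age out of range")

    # Editing for uniformity: income should be a plain number,
    # not free text such as "5 thousand".
    income = record.get("income")
    if income not in (None, "") and not isinstance(income, (int, float)):
        problems.append("not uniform: income is not a plain number")

    return problems


clean = {"name": "A", "age": 30, "income": 1200}
faulty = {"name": "", "age": 150, "income": "5 thousand"}
print(edit_record(clean))
print(edit_record(faulty))
```

Records that come back with an empty list can go on to coding; the others are returned to the investigator for correction.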
2) Coding
After the collected data have been edited, the next step is coding.
Coding refers to assigning numbers, letters or both to the various responses so as to make tabulation of the information easy.
The purpose of coding is to classify the answers to a question into meaningful categories, which is essential for tabulation.
In most surveys, and certainly whenever results are to be put in quantitative form, the immediate stage is the coding of the answers.
Example: codes for feedback responses
Excellent - 5
Very Good - 4
Good - 3
Average - 2
Below Average - 1
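The feedback example can be sketched in Python as a simple lookup. The codes are the ones listed above; the function name code_responses and the sample responses are made up for illustration.

```python
# Codes taken from the feedback example above.
FEEDBACK_CODES = {
    "Excellent": 5,
    "Very Good": 4,
    "Good": 3,
    "Average": 2,
    "Below Average": 1,
}


def code_responses(responses):
    """Replace each feedback label by its numeric code."""
    lookup = {label.lower(): code for label, code in FEEDBACK_CODES.items()}
    coded = []
    for response in responses:
        # Ignore differences in case and spacing between respondents.
        key = " ".join(response.split()).lower()
        if key not in lookup:
            raise ValueError(f"no code defined for response: {response!r}")
        coded.append(lookup[key])
    return coded


# Hypothetical responses, as they might appear on returned forms.
responses = ["Good", "excellent", " Below  Average "]
print(code_responses(responses))
```

Raising an error for an unknown label is a design choice: an answer that fits no category should go back to the editing stage rather than be coded silently.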
Classification
Once the data have been collected and edited, the first task of the statistician is to organize the figures in such a form that their significance and their comparison with masses of similar data are facilitated and further analysis becomes possible.
This is done through classification and tabulation.
Classification is the process of arranging the data into groups or classes according to their common characteristics, or separating them into different but related parts.
Example
The number of students registered in Delhi University during the academic year 2009-2010 may be classified on the basis of any of the following criteria:
1) Sex
2) Age
3) The state to which they belong
4) Religion
5) Different faculties, such as Arts, Science and Commerce
6) Heights
7) Institutions
The family budget data relating to the nature, quality and quantity of the commodities consumed, with expenditure on different items of consumption, may be classified under the following heads:
1) Food
2) Clothing
3) Fuel and Lighting
4) House Rent
5) Miscellaneous (including items like education, recreation, medical expenses, gifts, newspapers, laundry etc.)
Objects of Classification
1) To present the facts in a simple form
2) To bring out clearly points of similarity and dissimilarity
3) To facilitate comparison
4) To bring out relationship
5) To present a mental picture
6) To prepare a basis for tabulation
Types of Classification
I. Classification based on differences in kind
II. Classification based on differences of degree of a given characteristic
III. Geographical Classification
IV. Chronological Classification
V. Alphabetical Classification
I) Classification Based on Differences in Kind
This is also called qualitative classification; the classes are set up on the basis of qualitative differences.
1) Showing the data classified according to one attribute (unemployment)
Employed Persons    Unemployed Persons    Grand Total
500                 150                   650
2) Showing the data classified according to unemployment and sex
                     Employed Persons             Unemployed Persons           Grand Total
                     Males    Females   Total     Males    Females   Total
Number of persons    380      120       500       105      45        150       500 + 150 = 650
The Males and Females columns give the number of employed (or unemployed) males and females respectively.
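The two-attribute table can be rebuilt from individual records by counting each (employment status, sex) pair and then adding the totals. This is a minimal sketch: the cell counts are those of the table above, while the record layout and the name cross_tabulate are assumptions made for the example.

```python
from collections import Counter

STATUSES = ("Employed", "Unemployed")
SEXES = ("Male", "Female")


def cross_tabulate(persons):
    """Classify (status, sex) records by two attributes, with totals."""
    cells = Counter(persons)
    table = {}
    for status in STATUSES:
        row = {sex: cells[(status, sex)] for sex in SEXES}
        row["Total"] = sum(row.values())  # sub-total for this status
        table[status] = row
    table["Grand Total"] = sum(table[status]["Total"] for status in STATUSES)
    return table


# One record per person, expanded from the counts in the table above.
counts = {
    ("Employed", "Male"): 380,
    ("Employed", "Female"): 120,
    ("Unemployed", "Male"): 105,
    ("Unemployed", "Female"): 45,
}
persons = [key for key, n in counts.items() for _ in range(n)]

table = cross_tabulate(persons)
print(table["Employed"])
print(table["Unemployed"])
print(table["Grand Total"])
```

The sub-totals reproduce the arithmetic of the table: 380 + 120 = 500, 105 + 45 = 150 and 500 + 150 = 650.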
II) Classification Based on Differences of Degree of a Given Characteristic
The classification of statistical data based on differences of degree of a given characteristic is also called quantitative classification.
1) Showing the number of persons according to income
Income (Rs.)    Number of Persons
50-99           15
100-149         18
150-199         25
200-249         32
250-299         16
300-349         8
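A quantitative classification like the income table amounts to assigning each value to a class of fixed width and counting. The sketch below assumes whole-rupee incomes and uses the class limits of the table (width 50, from 50 to 349); the sample incomes are hypothetical and are not the data behind the table.

```python
from collections import Counter

LOWEST = 50        # lower limit of the first class in the table
HIGHEST = 349      # upper limit of the last class in the table
CLASS_WIDTH = 50


def income_class(income):
    """Return the class label (e.g. '100-149') for a whole-rupee income."""
    if not LOWEST <= income <= HIGHEST:
        raise ValueError(f"income {income} is outside the classes of the table")
    lower = LOWEST + ((income - LOWEST) // CLASS_WIDTH) * CLASS_WIDTH
    return f"{lower}-{lower + CLASS_WIDTH - 1}"


def classify_incomes(incomes):
    """Count how many incomes fall in each class, in class order."""
    counts = Counter(income_class(x) for x in incomes)
    return dict(sorted(counts.items(), key=lambda kv: int(kv[0].split("-")[0])))


# Hypothetical incomes, chosen to sit on the class limits.
sample_incomes = [55, 99, 100, 149, 210, 349]
print(classify_incomes(sample_incomes))
```

Values on a class limit, such as 99 and 100, land in different classes, which is why the limits must not overlap.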
III) Geographical Classification
In this type of classification, the data are classified according to geographical location, such as continents, countries, states, districts and other sub-divisions.
Name of the District    Per-acre Yield of Wheat (Qnt.)
Pune                    25
Kolhapur                39
Solapur                 40
Aurangabad              42
Nagpur                  45
Amravati                43
IV) Chronological Classification
When the given data are classified on the basis of time, it is named chronological classification. The data may be classified on the basis of time, i.e. years, months, weeks, days or hours.
Year    Population (Crores)    Density of Population (per sq. km)
1931    27.89                  88
1941    31.85                  100
1951    36.09                  113
1961    43.90                  138
1971    54.73                  182
V) Alphabetical Classification
When the data are arranged in alphabetical order, it is called alphabetical classification.
Name of State       Density of Population (per sq. km)
Andhra Pradesh      157
Assam               150
Bihar               324
Gujarat             136
Haryana             225
Himachal Pradesh    62
Kerala              548
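In code, alphabetical classification is just a sort on the name. The sketch below uses the density figures from the table above, entered in no particular order; the function name alphabetical is made up for the example.

```python
# Figures from the table above, deliberately entered out of order.
density = {
    "Kerala": 548,
    "Assam": 150,
    "Himachal Pradesh": 62,
    "Andhra Pradesh": 157,
    "Haryana": 225,
    "Bihar": 324,
    "Gujarat": 136,
}


def alphabetical(data):
    """Return (name, value) pairs arranged alphabetically by name."""
    return sorted(data.items(), key=lambda item: item[0].lower())


for state, per_sq_km in alphabetical(density):
    print(f"{state:<18}{per_sq_km}")
```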
Statistical Series
The table in which the classification is given is known as a statistical series.
Types of Series
1. Individual Series - When the measurements of individual items are arranged in ascending order, in descending order or according to some other scientific order, the result is known as an individual series.
2. Discrete Series - When we count the number of times (the frequency) each value of the variable occurs, the result is known as a discrete series.
3. Continuous Series - The presentation of data in continuous classes along with the corresponding frequencies is known as a continuous series.
The basic components of a continuous series are the class interval and the class limits.
Marks of 400 students
Marks      Number of Students
0 - 9      50
10 - 19    70
20 - 29    120
30 - 39    90
40 - 49    70
Total      400
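The three types of series can be compared by building each from the same raw marks. This is a rough sketch: the marks are hypothetical whole numbers (not the 400 students of the table), the function names are made up, and the class width of 10 follows the table above.

```python
from collections import Counter


def individual_series(values):
    """Individual series: each item listed, here in ascending order."""
    return sorted(values)


def discrete_series(values):
    """Discrete series: each distinct value with its frequency."""
    return dict(sorted(Counter(values).items()))


def continuous_series(values, width=10):
    """Continuous series: class intervals with their frequencies."""
    counts = Counter((v // width) * width for v in values)
    return {f"{lo} - {lo + width - 1}": n for lo, n in sorted(counts.items())}


# Hypothetical marks of eight students.
marks = [12, 5, 27, 12, 41, 27, 27, 9]

print(individual_series(marks))
print(discrete_series(marks))
print(continuous_series(marks))
```

Classes in which no value falls (30 - 39 for these marks) are simply left out by this sketch; a published table would normally show them with a frequency of 0.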
Tabulation
After the data have been classified, the next step is to arrange them in the form of tables.
Tabulation involves the orderly and systematic presentation of numerical data in a form designed to elucidate the problem under consideration.
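As a small illustration of tabulation, the sketch below arranges already classified frequencies in a titled table with a total row. The figures are the marks of 400 students given earlier; the function name tabulate and the layout are assumptions made for the example.

```python
def tabulate(title, headers, rows):
    """Return a plain-text table: a title, a header row and aligned columns."""
    cells = [list(headers)] + [[str(c) for c in row] for row in rows]
    widths = [max(len(r[i]) for r in cells) for i in range(len(headers))]
    lines = [title]
    for r in cells:
        lines.append("  ".join(c.ljust(w) for c, w in zip(r, widths)).rstrip())
    return "\n".join(lines)


# Classified data: marks of 400 students, as given earlier.
marks_rows = [
    ("0 - 9", 50),
    ("10 - 19", 70),
    ("20 - 29", 120),
    ("30 - 39", 90),
    ("40 - 49", 70),
]
total = sum(n for _, n in marks_rows)

table = tabulate(
    "Marks of 400 students",
    ["Marks", "Number of Students"],
    marks_rows + [("Total", total)],
)
print(table)
```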