measures of central tendency final
TRANSCRIPT
-
7/28/2019 Measures of Central Tendency Final
1/39
Quantitative Aptitude 2.1 Measures of Central Tendency
CHAPTER-2
MEASURES OF CENTRAL TENDENCY
Introduction:After classifying and tabulating the data, the statistician is usually interested in analyzing
the data. This is done with the help of various numerical measures which describe theinherent characteristics of a frequency distribution.A statistical measure summarizes the data and brings out the important characteristics ofthe data in such a way that a clear and accurate picture emerges. We will discuss certainstatistical measures which will be helpful in analyzing the data.
Meaning:The word measures means methods and the word central tendency means averagevalue of any statistical series.The combined term Measures of central tendency meansthe methods of finding out the central value or average value of a statistical series or anyother series of quantitative information.An average is a single figure that is computed from a given series to give complete ideaabout the entire series. Its value lies between the maximum and the minimum value of aseries and represents all the items belonging to the series.
Need for the average:1. It is not possible to remember each and every fact relating to a field of enquiry as
human mind is not capable of storing all facts.2. It is not desirable to remember or take note of all the facts relating to a study because
it may lead to confusion in the mind of a person if all the facts are presented before him.3. It is the average value which is quite fit to give a clear picture or efficacy about the
field under study for guidance and necessary conclusion.
Definition:1. Average is a value which is typical or representative of a set of data.2. Average is an attempt to find one single figure to describe whole of figures.3. An average is a single number describing as number some features of a set of data.4. The average is sometimes described as number which is typical of the whole
group.
Characteristics of an average:1. It is a single figure expressed in some quantitative form.2. It lies between the extreme values of a series.3. It is a typical value that represents all the values in a series.4. It is capable of giving a central idea about the series it represents.5. It is determined by some method or procedure.Objectives of an average:The main objectives of an average are:1. To determine one single value that may be used to describe the characteristics of the
entire series.2. To facilitate comparison at a particular point of time or over a period of time.
-
7/28/2019 Measures of Central Tendency Final
2/39
Quantitative Aptitude 2.2 Measures of Central Tendency
3. To facilitate statistical inference: An average obtained from a sample is used inestimating the average of the population.
4. To facilitate quick understandingof complex data.5. To help the decision-making process: The averages help the managers in decision-
making. The managers are often interested in knowing normal output of a plant,
representative sales volume, overall productivity index, price index, etc., these all arethe connotations of an average.
Characteristics of an Ideal average:1. It should be rigidly defined so that there is no confusion with regard to its meaning
connotation.
2. It should be easy to understand.3. It should be simple to compute.4. Its definition should be in the form of mathematical formula.5. It should be based on all the items in the data.
6. Any single item or a group of items should not unduly influence it.7. It should be capable of further algebraic treatment.8. It should be capable of being used in further statistical computation.9. It should have sampling stability.
Advantages of an average:1. It simplifies the complexity of data.2. It facilitates comparative study.3. It is easy to remember.Disadvantages of an average:1. It may give rise to a figure which may not belongto the series.2. It may give rise to a result which is practically impossible and absurd.3. It may lead to dangerous results.4. It may give different results.5. It does not reveal the entire story of a phenomenon.6. It does not have sympathy for individual item.Different Types of Average:There are three types of averages;
A.Mathematical Averages:
1. Arithmetic Average or mean2. Geometric Mean3. Harmonic Mean
B. Positional Averages:1. Median2. Mode
C. Positional Averages:1. Moving average2. Progressive average
-
7/28/2019 Measures of Central Tendency Final
3/39
Quantitative Aptitude 2.3 Measures of Central Tendency
A.Mathematical Averages:1.Arithmetic Mean:
Meaning of Arithmetic Mean:The arithmetic mean is a measure of central tendency and is popularly known as mean.Arithmetic mean is obtained by dividing the sum of the values of all items of a series by
the number of items of that series. Normally, arithmetic mean is denoted by X which isread as X bar. It can be computed for unclassified or ungrouped data or individual seriesas well as classified or grouped data or discrete or continuous series.
1.Arithmetic MeanA. Simple arithmetic Mean.B. Weighted arithmetic Mean.C. Combined arithmetic Mean.A.Simple Arithmetic Mean:The simple arithmetic mean can be computed in two ways;
1) Direct method.2) Short cut method.
Computation of simple Arithmetic Mean:Individual series:Direct Method Steps involved are:1. Identify the variable and denote it by x.2. Sum up the observations i.e. x.3. Count the number of observations. i.e. n.4. Apply the following formula.
nxx
Where, x =Arithmetic Mean, x =sum of the variable x, and n=the total number of
observations.
Short cut MethodSteps involved are:1. Identify the variable and denote it by x.2. Select an assumed mean or provision mean A from the values of variables or any
value.3. Take the deviations of the values of the variablex from the assumed mean,
i.e., (d=x-A). The deviations are denoted byd4. Sum up the deviations d i.e. d 5. Divide the sum of deviations by the number of observations and add the resulting
figure to the assumed mean or apply the following formula.
n
dAX
Where,
X = Arithmetic meanA = Assumed meand = Sum of the deviations of individual values from assumed mean.
N = f=Number of observations.
-
7/28/2019 Measures of Central Tendency Final
4/39
Quantitative Aptitude 2.4 Measures of Central Tendency
Discrete SeriesDiscrete series means where frequencies are given but without class intervals.Direct Method Steps involved are:1. Identify the variable (x) and frequency (f) from the data given.2.Multiply each variable with respective frequency. i.e. fXx3.Apply the following formula.
N
fxX
Where, f = frequency,x = the value of the variable and N = the sum of frequency or f
Short Cut Method Steps involved are:1. Identify the variable (x) and frequency (f) from the data given.2.Select an assumed mean or provision mean A from the values of variables or any value.3.Take the deviations of the values of the variable x from the assumed mean, i.e., (d=x-A).
The deviations are denoted by d
4.Multiply each frequency with respective deviation (fXd) and their summation (fXd).5.Apply the following formula.
f
fdAX
Where, A = Assumed Mean, dx = (x A), f = frequency And f or N = total number of
items.
Continuous Series:Continuous series means where frequencies are given along with class intervals:Direct Method Steps involved are:
Here we take the mid point of every class interval as x multiple these values withrespective frequencies.1.Convert the class series into exclusive series if given in the open-ended or cumulative
series. It may be noted that the mean can be easily calculated if the problem is given ininclusive series or mid-value series.
2.Find out the mid-value of each class and denote them byx.3.Find N; the total of the frequencies (N = f )4.Multiply each mid-value with the respective frequency and find out (fx).5.Apply the following formula
N
fx
f
fx
X
.
Where, f is frequency; x the mid point of interval and N = f .
Short Cut Method Steps involved are:1.Convert the class series into exclusive series if given in the open-ended or cumulative
series. It may be noted that the mean can be easily calculated if the problem is given ininclusive series or mid-value series.
2. Find out the mid-value of each class and denote them byx.3.Select an assumed mean or provision mean A from the values of variables or any value.4.Take the deviations of the values of the variable X from the assumed mean, i.e., (d=x-A).
The deviations are denoted by d
-
7/28/2019 Measures of Central Tendency Final
5/39
Quantitative Aptitude 2.5 Measures of Central Tendency
5.Multiply each frequency with respective deviation (fxd) and find their summation (fxd).6.Apply the following formula.
N
fdAX
Where, A = Assumed Mean dx= A); f = frequency And or N = total number of
items.
B.Weighted Arithmetic MeanThe term weight stands for the relative importance of the different items of the series.Weighted arithmetic mean refers to the arithmetic mean calculated after assigning weightsto different values of variable. It is suitable where the relative importance of different itemsof variable is not the same. It is especially useful in problems relating to:a. Construction of index numbers.b. Standardised birth and death rates.In case actual weights are available, actual weights should be used. In case actual weights
are not available, arbitrary weights may be used.The relationship between simple arithmetic mean and weighted arithmetic mean issummarized below:1. AM=WAM if equal weights are assigned to all the items of a series.2. AM>WAM if smaller weight is assigned to the higher values and greater weight is
assigned to the lower values.3. AM
-
7/28/2019 Measures of Central Tendency Final
6/39
Quantitative Aptitude 2.6 Measures of Central Tendency
3. The formula of arithmetic mean can be extended to compute the combined average oftwo or more related series.
4. If the given items ofx can be changed to items on Y = a + bx, then Y = a + b.5. If each of the values of a variable x is increased or decreased by some constant C, the
arithmetic mean also increases or decreases by C. Similarly, when the values of a
variable xare multiplied by a constant, say k, the arithmetic mean is also multiplied bythe same quantity k. when the values of variable x are divided by a constant say d,the arithmetic mean is also divided by the same quantity d.
6. The arithmetic mean of the first n natural numbers can be calculated by applying theformula:
Merits of Arithmetic Mean:1. It is easy to understand.2. It is simple to calculate.3. It is based on all the items of the series. In its computation no item is ignored.4. It is rigidly defined by a mathematical formula so that the same answer is derived by
every one who computes it.5. It is capable of further algebraic treatment so that its utility is enhanced. The formula of
arithmetic mean can be extended to compute the combined average of two or morerelated series.
6. It has sampling stability. It is least affected by sampling fluctuations.7. Its computation does not require arrangement of items since it is not based on position
in the series.8. It is characterized as a centre of gravity a point of balance, balancing the value on
either side of it.
9. Arithmetic average can be calculated if we know the number of items and aggregate.10. It provides a good basis for comparison.11. The mean is a more stable measurer of central tendency (Ideal average)
Limitations of Arithmetic Mean:1. Since it includes all the items, its value may be distorted by extreme values.2. It cannot be calculated if any item of the series is missing.3. The average may not coincide with any of the actual items in a series.4. In cases where the items cannot be represented quantitatively, like intelligence, honesty
and character but can be ranked, the arithmetic average is not an appropriate measure
of central tendency.5. It cannot be located by observation or the graphic method.6. It gives greater importance to bigger items of a series and lesser importance to smaller
items.7. It cannot be calculated in a distribution with open-ended class series and cumulative
series without converting the class series into exclusive series.8. It fails to provide a characteristic value or a representative value where the distribution
of the series is not normal and gives a U shaped curve rather than a bell shaped ( curve.
-
7/28/2019 Measures of Central Tendency Final
7/39
Quantitative Aptitude 2.7 Measures of Central Tendency
2. Geometric Mean:A. Simple Geometric Mean.B. Weighted Geometric Mean.C. Combined Geometric Mean.A.Simple Geometric Mean.Meaning of geometric mean:
Geometric mean is the nthroot of the product of n number of values. It means if there aretwo values then Geometric mean is square root of the product of two values. Symbolically,
G.M. = n nXXXX ........321
Where, X1, X2. refer to the values of various items of the seriesn = Total no. of items of the series.
Computation of Geometric Mean:Practical steps involved in the computation of G.M. in case of individual series
1.Take the logarithms of each item of variable, and enter in the column headed as log Xand obtain their total i.e., log X.
2. Calculate G.M. As follows:G.M. = Antilog
n
Xlog
Where, n = Total no. of items
Calculation of Geometric Mean Discrete SeriesLet X1, X2, X3.Xn be the given values of a variable and f1, f2, f3 fn, are their respectivefrequencies. Then the geometric mean is given by the formula.
Log g =n
nn
ffffXfXfXf
......log......loglog
321
2211
Log g =
n
i
iXf1
log
or g = Antilog
n
i
iXf1
log
Steps:1. Take log values of all the items of a given series.2. Multiply each log value to its respective frequencies3. Add the values and divide by the total number of frequencies4. Take the value of antilog from the antilog table and the result would be the geometric
mean.
Calculation of Geometric Mean Continuous SeriesIn case of continuous series, the following steps are observed.1. Convert the class series into exclusive series if given in the open-ended or cumulative
series. It may be noted that the mean can be easily calculated if the problem is given ininclusive series or mid-value series.
2. Take the mid-value of each of the class group and denote it by X.3. Take log value of these mid-values i.e. Log x.4. Multiply log values with their respective frequencies i.e. f x log X.
-
7/28/2019 Measures of Central Tendency Final
8/39
-
7/28/2019 Measures of Central Tendency Final
9/39
Quantitative Aptitude 2.9 Measures of Central Tendency
Note: If any three values of the four used in the above formula are known, the fourth valuecan be found out.Tutorial Note: In case annual rate of decrease in variable (e.g. depreciation on fixed assets)is to be calculated, the Compound Interest Formula will be modified as follows:Pn = Po (1-r) n
Properties of Geometric mean:1. The logarithm of G for a set of observationsx1,x2,x3, x4,.xn is the arithmetic mean of the
logarithm of the observation, i.e. Log G=
n
nxx klog.......loglog 21 .
2. If all the observations of a variable assumes the same constant value k, i.e. x1=x2=x3=x4.xn=k, then the geometric mean of these observations is also k.
3. For any series of positive values the geometric mean is smaller than the arithmeticmean.
4. If any value is zero, the value of geometric mean is infinity and thus inappropriate.5. It cannot be calculated if number of negative values is odd.6. Since geometric mean or G=
nXXXMG .....
21. , the geometric mean raised to the
nth power equals to the product of values.7. The geometric mean of two variables is the product of their G.M. i.e., if Z=xy, then G.M.
of Z= (G.M. ofx) x (G.M. of y).8. The geometric mean of the ratio of two variables is the ratio of the G.M. of these two
variables i.e., if Z=x/y. then G.M. of Z= (G.M. ofx) (G.M. of y).
Merits of Geometric mean
1.
It is based on all observations2. It is rigidly defined3. It is useful in averaging ratios and percentages and in determining rates of increase and
decrease.4. It is capable of algebraic treatment. Its formula can be extended to calculate combined
G.M. as follows:
12.M.G = Antilog
21
2211 ..log..log
NN
MGNMGN
5. It gives less weight to large items and more to small ones than does the arithmeticaverage. It is because of this reason that geometric mean is never larger than the
arithmetic mean, on occasions it may turn out to be same as the arithmetic mean, butusually it is smaller.
6. It is useful in studying economic and social data.Limitations of Geometric Mean1. It is difficult to understand.2. It is difficult to compute. Non-mathematical persons cannot do calculations.3. The Geometric mean cannot be computed if any item in the series is negative or zero.4. It has restricted application.5. It is biased for small values as it gives more weight to small values.6. At times it gives a value which may not be found in the series and even may be absurd.
-
7/28/2019 Measures of Central Tendency Final
10/39
Quantitative Aptitude 2.10 Measures of Central Tendency
3. Harmonic Mean:A. SimpleHarmonic Mean.B. WeightedHarmonic Mean.C. Combined Harmonic Mean.A. Simple Harmonic Mean.Meaning of Harmonic Mean:Harmonic mean of various items of a series is the reciprocal of the arithmetic mean of theirreciprocals.
Individual series:Steps to calculate H.M1. Identify the variable and denote it by X.2. Calculate the reciprocal of values of variable i.e. 1/x and find its summation.3. Apply the following formula.
nXXXX
nMH1
...111
..
321
Where, X1, X2..Xn refer to the value of various items of the series.n = Total no. of items of the series.
Note: To simplify the calculations, reciprocals of the various items are taken from themathematical tables.
Discrete series:Steps to calculate H.M
1. Identify the variable(x) and frequency (f) from the data given.2. Divide each frequency with the respective value of the variable i.e. (f/x) and find theirsummation i.e. (f/x).
3. Apply the following formula.
nx
f
x
f
x
f
x
f
NMH
...
..
321
Where, N= f.
Continuous Series:Steps to calculate
1. Convert the class series into exclusive series if given in the open-ended or cumulativeseries. It may be noted that the mean can be easily calculated if the problem is given ininclusive series or mid-value series.
2. Take the mid-value of each of the class group and denote it by X.3. Divide each frequency with the respective value of the variable i.e. (f/x) and find their
summation i.e. (f/x).4. Apply the following formula.
n
x
f
x
f
x
f
x
f
NMH
...
..
321
Where, N= f.
-
7/28/2019 Measures of Central Tendency Final
11/39
Quantitative Aptitude 2.11 Measures of Central Tendency
B. Weighted Harmonic MeanSteps to calculate:
1. Identify the variable(X) and weights (W) from the data given.2. Divide each weight with the respective value of the variable i.e. (w/x) and find their
summation i.e. (w/x).
3. Apply the following formula: xWW
Wx
Wx
Wx
WMH
n
n
w/1
....11
..
2
2
1
1
C. Combined Harmonic MeanSteps to calculate:1.Identify the number of observations in each group and denote them n1, n2, n3, n4nm.2.Identify the respective harmonic means H.M1, H.M2, H.M3, H.M4 H.Mm.3.Apply the formula:
2
2
1
1
21..
H
n
H
nnnMH
Properties of Harmonic Mean1. If all the observations of a variable assumes the same constant value k, i.e. x1=x2=x3=
x4.xn=k, then the geometric mean of these observations is also k.2. Harmonic mean can be computed from a series with any number of negative values.3. Harmonic mean cannot be computed from a series from a series if any of its value is
zero.
4. For any series in which all the values are not equal nor any value is zero, the value ofthe harmonic is less than the geometric mean and arithmetic mean.
5. If there are two groups with n1 and n2 observations and H1 and H2 as respective H.Msthen the combined H.M. is given by
2
2
1
1
21..
H
n
H
n
nnMH
Uses of Harmonic Mean:1. For the rates and ratios involving speed, time and distance, harmonic mean is used to
find out the average speed at which journey has been performed.
2.
For the rates and ratios involving price and quantity (both amount of money spent andthe units per rupee are given), harmonic mean is used. To find out the average price atwhich an article had been sold or purchased.
3. When it is desired to assign greater weight to smaller values and smaller weight tolarger values of a variate, its use is recommended.
4. In a given data set if there are a few large values, the reciprocals will turn down theeffect of large numbers. In such cases harmonic mean is to be used.
Merits:1. It is rigidly defined.2. It is based on the all the observations of the series.3. It is suitable in case of series having wide dispersion.
-
7/28/2019 Measures of Central Tendency Final
12/39
Quantitative Aptitude 2.12 Measures of Central Tendency
4. It is suitable for further mathematical treatment.5. It gives less weight to large items and more weight to small items.6. It is the most suitable average for measuring the time, speed, etc.,Demerits:
1.
It is difficult to calculate and not easy to understand.2. All the values must be available for computation.3. It is not popular.4. It is usually a value which does not exist in series.5. It cannot be computed when one or more items are zero6. It gives largest weight to smallest items. Hence, it is not useful for analyzing the
economic data.
Relationship among the Averages
(a)where values of all items are identical(b)where values of all items are not identical A.M. = G.M. = H.M.A.M. > G.M. > H.M.
B. Positional Averages:1. Median
Meaning of Median:Median is the central value of the variable that divides the series into two equal parts insuch a way that half of the items lie above this value and the remaining half lie below thisvalue. Median is called a positional average because it is based on the position of a givenobservation in a series arranged in an ascending or descending order and the position ofthe median is such that an equal number of items lie on either side of it. It concentrates onthe middle or centre of a distribution. According to L.R. Connor, Median is that value of
the variable which divides the group into two equal parts, one part comprising all thevalues greater, and the other all values less than the Median. Median is usually denotedby Med. or Md. Median can be computed for both ungrouped data (Individual Series) andgrouped data (or Discrete/continuous Series).
Calculation of MedianIndividual Series
To find the value of Median in this case the values are arranged in ascending or descendingorder first; and then the middle most value is taken as Median.
The terms are arranged in ascending or descending order and then
2
1nth term is taken as
Steps to Calculate1. Arrange the terms in ascending or descending order2. Count the number of terms. Put = n3. Calculate
2
1nth value,
2
1nth value is Median.
Discrete SeriesSteps to Calculate1. Arrange the data in ascending or descending order2. find cumulative frequencies
-
7/28/2019 Measures of Central Tendency Final
13/39
Quantitative Aptitude 2.13 Measures of Central Tendency
3. find the value of the middle item by using the formula; Size of
2
1Nth item
4. By inspecting cumulative frequencies, Find out that cumulative frequency in thecumulative frequency column which is equal to
2
1Nth or next greater than th
value.5. Locate the value of the variable corresponding to that cumulative frequency. This is the
value of median.
Continuous series:Steps to Calculate:
1. Convert the class series into exclusive series if they are given in inclusive series, mid-value series or cumulative series. Median can be calculated if class series are given inexclusive series or open-ended series.
2. Arrange the data in ascending order.3. Find the cumulative frequencies.4. Find the Median class by ascertaining the class where the cumulative frequency is equal
to or greater than N/2.
5. Apply the formula: M = L + cf
fCN o
.1
N1 = N/2
C.fo = cumulative frequency preceding the cumulative frequency of median class.F = frequency of the median class.C = class interval of the median class.
Mathematical property of median
1. If the two variablesxand y are related by Y= a+bx, where a and b are the constants.Also the value of median ofx is given, then Ymd =a+bxmd.
2. The sum of the absolute deviations (i.e., deviations ignoring signs) from the medianis minimum i.e.,
dMX is the minimum.
Merits of median1. It is easy to understand.2. It is quite rigidly defined.3. It does not require all the observations of the data for its determination.4. Median is useful in case of open-ended series such as income distribution since median
is based on the position and not on the values of items.5. Median is easier to compute than mean in case of unequal class-intervals.6. It is not affected by extreme values.7. It is most suitable average for dealing with qualitative data i.e., where ranks are given.8. It can be determined graphically.9. It is capable of being expressed in qualitative form as it is not computed but located.10.It gives a value, which very much exists in the series, and is a round figure in most of
the cases.11.It minimizes the total absolute deviations.
2
1N
-
7/28/2019 Measures of Central Tendency Final
14/39
Quantitative Aptitude 2.14 Measures of Central Tendency
Limitations of median1. It requires the arrangement of data in ascending/descending order.2. It is not rigidly defined and as such its value cannot be computed but located.3. It is not based on all items of the series.4. It is not capable of algebraic treatment. Its formula cannot be extended to calculate
combined Median of two or more related groups.5. It is affected more by sampling fluctuations than the value of mean.6. The computation formula of a median is in a way an interpolation under the
assumption that the items in the median class are uniformly distributed, which is notvery true, i.e. In case of continuous series, the median is estimated, but not calculated.
7. Typical representative of the observations cannot be computed if the distribution ofitem is irregular. For example 1, 2, 3, 100 and 500, the median is 3.
8. It ignores the extreme items.9. Where the number of items is large, the prerequisite process. i.e. arraying the items is a
difficult process.
Partition valuesPartition values are the positional measures which divide the series into equal parts say 4equal parts or 10 equal parts or 100 equal parts. The most popular partition values used areQuartiles, Octiles, Deciles and Percentiles.
QuartilesMeaning of QuartilesThe values of a variate that divide the series or the distribution into 4 equal parts areknown as Quartiles. Since three points are required to divide the data into 4 equal parts, we
have three quartiles Q1, Q2, Q3.Range
Q1 Q2 Q3Median = Q2The first quartile (Q1), known as a lower quartile, is the value of a variate below whichthere are 25% of the observations and above which there are 75% of the observations.The second quartile (Q2) known as middle quartile or median is the value of a variatewhich divides the distribution into two equal parts. it means, there are 50% observations
above it and 50% below it.The third quartile (Q3), known as an upper quartile, is the value of a variate below whichthere are 75% observations and above which there are 25% observations.It is clear that Q1 < Q2 < Q3.
Computation of QuartilesThe procedure for computing quartiles is the same as the median as explained earlier.I. In case of Individual and discrete Series (after arranging the size of item in
ascending/descending order)
Q1 = Size of
4
1Nth item, Q2 = Size of
4
12 Nth item, and Q3 = size of
4
13 Nth item
-
7/28/2019 Measures of Central Tendency Final
15/39
Quantitative Aptitude 2.15 Measures of Central Tendency
II. In case of Continuous Series (i.e., frequencies with Class Intervals)c
f
fcN
LQcf
fcN
LQcf
fcN
LQ
..
4
3
,
..4
2
,
..4
321
Where, L = Lower limit of quartile class
c.f = Cumulative frequency preceding the quartile classf = Simple frequency in the quartile classc= class-interval of quartile class
OctilesMeaning of OctilesThe value of a variate which divides a given series or the distribution into 8 equal parts areknown as octiles. Each octile contains 12.5% of the total number of observations. Sinceseven points are required to divide the data into 8 equal parts, we have 7 octiles in thedistribution.O1 to O7.Computation of Octiles
I. In case of Individual and Discrete Series (after arranging the size of items inascending/descending order)
8
)1(
NjofsizeOj
th item where, j = 1 to 7
(e.g. O4 = size of8
)1(4 Nth item
II. In case of continuous series (i.e., frequencies with class-intervals)i
f
fcjN
LOj
..
8 Where, j = 1 to 7
Where c.f. is the cumulative frequency preceding the j th octile class, the other symbols haveusual meaning.
DecilesMeaning of DecilesThe values of a variate that divide the series or the distribution into 10 equal parts arecalled Deciles. Each part contains 10% of total observations. Since nine points are requiredto divide the data into 10 equal parts, we have 9 deciles, D 1 to D9. They are called firstdecile, second decile, etc., The 5th decile (D5) is the median.Computation of Deciles
I. In case of Individual and Discrete Series (after arranging the size of items inascending/descending order)
Dj = Share of
10
1Nj th item. Where j = 1 to 9
II. In case of continuous series (i.e., frequencies with Class Intervals)i
f
fcjN
LDj
.10 Where j = 1 to 9
Where c.f. is the cumulative frequency preceding the j th decile class, the other symbols have
usual meaning.
-
7/28/2019 Measures of Central Tendency Final
16/39
Quantitative Aptitude 2.16 Measures of Central Tendency
PercentilesMeaning of PercentilesThe value of a variate which divides a given series or distribution into 100 equal parts areknown as percentiles. Each percentile contains 1% of the total number of observations.Since ninety nine points are required to divide the data into 100 equal parts, we have 99
percentiles P1 to P99. The percentile Pj is that value of the variate upto which lie exactly j% ofthe total number of observations. For ExampleP10= Value of a variate upto which lies exactly 10% of observations. This is same as D 1.P25= Value of a variate upto which lies exactly 25% of the total number of observations. This
is same as Q1.P50= Value of a variate upto which lies exactly 50% of the total number of observations. This
is the same as D5 or Q2, or median.
Computation of PercentilesI. In case of Individual and Discrete Series after arrangement of the size of items in
ascending/descending order)
Pj = size of100
)1( Njth item where j = 1 to 99
II. In case of Continuous Series (i.e. frequencies with Class Intervals)Pj = i
f
fcjN
L
.100 where j = 1 to 99
Where, c.f. is the cumulative frequency preceding the j th percentile class. The other symbolshave usual meaning.
ModeMeaning of mode:Mode is often said to be that value in a series which occurs most frequently or which hasthe greatest frequency. But it is not exactly true for every frequency distribution. Rather it isthat value around which the items tend to concentrate most heavily. It is also called themost typical or fashionable value of a distribution because it is the value which has thegreatest frequency density in its immediate neighbourhood. It is usually denoted by M0.It maybe noted that a distribution may have one mode or two modes or several modes.
Unimodal A distribution is said to be unimodal if it has only one mode.
Bimodal A distribution is said to be bimodal if it has two modes.Multimodal A distribution is said to be multimodal if it has more than two modes.Mode may be calculated for ungrouped data or individual series and grouped data ordiscrete and continuous series.
Calculation of Mode:
Individual Series:The terms are arranged in any order, Ascending or Descending, if each term of the series isoccurring once, then there is no mode, otherwise the value that occurs Maximum numbertimes is known as Mode.
-
7/28/2019 Measures of Central Tendency Final
17/39
Quantitative Aptitude 2.17 Measures of Central Tendency
Method to calculate Mode1. Arrange the terms in ascending or descending order2. Note the term occurring maximum number of times if any.3. This term is mode.Note: If all terms occur once or some terms occur equal number of times, we cant find
Mode by this method)
Discrete SeriesThe mode can be determined by Inspection Method or grouping method.Under Inspection method Mode is that value of the variable which has the highestfrequency.But this method is applicable if the following conditions are fulfilled:1. There must be one item which has maximum frequency.2. The maximum frequency should be located at most in the middle or centre of the series.3. The frequencies should follow a particular pattern, i.e., they should gradually rise and
reach maximum level and thereafter decline.4. The difference between highest frequency and second highest frequency should belarge.
However, some times it becomes impossible to locate Mode by inspection as concentrationof frequencies is not in a unique manner or fashion as desired for this method. In such casesgrouping method should be used to find the modal value. And for such a distribution wehave to prepare the (1) grouping Table and (2) Analysis Table
1. Grouping Table:It has Six Steps as given below:1. Take the Frequency column as column number 12. Frequencies are added in groups of two(s) and leave the incomplete group.3. Leaving first frequency, add frequencies in groups of two(s) & leave the incomplete
group.4. Add the frequencies in groups of three(s) & leave the incomplete group.5. Leaving first frequency, add the frequencies in the groups of three(s) and leave the
incomplete group.6. Leaving first two frequencies, add frequencies in groups of three(s) and leave the
incomplete group.In each case, take maximum total and put it in a circle or a box to distinguish it from others.
2. Analysis Table:It has following steps:1. Note highest value in each column.2. Note the variable(s) in each column corresponding to that total.3. Check if that total is of individual items or of group of many items (2 or 3).4. It the total consists of two or more frequencies, all such variables have to be marked
with the signs as or .5. Count or marks in each column.6. Variable with maximum or marks denotes mode.
-
7/28/2019 Measures of Central Tendency Final
18/39
Quantitative Aptitude 2.18 Measures of Central Tendency
Continuous Series:In the case of Continuous Series, we go only one step ahead of the method for discreteseries. We get the value of Mode by Interpolation as is the case with Median.Steps involved:1. Convert the class series into exclusive series if it is given in inclusive series, mid-value
series or cumulative series. Mode can be calculated the series is exclusive or open-endedseries.
2. Arrange the data in the ascending order.3. Convert the class intervals into equal class intervals if they are unequal.4. Find out the modal class either by inspection method or grouping method.5. Apply the formula: c
fff
ffLMode
201
01
2
Where, L = is the lower limit of Modal Interval
1f is the frequency corresponding to Modal Interval
0
fis the frequency preceding Modal Interval
2f is the frequency succeeding Modal Interval
C is the length of Modal Interval.We can put this formula in following shape also:
Mode cDD
DL
21
1
Here; 011 ffD
212 ffD
Here only the positive values are takenMode can also be calculated taking upper Limit.
cfff
ffLZMode
201
21
2)(
Or cDD
DLZMode
21
1)(
Here: L is the upper limitOther values are the same as in above given formula.Note:
1. If first class is the modal class then 0f will be zero.2. Similarly if last class is modal class, then 2f is zero.3. If Modal Value lies in any other interval than with highest frequency, the following
method is suggested to calculate method
iff
fLMode
20
2
Merits of mode1. It is easy to understand as well as easy to calculate. In certain cases, it can be found out
by inspection
2. It is usually an actual value as it occurs at the most highest frequency in the series.
-
7/28/2019 Measures of Central Tendency Final
19/39
Quantitative Aptitude 2.19 Measures of Central Tendency
3. The mode is a measure which actually indicates what many people incorrectly believethe arithmetic mean indicates. In certain situations mode is the only suitable average,e.g., modal size of garments, modal size of shoes, modal wages, etc.,
4. It is not affected by extreme values.5. Its value can be determined in open-end distribution without ascertaining the class-
limits.6. It can be used to describe qualitative phenomenon. For e.g. in market research to
determine the consumer preferences for different types of products, modal preferencecan be used.
7. Its value can be determined graphically.8. It indicates the point of maximum concentration in case of highly skewed or non-normal
distributions.
Limitations of Mode1. In case of bi-modal/multimodal series, mode cannot be determined.2. It is not capable of further algebraic treatment. For example, combined mode of two ormore series cannot be calculated.3. It is not based on all the items of series.4. It is not rigidly defined measure because different formulae give somewhat different
answers.5. Its value is affected significantly by the size of the class-intervals.6. It is difficult to compute when there are both positive and negative items in a series and
when there is one or more items are zero.7. It is stable only when the sample is large.8. Mode is influenced by magnitude of the class-intervals9. It may be give weight to extreme items.Properties of mode1. If the two variables xand y are related by Y= a+bx, where a and b are real constants
variates and Also the value of mode of x is given, then the mode of Y is given by theequation Ymo =a+bxmo.
Relationship between Mean, Median and ModeEmpirical relationship between Mean, Median and Mode was discovered and propoundedby Prof. Karl Pearson. Through his experiments he found that in a moderately skeweddistribution the distance between Mean and the median is about one-third the distance
between the mean and the mode. Thus, Karl Pearson has expressed the relationship asfollows:
Mode = 3 Median 2 MeanMedian= (Mode + 2 Mean)/3Mean = (3 Median Mode)/2
MOVING AVERAGESMeaning: Moving average is an arithmetic average of data arising over a period of timeand is calculated by replacing the first item in the average by the newly arising item. Eachmoving average is based on values covering a fixed time span which is called period of
moving averages.
-
7/28/2019 Measures of Central Tendency Final
20/39
Quantitative Aptitude 2.20 Measures of Central Tendency
UsefulnessThe successive averaging process does a smoothing operation in the time series data, i.e., itirons out fluctuations of uniform period and intensity. They can be completely eliminatedby choosing the period of moving average that coincides with the period of the cycle.
LimitationThe moving averages cannot be computed for all the given years.
Progressive AverageMeaning Progressive average is an arithmetic mean of the progressive total (i.e.,cumulative total).
ComputationProgressive totals of variable are calculated and then divided by the progressive (i.e.,cumulative) number of items. No previous figure is left as is done in the case of movingaverage. The progressive average for the first year would remain the same; the progressive
average for the second year is equal to2ba ; for the third year
3cba , for the fourth year
4
dcba , and so on.
UsefulnessProgressive averages are useful in order to find out how a business/economy is growing.Such averages can be obtained for all the years for which the data are given.
Choice of a suitable averageNo single average can be considered as best for all circumstances. The suitability of a
particular average depends on the following factors:1. Purpose: The choice should be made according to the purpose that an average isdesigned to serve. Following guidelines are suggested in this regard:
Purpose Suitable average(a)To give equal importance to all the items of series (a)Arithmetic Mean(b)To locate the position of an item in relation to other
items(b)Median and other partition
values
(c)To find out the most common or most fashionableitem
(c)Mode(d)To give more importance to small items than big
items
(d)Geometric Mean(e)To give greatest importance to small items (e)Harmonic Mean2. Nature and form of Data: The choice should be made according to the nature and form
of data available. Following guidelines are suggested in this regard:
(a)For open-end distributions particularly when thedistribution is found to be j-shaped or reverse j-shaped. For example, price distribution andIncome Distribution
(a)Median
(b)To describe qualitative data. For example, to studythe consumer preferences for different products
(b)Mode
-
7/28/2019 Measures of Central Tendency Final
21/39
Quantitative Aptitude 2.21 Measures of Central Tendency
(c)To compute average rates of increase/decrease,average ratios, averages percentages
(c)Geometric Mean(d)To compare the value of a variable with another
variable which is constant. For examples varyingspeed with constant distance, varying quantities
bought/sold per rupee.
(d)Harmonic Mean
(e)In other cases (e)Arithmetic MeanNote: Arithmetic mean should not be used in the following cases:1. In case of highly skewed distributions.2. In case of open-end distribution.3. When there are extreme items i.e., very large and very small items.4. To compute average ratios and rates of change.5. When the distribution is unevenly spread, concentration being small or large at
irregular points.
3. Amenability to further Algebraic Treatment: If an average is to be used for furtheralgebraic treatment, arithmetic mean is considered to be the best as it is very widelyused.
4.Special Purposes: For calculating trend in time-series analysis, the moving averagewould be the most suitable average.
-
7/28/2019 Measures of Central Tendency Final
22/39
Quantitative Aptitude 2.22 Measures of Central Tendency
MEASURES OF DISPERSIONMeaning of Dispersion:The word dispersion means deviation, difference or spread over of certain values from theircentral value. In relation to a statistical series, dispersion refers to the deviation of thevarious items of a series from their central value or difference between any two extremevalues of the series. Further, the word measure means a method of measuring orascertaining certain values. Thus, the phrase measures of dispersion means the variouspossible methods of measuring the dispersion or deviations of the different values froma designated value of the series. The designated value may be average value or any otherextreme value. Dispersion is also known as Scatter, Spread or Variation, It may be notedthat the measures of dispersion measure only the degree (i.e., the amount of variation)but not the direction of the variation. The measures of dispersion are also called averagesof the second order because these measures give an average of the differences of variousitems from an average.
Characteristics of dispersion:1. It consists of different methods through which variation can be measured in
quantitative manner.2. It deals with a statistical series.3. It indicates the degree or extent to which various items of a series deviate from its
central value.4. It supplements the measures of central tendency in revealing the characteristics of a
frequency distribution.5. It speaks of the reliability or otherwise of the average value of a series.Significance of measuring DispersionMeasures of dispersion are calculated to serve the following purposes:1. To determine the reliability of an average Measuring variability determines the
reliability of an average by pointing out to what extent the average is representative ofthe entire data.
2. To facilitate comparison Measures of dispersion facilitate comparisons of two or moredistributions with regard to their variability.
3. To facilitate control Measures of dispersion determine the nature and cause ofvariation in order to control the variation itself.
4. To facilitate the use of other statistical measures Measuring variability facilitates theuse of other statistical measures such as correlation, regression, statistical inference, etc.,
Properties of a good measure of DispersionSince a measure of dispersion is the average of the deviations of items from an average, itshould also posses all the qualities of a good measure of an average. According to Yule andKendall, the qualities of a good measure of dispersion are as follows: -1. Simple to Understand It should be simple to understand2. Easy to Calculate It should be easy to calculate.3. Rigidly Defined - It should be rigidly defined. For the same data, all the methods
should produce the same answer. Different methods of computation leading to different
answers are not proper.
-
7/28/2019 Measures of Central Tendency Final
23/39
Quantitative Aptitude 2.23 Measures of Central Tendency
4. Based on all Items It should be based on all items. When it is based on all items, it willproduce a more representative value.
5. Amenable to further Algebraic Treatment It should be amenable to further algebrictreatment. This means combining groups, calculation of missing values, adjustment forwrong entries, etc., which are possible without the knowledge of actual values of all
items. Such treatment should be possible with a good measure of dispersion also.6. Sampling Stability It should have sampling stability. It means that the average
difference between the values obtained from the sample and the corresponding valuesfrom the population should be the least. If it is so far a measure of dispersion, it is thebest measure.
7. Not Unduly Affected by Extreme Items It should not be unduly affected by theextreme items. Extreme items, many times, are not true representatives of the data. Sotheir presence should not affect the calculation to a large extent.
Measures of Dispersion
Measures of Dispersion may be either absolute or relative.1.Absolute Measure of Dispersion Absolute measure is the measure of dispersion which
is expressed in the same statistical unit in which the original data are given such asKilograms, Tonnes, Kilometers, and Rupees etc. These measures are suitable forcomparing the variability in two distributions having variables expressed in the sameunits and of the same average size. These measures are not suitable for comparing thevariability in two distributions having variables expressed in different units. Followingare the absolute measures of dispersion:
2. Relative Measures of Dispersion Relative measure of dispersion is the ratio of ameasure of absolute dispersion to an appropriate average or the selected items of thedata. It may be noted that the same average base should be used as has been used whilecomputing absolute dispersion. Relative measures of dispersion are also called asCoefficient if Dispersion because relative measure is a pure number that isindependent of the unit of measurement. Following are the relative measures ofdispersion
Absolute Measures of Dispersion
Based on Selected Items Based on All Items
1. Range2. Inter-Quartile Range 1. Mean Deviation2. Standard Deviation
-
7/28/2019 Measures of Central Tendency Final
24/39
Quantitative Aptitude 2.24 Measures of Central Tendency
1. RangeRange is defined as the difference between the value of largest item and the value ofsmallest item included in the distribution. Measures of range may be absolute or relative.
Individual seriesSteps to calculate:1. Ascertain largest value and smallest value.2. Calculate Range by taking out the difference between largest value and smallest value:
Range = Largest value Smallest value.3. Calculate coefficient of Range by dividing the difference between largest value and
smallest value by the aggregate of largest value and smallest value.
Coefficient of Range = L-S/L+S.
Discrete seriesSteps to calculate:1. Identify the variable(x) and frequency (f) from the data given.2. Ascertain largest and smallest value from the values of variable.3. Calculate Range by taking out the difference between largest value and smallest value:
Range = Largest value Smallest value.4. Calculate coefficient of Range by dividing the difference between largest value and
smallest value by the aggregate of largest value and smallest value.
Coefficient of Range = L-S/L+S.
Tutorial Note: Frequencies are not considered at all for computing Range and itsCoefficient.
ContinuousseriesSteps to calculate:1. Convert the class series into exclusive series if given in any other series.2. Ascertain largest and smallest value from the values of variable i.e. Ascertain the upper
limit of highest class and lower limit of lowest class.3. Calculate Range by taking out the difference between largest value and smallest value:
Range = Largest value Smallest value.4. Calculate coefficient of Range by dividing the difference between largest value and
smallest value by the aggregate of largest value and smallest value.
Relative Measures of Dispersion
Based on Selected Items Based on All Items
1. Coefficient of Range2. Coefficient of quartile 1. Coefficient of Mean Deviation2. Coefficient of Standard Deviation
or Coefficient of Variation
-
7/28/2019 Measures of Central Tendency Final
25/39
Quantitative Aptitude 2.25 Measures of Central Tendency
Coefficient of Range = L-S/L+S.Tutorial Note: Frequencies are not considered at all for computing Range and itsCoefficient.
Interpretation of Range
If the averages of the two distributions are almost same, the distribution with smaller rangeis said to have less dispersion and the distribution with larger range is said to have moredispersion.
Merits of RangeThe merits/advantages of range are as follows:1. It is easy to understand.2. It is easy to calculate3. It does not require any special knowledge.4. It takes minimum time to calculate the value of range.Limitations of rangeThe demerits/disadvantages/limitations of range are as follows:1. It does not take into consideration all items of the distribution2. Only two extreme values are taken into consideration3. It is affected by extreme values4. It is not possible to find out the range in open-end frequency distribution5. It does not present very accurate picture of the series.6. It is not suitable for mathematical treatment7. It does not indicate the direction of the variability.Uses of Range:1. Range is used in industries for the statistical quality control of the manufactured
product by the construction of Control Chart. If the range increases beyond a certainpoint, the product may be examined to find out the reasons for variations.
2. Range is useful in studying the variations in the prices of stock, shares and othercommodities that are sensitive to price changes from one period to one period.
3. The meteorological department uses the range for weather forecasts since public isinterested to know the limits within which the temperature is Likely to vary on aparticular day.
Properties of range
1. If all the observations assumed by a variable are constant, say k, then the range is zero.2. Range remains unaffected due to a change of origin but is affected in the same ratio due
to a change in scale, if for any two constants a and b, the two variables x and y arerelated by Y= a+bx, then the range of y is given by Ry=|b|x Rx.
Quartile Deviation:Meaning Of Quartile Deviation or Semi-Inter Quartile Range:Quartile Deviation (or Semi-Inter Quartile Range) is half of the difference between upperquartile (Q3) and lower quartile (Q1). Quartile deviation indicates the average amount bywhich the two quartiles differ from the median. In symmetrical distribution the two
quartiles (Q1
and Q3
) are equidistant from the median [i.e., Median Q1
= Q3
- median]. As
-
7/28/2019 Measures of Central Tendency Final
26/39
Quantitative Aptitude 2.26 Measures of Central Tendency
a result Median Q.D. covers exactly 50% of the observations of the distribution.Measures of Quartile Deviation may be absolute or relative.
Individual series, Discrete series and Continuous series.Steps to calculate:
1.
Calculate lower quartile (Q1)2. Calculate Upper Quartile (Q3)3. Calculate Quartile deviation and its coefficient by applying the formula given below.Measure of Quartile Deviation Formula
1. Absolute Measure of QuartileDeviation
Q.D. =2
13 QQ
Q.D. = Quartile DeviationQ3 = Third Quartile or Upper QuartileQ1 = First Quartile or Lower Quartile
2. Relative Measure of QuartileDeviation
Coefficient of Q.D. =13
13
QQ
QQ
Tutorial Note: In case of symmetrical distribution, Median = (Q3 + Q1)/2
Merits of Quartile Deviation:1. It is easy to understand2. It is easy to calculate3. It can be used in Open-end frequency distribution4. It is lest affected by extreme items.Limitations of Quartile Deviation:
1. This method of dispersion is not based on all the items of the series. It ignores the first25% of the items and the last 25% of the items.
2. It is a positional average; hence not amenable to further mathematical treatment.3. It is very much affected by sampling fluctuations4. It is not a proper measure of dispersion since it shows the distance on a scale and not
the scatter around an average. It is a positional average and is not measured from anaverage.
5. It gives only a rough measurement.Properties of Quartile Deviation
1. Quartile Deviation remains unaffected due to a change of origin but is affected in thesame ratio due to a change in scale, if for any two constants a and b, the two variables xand y are related by Y= a+bx, then the Quartile Deviation of y is given byQ.D of y =|b|x Q.D ofx.
Mean DeviationMeaning of mean deviationMean Deviation is the arithmetic mean of the absolute deviations (i.e., deviations treated aspositive regardless of their actual signs) of all items of the distribution from a measure ofcentral tendency, which could be mean or median or sometimes even mode. It is alsoknown as first moment of dispersion.
-
7/28/2019 Measures of Central Tendency Final
27/39
Quantitative Aptitude 2.27 Measures of Central Tendency
Individual seriesSteps to calculate:1. Calculate Mean, Median or mode of the series.2. Take deviations |D| from either mean, median or mode ignoring +(-) signs. |D| is
difference between value of the variable (x) and mean, median or mode.
3. Calculate the total of these deviations i.e. |D|.4. Calculate Mean deviation by dividing the sum of deviations by total no. of observations.5. Calculate Coefficient of Mean Deviation by applying formula given below.Discrete seriesSteps to calculate:1. Calculate Mean, Median or mode of the series.2. Take deviations |D| from either mean, median or mode ignoring +(-) signs. |D| is
difference between value of the variable (x) and mean, median or mode.3. Calculate the total of these deviations i.e. |D|.4. Multiply each deviation by its corresponding frequency and obtain the total of theseproducts i.e. f|D|.5. Calculate mean deviation by dividing the total of deviations by total no. of observations.6. Calculate Coefficient of Mean Deviation by applying formula given below.Continuous seriesSteps to calculate:1.Convert the class series into exclusive series if given in any other series.2.Calculate mid-values and denote them by x.3.Calculate Mean, Median or mode of the series.4.Take deviations |D| from either mean, median or mode ignoring +(-) signs. |D| isdifference between value of the variable or mid-values (x) and mean, median or mode.5.Calculate the total of these deviations i.e. |D|.6.Multiply each deviation by its corresponding frequency and obtain the total of these
products i.e. f|D|.7.Calculate mean deviation by dividing the total of deviations by total no. of observations.8.Calculate Coefficient of Mean Deviation by applying formula given below.
Measures of Mean Deviations Formula
1. Absolute Measure of Mean Deviations(i)
For Individual Observations M.D. =
ND
Where, M.D. = Mean Deviation
D = Deviations of items from
Mean or MedianIgnoring signs
N = No. of Observations
(ii) For Discrete Series (i.e., whenfrequencies are given) M.D. = N
Df
M.D. = Mean Deviation
D = Deviations of Items from
-
7/28/2019 Measures of Central Tendency Final
28/39
Quantitative Aptitude 2.28 Measures of Central Tendency
Mean or Medianignoring signs
N = No. of Observations
where, Df =Product of Deviations
and respective frequencyN = No. of Total Frequencies
(iii) For Continuous Series (i.e.,when both classes andfrequencies are given)
M.D. =N
Df
Where M.D. = Mean Deviation
D = Deviation of Mid-Points of
Classes from Mean orMedian
Df = Product of Deviations and
respective frequencyN = No. of Total Frequencies
2. Relative Measure of MeanDeviation
(i) Coefficient of M.D. about Mean(ii) Coefficient of M.D. about
Median(iii) Coefficient of M.D. about Mode
=Mean
MeanaboutM.D.
=Median
MedianaboutM.D.
=Mode
ModeaboutM.D.
Interpretation of mean Deviation
Nature of Distribution Range of Mean Mean Deviation1. Normal Distribution2. Moderately Skewed Distribution
Approximately 57.7% of the items expectedto fall in this rangeApproximately 57.5% of the items expectedto fall in this range
Amount of mean Deviation Interpretation
I. Small Mean Deviation
II. High Mean Deviation
Distribution is highly uniform since morethan 50% of the items fall within smallrange around the meanDistribution is not uniform since more than
50% of the items do not fall within smallrange around the mean
Merits of Mean deviation:1. It is simple to understand.2. It is easy to calculate as compared to standard deviation.3. M.D. is a calculated value.4. It is not much affected by the fluctuations of sampling5. It is based on all items of the series and gives weight according to their size.6. It is less affected by extreme values than the standard deviation.7. It is rigidly defined.8. It is flexible, because it can be calculated from any measurer of central tendency.
-
7/28/2019 Measures of Central Tendency Final
29/39
Quantitative Aptitude 2.29 Measures of Central Tendency
9. It is a better measure for comparison.10.It is useful for small samples when no detailed analysis is required.Demerits of mean Deviation:1. It lacks algebraic properties such as (+) and (-) signs which are not taken into
consideration. All the deviations are treated as positive.2. Algebraic positive and negative signs are completely ignored. In mean deviationmeasure +5 and 5 have the same meaning. It is mathematically unsound and illogical.
3. It is not a very accurate measure of dispersion.4. It is not suitable for further mathematical calculation.5. It is rarely used it is not as popular as standard deviation.6. It is much affected by sampling fluctuations.7. It is not capable of further algebraic treatment.8. It may not give accurate result when the degree of variability in a series is very high.Uses of mean deviation:It will help to understand the standard deviation. It is useful in marketing problems. It isuseful while using small samples. It is used in statistical analysis of economic business andsocial phenomena. It is useful in calculating the distribution of wealth in a community or anation. It is useful in forecasting business cycles.
Properties of mean deviation1. If all the observations assumed by a variable are constant, say k, then the mean deviation
is zero.2.Mean Deviation about median is the least. In other words, the sum of the deviations
taken from median is the minimum.
3. Mean Deviation remains unaffected due to a change of origin but is affected in the sameratio due to a change in scale, if for any two constants a and b, the two variables x and yare related by Y= a+bx, then the Mean Deviation of y is given by M.D. of y=|b|x M.D ofx.
Standard deviationMeaning of standard deviation or root mean square deviationStandard Deviation is the square root of the arithmetic mean of the squares of deviations ofall items of the distribution from arithmetic mean.
Measures of standard deviationMeasures of Standard Deviation may be absolute or relative
Individual seriesA.When the deviations are taken from the actual mean.Steps involved.1. Calculate the Actual Mean (X) of the series.2. Take deviations from the actual mean by taking out the difference (x-x).3. Square these deviations and obtain the total. (x-x).4. Calculate Standard Devotion () by applying the formula given below.5. Calculate Coefficient of Variation (C.V) by dividing the standard deviation by actual
mean.
-
7/28/2019 Measures of Central Tendency Final
30/39
Quantitative Aptitude 2.30 Measures of Central Tendency
B.When actual values are taken:Steps involved.1. Identify the variable and denote it by x.2. Square the values and find summation of x and x2.3. Calculate Standard Devotion () by applying the formula given below.4. Calculate Coefficient of Variation (C.V) by dividing Standard Deviation by Actual
Mean.
C.When deviations are taken from the assumed mean.Steps involved.5. Identify the variable and denote it by x.6. Take any value of the variable as Assumed Mean i.e. A.7. Take deviations from the Assumed Mean by taking out the difference (X-A) between the value
of the variable (X) and the assumed mean (A) and denote the deviations byd and obtain thetotal of these deviations i.e. d.
8.
Square these deviations and obtain the total.d
.9. Calculate Standard Devotion () by applying the formula given below.10.Calculate Coefficient of Variation (C.V) by dividing Standard Deviation by Actual
Mean.
Discrete seriesA.When the deviations are taken from the actual mean.Steps involved.1. Calculate the Actual Mean (x) of the series.2. Calculate Deviations from the actual mean by taking out the difference (x-x) between the
value of item (x) and the actual mean (x).
3. Square these deviations (x-x).4. Multiply the squared deviation of each row with the respective frequency and obtain
the total of these products i.e. f(x-x).5. Calculate the Standard Deviation () by applying the formula given below.6. Calculate Coefficient of Variation (C.V.) by dividing standard deviation by actual mean.B.When actual values are taken:Steps involved.1. Identify the variable and denote it by x.2. Square the values of x.3. Multiply each value of x with the respective frequency and each value of x2 withrespective frequency.4. Find the summation of fx and fx2.5. Calculate Standard Devotion () by applying the formula given below.6. Calculate Coefficient of Variation (C.V) by dividing Standard Deviation by Actual
Mean.
C.When deviations are taken from the assumed mean.Steps involved.1. Identify the variable and denote it by x.2. Take any value of the variable as Assumed Mean i.e. A.
-
7/28/2019 Measures of Central Tendency Final
31/39
Quantitative Aptitude 2.31 Measures of Central Tendency
3. Take deviations from the Assumed Mean by taking out the difference (x-A) between the valueof the variable (x) and the assumed mean (A) and denote the deviations byd and obtain thetotal of these deviations i.e. d.
4. Square these deviations.5. Multiply the deviation (d) of each row by the respective frequency and obtain the total
of these products i.e. fd.6. Multiply the squared deviation (d) of each row the respective frequency and obtain the
total of these products i.e. fd2.7. Calculate Standard Devotion () by applying the formula given below.8. Calculate Coefficient of Variation (C.V) by dividing Standard Deviation by Actual
Mean.
Continuous seriesA.When the deviations are taken from the actual mean.Steps involved.
1.
Convert the class series into exclusive series if given in the open-ended or cumulativeseries. It may be noted that the mean can be easily calculated if the problem is given ininclusive series or mid-value series.
2. Take the mid-value of each of the class group and denote it by x.3. Calculate the Actual Mean (x) of the series.4. Calculate Deviations from the actual mean by taking out the difference (x-x) between the
value of item (x) and the actual mean (x).7. Square these deviations (x-x).8. Multiply the squared deviation of each row with the respective frequency and obtain
the total of these products i.e. f(x-x).
9.
Calculate the Standard Deviation () by applying the formula given below.10. Calculate Coefficient of Variation (C.V.) by dividing standard deviation by actual mean.B.When actual values are taken:Steps involved.1. Convert the class series into exclusive series if given in the open-ended or cumulative
series. It may be noted that the mean can be easily calculated if the problem is given ininclusive series or mid-value series.
2. Take the mid-value of each of the class group and denote it by x.3. Square the values of x.4.
Multiply each value of x with the respective frequency and each value of x
2
withrespective frequency.5. Find the summation of fx and fx2.6. Calculate Standard Devotion () by applying the formula given below.7. Calculate Coefficient of Variation (C.V) by dividing Standard Deviation by Actual
Mean.
C.When deviations are taken from the assumed mean.Steps involved.1. Convert the class series into exclusive series if given in the open-ended or cumulative
series. It may be noted that the mean can be easily calculated if the problem is given in
inclusive series or mid-value series.
-
7/28/2019 Measures of Central Tendency Final
32/39
Quantitative Aptitude 2.32 Measures of Central Tendency
2. Take the mid-value of each of the class group and denote it by x.3. Identify the variable and denote it by x.4. Take any value of the variable as Assumed Mean i.e. A.5. Take deviations from the Assumed Mean by taking out the difference (x-A) between the value
of the variable (x) and the assumed mean (A) and denote the deviations byd and obtain the
total of these deviations i.e. d.6. Square these deviations.7. Multiply the deviation (d) of each row by the respective frequency and obtain the total
of these products i.e. fd.8. Multiply the squared deviation (d) of each row the respective frequency and obtain the
total of these products i.e. fd2.9. Calculate Standard Devotion () by applying the formula given below.10.Calculate Coefficient of Variation (C.V) by dividing Standard Deviation by Actual
Mean.
Steps involved in the calculation of combined standard deviation.1. Calculate mean and standard deviation of each group.2. Calculate combined mean of given groups as follows.3. Calculate deviations of each group from combined mean and ignore signs.4. Calculate combined standard deviation by applying formula given below.
Measure of Standard Deviation Formula
1. Absolute Measure of Mean Deviation(i) For Individual Observations
(a) Actual Mean Method
N
x2
Where, = standard Deviation
x = (X - X ) i.e., deviationstaken from Actual Mean
x2 = Squares of DeviationsN = Total No. of observations
(b) Assumed Mean Method 22
N
d
N
d
Where, = Standard Deviations,
d = (X-A) i.e., deviations takenfrom Assumed Meand2 = Squares of DeviationsN = Total No. of observations
(iii) For Discrete Series(a) Actual Mean Method
N
fx2
Where, = standard Deviation
x = (X - X ) i.e., Deviationstaken from Actual Mean
x2 = Squares of Deviations
-
7/28/2019 Measures of Central Tendency Final
33/39
Quantitative Aptitude 2.33 Measures of Central Tendency
fx2 = Product of square ofdeviation and respectivefrequency
N = Total No. of frequencies
(b) Assumed Mean Method 22
N
fd
N
fd
Where, = Standard Deviationd = (X A) i.e., deviations taken
fromAssumed Mean
d2 = Squares of Deviationsfd2= Product of square of
deviation and respective
frequencyN = Total No. of frequencies
(c) Step Deviation Method(When a common factor is taken fromthe given data)
CN
fd
N
fd
22
Where, = Standard Deviation
d =C
AXi.e., deviations taken from
Assumed Mean and divided byCommon factor (c)
d2
= Squares of Deviationsfd2 = Product of square ofdeviation and respectivefrequency
N = total No. of frequencies
(iii) Continuous Series (i.e., when both theclass-intervals and frequencies are given)
(a) Actual Mean MethodN
fx2
Where, = Standard Deviation
x = (m - X ) i.e., deviations taken
from Actual Meanx2 = Squares of Deviations
fx2 = Product of square of deviationand
respective frequencyN = Total No. of frequencies
III S.D. of any two values = (Largest Value Smallest value)/2or = Range/2
-
7/28/2019 Measures of Central Tendency Final
34/39
Quantitative Aptitude 2.34 Measures of Central Tendency
Similarity between Mean deviation and standard deviationBoth the Mean Deviation and Standard Deviation measures of dispersion are based on eachand every item of the distribution.
Properties of standard deviation
The main properties of standard deviation are as follows:1. Independent of change of origin:Standard Deviation is independent of change of origin. In other words, the value ofstandard deviation remains the same if each of the observations in a series is increased ordecreased by a constant value. Thus, if Y = X + K where K is a constant quantity, thenstandard deviation of Y is equal to standard deviation of X.
2. Not Independent of change of scale:Standard Deviation is not independent of change of scale. In other words, the value ofstandard deviation is also affected if each of the observations in a series is multiplied or
divided by a constant value. Thus, if Y = Y x K or Y = Y/K where K is a constant quantity,then standard deviation of Y is not equal to standard deviation of X.
3. If all the observations assumed by a variable are constant, say k, then the standarddeviation is zero.
4. If for any two constants a and b, the two variablesx and y are related by Y= a+bx, thenthe Standard Deviation of y is given by S.D. of y=|b|x S.D ofx.
5. Fixed Relationship among Measures of Dispersion:In a normal distribution there is fixed relationship between Quartile Deviation, Mean
Deviation and Standard Deviation as follows;Quartile Deviation (Q.D) = 2/3 , Mean Deviation (M.D.) = 4/5 Thus, standard deviation is never less than Quartile Deviation and Mean Deviation.
6. Fixed Area Relationship for a symmetrical distribution:In a symmetrical distribution there is fixed area relationship as follows:Mean 1 covers 68.27% of the items,Mean 2 covers 95.45% of the items,Mean 3 covers 99.73% of the items,
7. Minimum sum of squares:The sum of squares of deviations of items in the series from their arithmetic mean isminimum. In other words, the sum of squares of the deviations of items of any series from avalue other than the arithmetic mean will always be greater than standard deviation.
8. Standard Deviation of n natural numbers:The standard deviation of n natural numbers can be computed by using the followingformula:
Standard Deviation of n natural numbers = 112
1 2 N
9. Combined Standard Deviation 12 - Combined Standard Deviation 12 of twogroups can be computed by using the following formula:
-
7/28/2019 Measures of Central Tendency Final
35/39
Quantitative Aptitude 2.35 Measures of Central Tendency
21
2
22
2
11
2
22
2
1112
NN
dNdNNN
Where, 12 = Combined Standard deviation of two groups (1, 2)
1= Standard Deviation of First group (1)
2 = Standard Deviation of Second group (2)
1N = No. of Items in First Group
2N = No. of Items in Second Group
1d = 121 XX
1222 XXd
1X Actual Mean of First Group
2X Actual Mean of Second Group
12X Actual Mean of Two Groups
Distinction between mean deviation and standard DeviationMean Deviation differs from Standard Deviation in the following respects:
Basis of Distinction Mean Deviation Standard deviation
1. Algebraic Signs Actual Signs are ignoredand all deviations are takenas positive
Actual signs are notignored
2. Use of Measure ofCentral Tendency
Mean Deviations can becomputed from arithmeticmean, median or mode
Standard Deviation iscomputed from arithmeticmean since the sum ofsquares of the deviation of
items from arithmetic meanis minimum
3. Formulan
dfDM
)(..
n
fxDS
2
).(.
Merits of standard deviation:1. It is rigidly defined and its value is always definite and based on all the observations
and the actual signs of deviations are used.2. As it is based on arithmetic mean, it has all the merits of arithmetic mean.3.
It is the most important and widely used measures of dispersion.4. It is possible for further algebraic treatment.
5. It is less affected by the fluctuations of sampling, and hence stable.6. Squaring the deviations make all of them positive; as such there is no need to ignore the
signs (as in mean deviation)7. It is the basis for measuring the coefficient of correlation sampling and statistical
inferences.8. The standard deviation provides the unit of measurement for the normal distribution.9. It can be used to calculate the combined standard deviation of two or more groups.10. The coefficient of variation is considered to be the most appropriate method for the
purpose of comparing the variability of two or more distributions, and this is based on menand standard deviation.
-
7/28/2019 Measures of Central Tendency Final
36/39
Quantitative Aptitude 2.36 Measures of Central Tendency
Limitations of Standard Deviation1. It cannot be used for comparing the variability of two or more series of observations
given in different units. A Coefficient of standard deviation is to be calculated for thispurpose.
2. It is difficult to compute as compared to other measures.3. It is very much affected by the extreme values. The process of squaring deviations gives
undue importance to large deviations from arithmetic mean which is obtained onlyfrom extreme items and it gives less importance to items which are nearer to mean.
4. The standard deviation cannot be computed for a distribution with open-end classes.5. As it is an absolute measure of variability, it cannot be used for the purpose of
comparison.6. It has not found favour with the economists and businessmen.Interpretation of coefficient of variation% of Coefficient of Variation Interpretation
I. Lower Group is moreconsistent/uniform/stable/homogeneous
II. Higher Group is lessconsistent/uniform/stable/homogeneous
VarianceMeaning of VarianceVariance is the arithmetic mean of the squares of deviations of all items of the distributionsform arithmetic mean. In other words, variance is the square of the standard deviation orstandard deviation is the square root of the variance.
Thus, Variance = 2
Or, = Variance
Interpretation of Variance
Value of Variance 2 InterpretationSmaller the value of 2 Lesser the variability or greater the
uniformity in the population
Larger the value of 2 Greater the variability or lesser theuniformity in the population
Lorenz CurveLorenz curve is a graphic method of studying dispersion. Basically, Lorenz curve is acumulative percentage curve in which the percentage of items is combined with thepercentage of other things as wealth, income, sales, profits etc., the steps involved in thecomposition of Lorenz curve are given below:
Practical steps involved in the construction of Lorenz curveStep 1 Calculate cumulative values of items of each of the given variables and convert
each of the cumulative values into percentage taking grand total as 100.
Step 2 Calculate cumulative frequencies of items of each of the variables and convert
-
7/28/2019 Measures of Central Tendency Final
37/39
Quantitative Aptitude 2.37 Measures of Central Tendency
each of the cumulative values into percentage taking grand total as 100.Step 3 Plot the percentages of cumulative frequencies on X-axis (scale 0 to 100)Step 4 Pot the percentages of cumulative values of the variables on Y-axis (Scale 0 to 100)Step 5 Draw a diagonal line joining 0 on X-axis with 100 on Y-axis. This is known as line
of equal distribution. Any point on this diagonal shows the same percentage on X
as on Y.Step 6 Plot the various points corresponding to X and Y and joint them. The line so
obtained, unless all items exactly equal, will always form a curve below the line ofequal distribution. The area between line of equal distribution and the plottedcurve gives the extent of inequality in the items. If curves of various distributionsare shown on the same Lorenz presentation, the curve that is farthest from thediagonal line represents greatest inequality.
LIST OF FORMULAEMeasure Type of Series Absolute Measure Relative Measure
. Range (i) Individual/Discrete(ii) Continuous
R = L S
R = UL - LS Coefficient of Range =SL
SL
Coefficient of Range =
SL
SL
UU
UU
. Inter-Quartile RangeInter-Quartile Range= 13 QQ
. Percentile RangePercentile Range = 1090 PP Coefficient of Percentile Range =
1090
1090
PP
PP
. Quartile Deviation Quartile Deviation orSemi-Inter Quartile Range =
2
13 QQ
Coefficient of Quartile Deviation =
13
13
QQ
QQ
. Mean Deviation (i) Individual Series
(ii) Discrete Series
(iii) Continuous Series
N
DDM
..
N
DfDM
..
N
DfDM ..
Coefficient of M.D. about
Mean/median/Mode
=ModeMedianMean //
n/ModeMean/MediaaboutM.D.
. Standard Deviation (i) Individual Series(a) Actual Mean
Method
(b) Assumed MeanMethod
N
x2
22
N
d
N
d
Coefficient of Variation
= 100MeanArithemtic
DeviationStandard
LIST OF FORMULAE
Measure Type of Series Absolute Measure Relative Measure
. Range (i) Individual/Discrete(ii) Continuous R = L SR = UL - LS Coefficient of Range =
Coefficient of Range =
. Inter-QuArtile RangeInter-Quartile Range=
SL
SL
SL
SL
UU
UU
13 QQ
-
7/28/2019 Measures of Central Tendency Final
38/39
Quantitative Aptitude 2.38 Measures of Central Tendency
. Percentile RangePercentile Range =
Coefficient of Percentile Range =
0. Quartile Deviation Quartile Deviation orSemi-Inter Quartile Range = Coefficient of Quartile Deviation =
1. Mean Deviation (i) Individual Series
(ii) Discrete Series
(iii) Continuous Series
Coefficient of M.D. about
Mean/median/Mode
=
2. Standard Deviation (i) Individual Series(a) Actual Mean
Method
(b) Assumed MeanMethod
Coefficient of Variation
=
(ii) Discrete Series(a) Actual Mean
Method
(b) Assumed MeanMethod
(c) Step-DeviationMethod
N
fx2
22
N
fd
N
fd
cN
fd
N
fd
22
(iii) Continuous Series(a) Actual Mean
Method
(b) Assumed MeanMethod
(c) Step-DeviationMethod
N
fx2
22
N
fd
N
fd
cN
fd
N
fd
22
3. VarianceVariance =
2
4. combined StandardDeviation
21
2
22
2
11
2
22
2
11
12nn
dndnnn
5. Standard Deviation ofany two Values
2
ItemSmallest-Itemlargest
1090 PP
1090
1090
PP
PP
2
13 QQ 13
13
QQ
QQ
N
DDM
..
N
DfDM
..
N
DfDM ..
ModeMedianMean //
n/ModeMean/MediaaboutM.D.
N
x2
22
N
d
N
d
100MeanArithemtic
DeviationStandard
-
7/28/2019 Measures of Central Tendency Final
39/39