the mean for quantitative data is obtained by dividing the sum of all values by the number of values...
TRANSCRIPT
![Page 1: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/1.jpg)
• The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set.
n
xx
![Page 2: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/2.jpg)
The following are the ages of all eight employees of a small company:
53 32 61 27 39 44 49 57
Find the mean age of these employees.
![Page 3: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/3.jpg)
years 25.458
362
n
xx
![Page 4: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/4.jpg)
Thus, the mean age of all eight employees of this company is 45.25 years, or 45 years and 3 months.
![Page 5: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/5.jpg)
The median is the value of the middle term in a data set that has been ranked in increasing order.
The calculation of the median consists of the following two steps:1.Sort/Arrange the data set in increasing order
2.Find the middle term in a data set with n values. The value of this term is the median.
![Page 6: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/6.jpg)
The following data give the weight lost (in pounds) by a sample of five members of a health club at the end of two months of membership:
10 5 19 8 3
Find the median.
![Page 7: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/7.jpg)
First, we rank the given data in increasing order as follows:
3 5 8 10 19
Therefore, the median is 3 5 8 10 19
The median weight loss for this sample of five members of this health club is 8 pounds.
![Page 8: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/8.jpg)
The median gives the “center” with half the data values to the left of the median and half to the right of the median. The advantage of using the median as a measure of central tendency is that it is less influenced by outliers & skewness. Consequently, the median is preferred over the mean as a measure of central tendency for data sets that contain outliers and/or skewness.
![Page 9: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/9.jpg)
The mode is the value that occurs with the highest frequency in a data set.
![Page 10: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/10.jpg)
![Page 11: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/11.jpg)
![Page 12: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/12.jpg)
![Page 13: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/13.jpg)
• Range = Largest value – Smallest Value
![Page 14: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/14.jpg)
• The range, like the mean has the disadvantage of being influenced by outliers.
• Its calculation is based on two values only: the largest and the smallest.
![Page 15: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/15.jpg)
• The standard deviation is the most used measure of dispersion.
• The value of the standard deviation tells how closely the values of a data set are clustered around the mean.
![Page 16: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/16.jpg)
x deviation
82
95
67
92
82 – 84 = -2
95 – 84 = +11
67 – 84 = -17
92 – 84 = +8
∑(deviation) = 0
![Page 17: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/17.jpg)
• standard deviation stdev =
sqrt (sum squared deviations divided by n-1)
Example :: sqrt[(4 + 121 + 289 + 64)/3]
sqrt(478/3) = sqrt(159.3) = 12.62
![Page 18: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/18.jpg)
• A numerical measure such as the mean, median, mode, range, variance, or standard deviation calculated for a population data set is called a population parameter, or simply a parameter.
• A summary measure calculated for a sample data set is called a sample statistic, or simply a statistic.
![Page 19: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/19.jpg)
For a bell shaped distribution approximately1. 68% of the observations lie within
one standard deviation of the mean
2. 95% of the observations lie within two standard deviations of the mean
3. 99.7% of the observations lie within three standard deviations of the mean
![Page 20: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/20.jpg)
The age distribution of a sample of 5000 persons is bell-shaped with a mean of 40 years and a standard deviation of 12 years. Determine the approximate percentage of people who are 16 to 64 years old.
![Page 21: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/21.jpg)
Quartiles are three summary measures that divide a ranked data set into four equal parts. The second quartile is the same as the median of a data set. The first quartile is the value of the middle term among the observations in the lower half, and the third quartile is the value of the middle term among the observations in the upper half.
![Page 22: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/22.jpg)
25% 25% 25% 25%
Each of these portions contains 25% of the observations of a data set arranged in increasing order
Q1 Q2
![Page 23: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/23.jpg)
Calculating Interquartile Range The difference between the third and first
quartiles gives the interquartile range; that is,
IQR = Interquartile range = Q3 – Q1
![Page 24: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/24.jpg)
The following are the ages of nine employees of an insurance company:
47 28 39 51 33 37 59 24 33
a)Find the values of the three quartiles. Where does the age of 28 fall in relation to the ages of the employees?
b)Find the interquartile range.
![Page 25: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/25.jpg)
The following data are the incomes (in thousands of dollars) for a sample of 12 households. 35 29 44 72 34 64 41 50 54 104 39 58
Construct a box-and-whisker plot for these data.
![Page 26: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/26.jpg)
Step 1.
29 34 35 39 41 44 50 54 58 64 72 104
Median = (44 + 50) / 2 = 47 Q1 = (35 + 39) / 2 = 37 Q3 = (58 + 64) / 2 = 61 IQR = Q3 – Q1 = 61 – 37 = 24
![Page 27: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/27.jpg)
Step 2.
1.5 x IQR = 1.5 x 24 = 36 Lower inner fence = Q1 – 36 = 37 – 36 = 1
Upper inner fence = Q3 + 36 = 61 + 36 = 97
![Page 28: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/28.jpg)
Step 3.
Smallest value within the two inner fences = 29 Largest value within the two inner fences = 72
![Page 29: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/29.jpg)
25 35 45 55 65 75 85 95 105Income
First quartile
Third quartile
Median
![Page 30: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/30.jpg)
25 35 45 55 65 75 85 95 105
An outlier
First quartile
Median
Third quartile
Smallest value within the two inner
fences
Largest value within two inner
fences
Income
![Page 31: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/31.jpg)
What Can Go Wrong?
• Do a reality check—don’t let technology do your thinking for you.
• Don’t forget to sort the values before finding the median … quartiles.
• Don’t compute numerical summaries of a categorical variable.
• Watch out for multiple peaks—multiple peaks might indicate multiple groups in your data.
![Page 32: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/32.jpg)
What Can Go Wrong?
• Be aware of slightly different methods—different statistics packages and calculators may give you different answers for the same data.
• Beware of outliers.• Make a picture (make a picture, make a picture).• Be careful when comparing groups
that have very different spreads.
![Page 33: The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set](https://reader031.vdocument.in/reader031/viewer/2022020800/56649ce25503460f949adee6/html5/thumbnails/33.jpg)
So What Do We Know?
• We describe distributions in terms of L.O.S.S.
• For symmetric distributions, it’s safe to use the mean and standard deviation; for skewed distributions, it’s better to use the median and interquartile range.
• Always make a picture—don’t make judgments about which measures of center and spread to use by just looking at the data.