data distributions. essential question: how can you use shape, center, and spread to characterize a...

30
Data Distributions

Upload: leslie-gardner

Post on 12-Jan-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Data Distributions

Page 2: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Essential Question:How can you use shape, center,

and spread to characterize a data distribution?

Page 3: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Data Distribution

• A set of numerical data that you can graph using a data display that involves a number line.

• Ex: line plot, histogram, or box plot. • The graph will reveal the shape of the

distribution.

Page 4: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Seeing the Shape of a Data DistributionBaby Birth Month Birth Weight

(kg)Mother’s

age

1 5 3.3 28

2 7 3.6 31

3 11 3.5 33

4 2 3.4 35

5 10 3.7 39

6 3 3.4 30

7 1 3.5 29

8 4 3.2 30

9 7 3.6 31

10 6 3.4 32

11 9 3.6 33

12 10 3.5 29

13 11 3.4 31

14 1 3.7 29

15 6 3.5 34

16 5 3.8 30

17 8 3.5 32

18 9 3.6 30

19 12 3.3 29

20 2 3.5 28

A. Make a line plot for the distribution of birth months.

B. Make a line plot for the distribution of birth weights.

C. Make a line plot for the distribution of mothers’ ages.

Page 5: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 1a.

• Describe the shape of the distribution of birth months.

Page 6: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 1b.

• Describe the shape of the distribution of birth weights.

Page 7: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 1c.

• Describe the shape of the distribution of mothers’ age.

Page 8: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Understanding Shape, Center, and Spread

• Data distributions can have various shapes. • These shapes have names in Statistics.

Page 9: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Uniform Distribution

• The shape is basically level.

• It looks like a rectangle.

Page 10: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Normal Distribution

• Mound in the middle with symmetric tails at each end.

• Looks bell shaped.

Page 11: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Skewed distribution

• Mounded by not symmetric because one “tail” is much longer than the other.

Page 12: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Other (Mixed)

Page 13: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Distribution Center and Spread

• Mean

Page 14: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Distribution Center and Spread

• Median

Page 15: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Distribution Center and Spread

• Standard Deviation

Page 16: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Distribution Center and Spread

• Interquartile Range (IQR) – tells you how spread out the middle or (50%) of data are.

Page 17: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 2a.

• Describe the shape of each distribution that you made in the Example, using the vocabulary you just learned.

Page 18: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 2b.

• When the center and the spread of a distribution are reported, they are generally given either as the mean and standard deviation or as the median and IQR.

• Why do these pairings make sense?

Page 19: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Relating Center and Spread to ShapeBaby Birth Month Birth Weight

(kg)Mothers’

age

1 5 3.3 28

2 7 3.6 31

3 11 3.5 33

4 2 3.4 35

5 10 3.7 39

6 3 3.4 30

7 1 3.5 29

8 4 3.2 30

9 7 3.6 31

10 6 3.4 32

11 9 3.6 33

12 10 3.5 29

13 11 3.4 31

14 1 3.7 29

15 6 3.5 34

16 5 3.8 30

17 8 3.5 32

18 9 3.6 30

19 12 3.3 29

20 2 3.5 28

Calculate the following for both birth weight and mothers’ ages.

A. MeanB. MedianC. Standard DeviationD. IQR

Page 20: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 3a.

• What do you notice about the mean and median for the symmetric distribution (baby weights) as compared with the mean and median for the skewed distribution (mothers’ ages)?

• Explain why this happens.

Page 21: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 3b.

• One way to compare the spread of two distributions is to find the ratio (expressed as a percent) of the standard deviation to the mean for each distribution. Another way to find the ratio (expressed as a percent) of the IQR to the median.

• Calculate these ratios, rounding each to the nearest percent if necessary, for the symmetric and the skewed distribution.

• What do you observe when you compare the corresponding ratios? Why does this make sense?

Page 22: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 3c.

• Which measures of center and spread would you report for the symmetric distribution?

• For the skewed distribution? • Explain your reasoning.

Page 23: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Making and Analyzing a Histogram

• We will use Excel to create a histogram using the data of baby weights.

3.2 3.3 3.4 3.5 3.6 3.7 3.80

1

2

3

4

5

6

7

Weight Frequency

Weight Frequency

Page 24: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 4a.

• By examining the histogram, determine the percent of the data that fall within 1 Standard Deviation (s=0.14) of the mean (). That is, determine the percent of the data in the interval or

• Explain your reasoning.

Page 25: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 4b.

• Suppose one of the baby weights is chosen at random. By examining the histogram, determine the probability that the weight is more than 1 standard deviation (s=0.14) above the mean (=3.5). That is, determine the probability that the weight is in the interval

• Explain your reasoning.

Page 26: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Making and Analyzing a Box Plot

Page 27: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 5a

• How does the box plot show the distribution is skewed right?

Page 28: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 5b.

• Suppose one of the mothers’ ages is chosen at random. Based on the box plot and not the original set of data, what can you say is the approximate probability that the age falls between the median, 30.5 and the third quartile, 32.5?

• Explain your reasoning.

Page 29: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Making and Analyzing a Box Plot with outliers

Page 30: Data Distributions. Essential Question: How can you use shape, center, and spread to characterize a data distribution?

Reflect 5c.

• A data value is considered to be an outlier if • Explain why a mother’s age of 39 is an outlier

for this data set. • Redraw the box plot using the option for

showing outliers. • How does the box plot change?