outline 1. histograms and boxplots 2. mean and standard deviation 3. proportions and bar charts 4....

17

Upload: laurel-perkins

Post on 26-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence
Page 2: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Outline

1. Histograms and boxplots2. Mean and standard deviation3. Proportions and bar charts4. Sampling and allocation5. Inference and confidence

intervals

6. t tests and alternatives7. ANOVA8. Regression and correlation9. More ANOVA and regression10. Categorical data analysis

Page 3: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Histograms and Boxplots

Learning outcomesStatisticaleseMaking histograms

- deciding type and bin width- the macro/micro distinction in graphing

Making boxplots - ranking and ordering data- learning the 5-point summary

Page 4: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Statisticalese

I will probably have a bagel today.Probability of having a bagel > 50%

It takes about 20 minutes to cook rice.The central tendency (more on what this means throughout the course) for cooking rice is 20 minutes.

Statisticalese takes English phrases that include numerical information and uncertainity and translates them (often making them more precise).

Page 5: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Today's data set: DNA exonerations

• Hundreds of people found guilty of crimes, who spent time in prison, and later exonerated by DNA evidence.

• http://www.innocenceproject.org/• http://www.fiu.edu/~dwright/steps/dnaphotos.pptx

Page 6: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Casenoi firstni lastni statei year1i year2i timei

1 Gary Dotson Illinois 1979 1989 10

2 David Vasquez Virginia 1985 1989 4 3 Edward Green DC. 1989 1990 1 : : : : : : :162 Leo Waters N. Carolina 1981 2005 24163 George Rodriquez Texas 1987 2005 18

This is what a data file looks like in most statistics packages

Focus is on the timei variable for years in prison. The subscripts show the values vary.

Page 7: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Frequency Table

Page 8: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Histogram: With dots

0 10 20 30Years in prison

Freq

uenc

y

0 10 20 30Years in prison

0

1

0

20

Page 9: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Stem and leaf diagram

Page 10: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Deciding bin width

Page 11: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Name histogram

Page 12: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

5 point summary

values: 2 5 8 3 8 7 2 2 12 sorted: 2 2 2 3 4 7 8 8 12ranks: 1 2 3 4 5 6 7 8 9

values: 2 2 2 3 4 7 8 8 12ranks: 1 2 3 4 5 6 7 8 9

↑ ↑ ↑ ↑ ↑minimum first

quartilemedian third

quartilemaximum

Page 13: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence
Page 14: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Median when n is even: the mid-rank

Page 15: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Boxplots (Box and Whiskers)

0 10 20 30Years in Prison

0 10 20 30Years in Prison

Page 16: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Comparing histograms and boxplots

Page 17: Outline 1. Histograms and boxplots 2. Mean and standard deviation 3. Proportions and bar charts 4. Sampling and allocation 5. Inference and confidence

Summary

• Statisticalese. A language for numbers and chance.• Histograms. Decide bin width.• Boxplot. Shows outliers well.

• Graphs. Make clear. Avoid adding frills.