chapter 2 (summarizing data)_student

Upload: le-thi-thu-trang

Post on 05-Apr-2018

243 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    1/67

    Chapter 2

    Summarizing data

    Summarizing Qualitative Data

    Summarizing Quantitative Data

    Part A

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    2/67

    Why summarizing data?

    Survey,investigation

    Rawdata

    Disadvantage?

    How to extract some information from

    these data?

    - arranging

    - putting them intoorder

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    3/67

    I. Summarizing Qualitative Data

    Frequency Distribution

    Relative Frequency Distribution

    Percent Frequency Distribution Bar Graph

    Pie Chart

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    4/67

    A frequency distribution is

    The objective is

    Frequency Distribution

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    5/67

    Example: Marada Inn

    Guests staying at Marada Inn wereasked to rate the quality of their

    accommodations as being excellent,

    above average, average, below average, or

    poor. The ratings provided by a sample of 20 guests are:

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    6/67

    Example: Marada Inn

    Below Average

    Above Average

    Above Average

    Average

    Above Average

    Average

    Above Average

    Average

    Above Average

    Below Average

    Poor

    Excellent

    Above Average

    Average

    Above Average

    Above Average

    Below Average

    Poor

    Above Average

    Average

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    7/67

    Why Use Frequency Distributions?

    A frequency distribution is a way tosummarize data

    The distribution condenses the raw datainto a more useful form...

    and allows for a quick visual interpretationof the data

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    8/67

    The relative frequency of a class is

    A relative frequency distribution is

    Relative Frequency Distribution

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    9/67

    Relative Frequency Distribution

    Ratings Frequency Relative frequency

    Poor

    Below Average

    Average

    Above AverageExcellent

    Total

    2

    3

    5

    91

    20

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    10/67

    Bar Graph

    A graphical device for depicting qualitative data.

    On one axis (usually the horizontal axis), we specifythe label or the name of each of the class.

    On the other axis (usually the vertical axis), we specifythe frequency, relative frequency of each class

    A bar of fixed width is drawn above each class

    label, we extend the height appropriately. The bars are separated to emphasize the fact that each

    class is a separate category.

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    11/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    12/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    13/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    14/67

    Insights Gained from the Preceding Pie Chart

    Example: Marada Inn

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    15/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    16/67

    1. Simple frequency distribution

    Simple frequencydistribution consists of alist of data values, eachshowing the number of

    items having that value(the frequency).

    Data values Frequency

    0 10

    1 12

    2 9

    3 8

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    17/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    18/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    19/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    20/67

    Frequency distribution table

    Number of childrenin family

    Number of workers

    0

    1

    2

    3

    4

    5

    6

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    21/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    22/67

    2. Grouped frequency distributions

    Used when the data set contains a large numberof data values.

    A grouped frequency distribution summaries

    data into groups of values, each showing thenumber of items having values in the group.

    Each group of data value called class

    Used for both continuous data and discrete data

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    23/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    24/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    25/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    26/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    27/67

    Example: Hudson Auto Repair

    The manager of Hudson Auto

    would like to have a better

    understanding of the cost

    of parts used in the engine

    tune-ups performed in theshop. She examines 50

    customer invoices for tune-ups. The costs of parts,

    rounded to the nearest dollar, are listed on the next

    slide.

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    28/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    29/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    30/67

    Example

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    31/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    32/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    33/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    34/67

    Dot Plot

    Tune-up Parts Cost

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    35/67

    Hi

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    36/67

    Histogram

    Tune-up Parts Cost

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    37/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    38/67

    Cumulative Frequency Distribution

    Part costs ($) Frequency Cumulativefrequency

    Relativecumulativefrequency

    Hudson Auto Repair

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    39/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    40/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    41/67

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    42/67

    Chapter 2

    Summarizing data

    Exploratory Data Analysis

    Crosstabulation and Scatter Diagrams

    Part B

    x

    y

    E l t D t A l i

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    43/67

    Exploratory Data Analysis

    The techniques of exploratory data analysis consist of

    simple arithmetic and easy-to-draw pictures that canbe used to summarize data quickly.

    One such technique is the stem-and-leaf display.

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    44/67

    Stem-and-Leaf Display

    E l H d A t R i

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    45/67

    Example: Hudson Auto Repair

    The manager of Hudson Auto

    would like to have a better

    understanding of the cost

    of parts used in the engine

    tune-ups performed in the

    shop. She examines 50

    customer invoices for tune-ups. The costs of parts,

    rounded to the nearest dollar, are listed on the next

    slide.

    E l H d A t R i

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    46/67

    Example: Hudson Auto Repair

    Sample of Parts Cost for 50 Tune-ups

    91 78 93 57 75 52 99 80 97 62

    71 69 72 89 66 75 79 75 72 76

    104 74 62 68 97 105 77 65 80 109

    85 97 88 68 83 68 71 69 67 74

    62 82 98 101 79 105 79 69 62 73

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    47/67

    St etched Stem and Leaf Displa

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    48/67

    Stretched Stem-and-Leaf Display

    St d L f Di l

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    49/67

    Stem-and-Leaf Display

    Leaf Units

    Where the leaf unit is not shown, it is assumedto equal 1.

    Leaf units may be 100, 10, 1, 0.1, and so on.

    In the preceding example, the leaf unit was 1.

    A single digit is used to define each leaf.

    Example: Leaf Unit 0 1

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    50/67

    Example: Leaf Unit = 0.1

    If we have data with values such as8.6 11.7 9.4 9.1 10.2 11.0 8.8

    a stem-and-leaf display of these data will be

    Example: Leaf Unit 10

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    51/67

    Example: Leaf Unit = 10

    If we have data with values such as

    1806 1717 1974 1791 1682 1910 1838

    a stem-and-leaf display of these data will be

    Crosstabulations and Scatter

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    52/67

    Crosstabulations and ScatterDiagrams

    Crosstabulation and a scatter diagram are twomethods for summarizing the data for two (or more)variables simultaneously.

    Often a manager is interested in tabular andgraphical methods that will help understand the

    relationship between two variables.

    So far we have focused on methods that are usedto summarize the data for one variable at a time.

    Crosstabulation

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    53/67

    Crosstabulation

    Crosstabulation can be used when:

    A crosstabulation is

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    54/67

    Cross-tabulation

    Example: Finger Lakes Homes

    The number of Finger Lakes homes sold for each

    style and price for the past two years is shownbelow

    Crosstabulation

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    55/67

    Crosstabulation

    Insights Gained from Preceding

    Crosstabulation

    Crosstabulation

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    56/67

    PriceRange Colonial Log Split A-Frame Total

    < $99,000

    > $99,000

    18 6 19 12 55

    45

    30 20 35 15Total 100

    12 14 16 3

    Home Style

    Crosstabulation

    Frequency distributionfor the price variable

    Frequency distributionfor the home style variable

    Cross-tabulation: Row or Column

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    57/67

    Percentages

    Converting the entries in the table into row

    percentages or column percentages canprovide additional insight about therelationship between the two variables.

    Crosstabulation: Row Percentages

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    58/67

    Crosstabulation: Row Percentages

    C t b l ti C l P t

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    59/67

    Crosstabulation: Column Percentages

    Scatter Diagram and Trendline

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    60/67

    The general pattern of the plotted points suggests theoverall relationship between the variables.

    One variable is shown on the horizontal axis and theother variable is shown on the vertical axis.

    A scatter diagram is a graphical presentation of the

    relationship between two quantitative variables.

    Scatter Diagram and Trendline

    A trendline is an approximation of the relationship.

    Scatter Diagram

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    61/67

    Scatter Diagram

    A Positive Relationship

    x

    y

    Scatter Diagram

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    62/67

    Scatter Diagram

    A Negative Relationship

    x

    y

    Scatter Diagram

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    63/67

    Scatter Diagram

    No Apparent Relationship

    x

    y

    Example: Panthers Football Team

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    64/67

    Example: Panthers Football Team

    Scatter Diagram

    The Panthers football team is interestedin investigating the relationship, if any,

    between interceptions made and points scored.

    1

    3

    21

    3

    14

    24

    1817

    30

    x= Number ofInterceptions

    y= Number ofPoints Scored

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    65/67

    Scatter Diagram

    Example: Panthers Football Team

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    66/67

    Insights Gained from the Preceding Scatter Diagram

    Example: Panthers Football Team

    Tabular and Graphical Procedures

  • 7/31/2019 Chapter 2 (Summarizing Data)_student

    67/67

    Tabular and Graphical Procedures

    Qualitative Data Quantitative Data

    TabularMethods

    TabularMethods

    GraphicalMethods

    GraphicalMethods

    FrequencyDistribution

    Rel. Freq. Dist.Crosstabulation

    Bar GraphPie Chart

    FrequencyDistribution

    Rel. Freq. Dist.Cum. Freq. Dist.

    Cum. Rel. Freq.DistributionStem-and-Leaf

    DisplayCrosstabulation

    Dot PlotHistogramOgiveScatter

    Diagram

    Data