representation of data note

Upload: rachel-saw

Post on 07-Apr-2018

228 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/6/2019 Representation of Data Note

    1/59

    epresentation ofdataVariables,ode median and mean,ean Variance and standard deviation

    Coding- -tem and leaf diagrams

    rouped dataHistogram

    uartileOutlierox plotsSkewness

  • 8/6/2019 Representation of Data Note

    2/59

    ,ariable X, ,or example shoe size weight and

    .ationality are all variables

    ,ariable X easurement , xihoe size , . , , .6 5 7 7 5Weight . , .21 kg 7 9 kg

    Nationality , ,ritish FrenchGerman

  • 8/6/2019 Representation of Data Note

    3/59

    Quantitatvevariable

    iscretevariable

    ContinuosvariableQualitatievariable

    Variableas numerical values

    as no numerical values

    ntegers only

    eal numbers

  • 8/6/2019 Representation of Data Note

    4/59

    From a sample of Jan 2010 group of A Level students, the followingdata are collected with height in cm ( x ) and weight in kg ( y ).

    ID Home town Sex x , Height in cm y , Weight in kg1 Sarikei M 168.8 48.0

    2 Sibu M 175.9 79.5

    3 Kuching F 155.5 49.0

    4 Kuching M 171.7 70.1

    5 Kuching M 188.2 90.2

    6 Kuching F 164.5 54.5

    7 KK F 156.0 60.5

    8 Kuching M 173.0 79.5

    9 Sarikei M 161.8 45.5

    10 Kuching M 175.9 56.8

  • 8/6/2019 Representation of Data Note

    5/59

    alitative data

  • 8/6/2019 Representation of Data Note

    6/59

    frequency

    Hometown

    6

    2

    1

    %0

    %0%0 %0

    ualitative data

  • 8/6/2019 Representation of Data Note

    7/59

    alitative data

  • 8/6/2019 Representation of Data Note

    8/59

    Height

    168.8

    175.9

    155.5

    171.7

    188.2

    164.5

    156

    173

    161.8

    175.9

    Sum = 1691.3

    Mean = 169.13

    ,antitative datantinuous variable

  • 8/6/2019 Representation of Data Note

    9/59

    easure oflocationMode : s the value that occursost often .Median : s the iddle value henhe data is

    .rranged in ascending orderMean : n

    xx =

    o find medianf n is odd , ake.aluef n is even , verage .alue and the following value

    ( )121 +n

    n2

    1

  • 8/6/2019 Representation of Data Note

    10/59

    .xample 1

    ,ind the mode median and mean) , , , , , , ,7 9 6 8 26 3 3

    rranged in ascending order fir

  • 8/6/2019 Representation of Data Note

    11/59

    .xample 2

    ( )he monthly salary RM for the executives of:ompany ABZ are, ,000 2200 8000

    ) .ind the mean and median) .omment on the mean and median

  • 8/6/2019 Representation of Data Note

    12/59

    Coding ( )a

    n

    ax

    x +

    =

    .xample 1: , ,ata 102 104106 104=x

    ( ) 12100 = x 1041003

    12=+=x

  • 8/6/2019 Representation of Data Note

    13/59

    .xample 2= , ( ) = .iven n 80 x 160 240 find.he mean

  • 8/6/2019 Representation of Data Note

    14/59

    - -tem and leafdiagrams- nable the shape of the distribution.o be revealed- how all the data- .ll raw data values are not lost

  • 8/6/2019 Representation of Data Note

    15/59

    .xample 1raw a stem and leaf diagram:ata 60 51 53 42 51 652 50

    .ind the median

  • 8/6/2019 Representation of Data Note

    16/59

    4 5 1 1 36 2 5

    | 2 means 42

    stem leaf

    =edian 52

  • 8/6/2019 Representation of Data Note

    17/59

    .xample 2raw a stem and leaf diagram:ata. . . . . . .7 3 6 3 8 4 7 4 1 2 2 3 6. . .0 4 4 5 0

    . . . . . . .7 4 6 4 8 3 7 3 2 2 5 3 6. . .5 4 7 5 2

    . . . . . . .7 4 2 3 8 5 1 1 4 2 1 3 5. . .2 2 4 5 1.ind the mode and median

  • 8/6/2019 Representation of Data Note

    18/59

    .xample 3raw a stem and leaf diagram:ata 360 351 353 342356

    .ind the median

  • 8/6/2019 Representation of Data Note

    19/59

    - -ack to back stem and leafdiagrams .xample 4 There are 24 children in class A .ach child is given the same problem to solvehe time each child takes to solve it is:ecorded in minutes

    . . . . . . . .4 8 2 6 1 9 3 7 4 8 5 7 2 6 8

    . . . . . . . .4 7 7 9 6 8 8 8 9 7 2 7 3 7 0

    . . . . . . . .9 8 9 7 6 9 3 7 4 7 9 9 1 5 7he 27 children in class B are also given theame problem and their time for solving it:re

    . . . . . . . .2 7 3 6 9 5 5 6 1 9 3 8 1 7 2.7

    . . . . . . . .7 7 6 8 4 8 8 5 8 6 9 8 2 9 5.1

    . . . . . . . .4 6 4 7 7 5 2 7 1 6 6 5 9 6 7

  • 8/6/2019 Representation of Data Note

    20/59

    rouped data, , -lasses Class boundaries Mid,oint Class width

    Length of wire( cm )

    Frequency, f

    10 x 14 2

    15 x 19 6

    20 x 24 425 x 29 3

    ype1 rouped frequencydistribution

    014 59 04 59

    0 50 59

  • 8/6/2019 Representation of Data Note

    21/59

    . 4 5 15.9 19 5

    or class 15 19lass width

    owerlassboundary pperlassboundary

    ower classlimit pper classlimit

    -idpoint17

    =lass width 5

  • 8/6/2019 Representation of Data Note

    22/59

    Length of wire (cm )

    Frequency, f

    10 x < 16 3

    16 x < 22 7

    22 x < 34 5

    34 x < 40 3

    ype2

    Length of wire( cm )

    Frequency, f

    10 < x 16 4

    16 < x 22 6

    22 < x 34 6

    34 < x 40 2

    ype3

  • 8/6/2019 Representation of Data Note

    23/59

    622

    or class 16 22lass width

    owerlassboundary pperlassboundary-idpoint19

    =lass width 6

  • 8/6/2019 Representation of Data Note

    24/59

    Histogram- .o space between the bars- ives a good picture of how data.re distributed- rea is proportional to the.requency

    :isadvantage Raw data values are.ost .xample 1 Give a reason to support the use.f a histogram to represent these dataecause the data is .ontinuous variable

  • 8/6/2019 Representation of Data Note

    25/59

    =requency density Frequencylass width=rea k frequency

    =ormally we use k 1o =rea frequency. , ,ut k can be 1 5 2 3, . . .

  • 8/6/2019 Representation of Data Note

    26/59

    Time(minutes )

    Frequency,f

    Class width Frequencydensity

    1 10 2

    11 20 521 30 6

    31 40 4

    41 50 3

    .xample 1random sample of 20 students was asked howong it took them to complete their. ) .omework a Draw a histogram

    Class Boundaries Frequency

  • 8/6/2019 Representation of Data Note

    27/59

    0.1.2.3.4

    10 20 30 40 50

    Class Boundaries Frequencydensity1 10 0.5 10.5 0.2

    11 20 10.5 20.5 0.5

    21 30 20.5 30.5 0.6

    31 40 30.5 40.5 0.4

    41 50 40.5 50.5 0.3.5.6

    ime in minutes

    equency density

    0

  • 8/6/2019 Representation of Data Note

    28/59

    ) From the histogram estimate how many .tudents recorded times were between 20 5 and. .0 5 minutes) What is the modal class?) What is the probability of the students.aking more than 30 5 minutes to complete.heir homework

  • 8/6/2019 Representation of Data Note

    29/59

    umulative frequency graphshe umulative frequencies re plotted

    gainst the pper class boundaries .

  • 8/6/2019 Representation of Data Note

    30/59

    Time (minutes)

    Frequency, f Upper classboundaries

    Cumulativefrequencies

    0.5 0

    1 10 2 10.5 2

    11 20 5 20.5 7

    21 30 6 30.5 1331 40 4 40.5 17

    41 50 3 50.5 20

    .xample 1random sample of 20 students was asked how .ong it took them to complete their homework.raw a cumulative frequency graph Estimate the.edian What is the probability of studentsaking more than 35 minutes to complete their.omework

    C l ti

  • 8/6/2019 Representation of Data Note

    31/59

    10 20 30 40 50ime in minutes

    0

    510

    1520

    Cumulativefrequencies

    **

    **

    **

    = .edian 25 5

    = / = .robability 5 20 0 25

  • 8/6/2019 Representation of Data Note

    32/59

    Range = ighest valueowest value.xample 1 Find the.ange

    , , , , ,15 26 34 4653

    = ange 53 5= 48

  • 8/6/2019 Representation of Data Note

    33/59

    Quartiles,ower quartile Q1 :%5

    ,edian Q2 :%0,pper quartile Q3 :%5

    Q1 Q2 Q3%00

    %0

    %25

  • 8/6/2019 Representation of Data Note

    34/59

    nterquartilerange=QR Q3 Q1

  • 8/6/2019 Representation of Data Note

    35/59

    inding the quartiles- .irst arrange the data in ascending order

    (ase 1 even )umber of data- plit the data into upper half and lower.alf- .pply the median rule(ase 2 od )number of data- ind the median and delete t from the.ist- plit the data into upper half and lower.alf

    - .pply the median rule

  • 8/6/2019 Representation of Data Note

    36/59

    .xample 1ind Q1 , 2 , 3 nd IQR. :ata 25 35 30 24 324 29

  • 8/6/2019 Representation of Data Note

    37/59

    Smallesvalue largesvalueQ1 Q2 Q3

    ( - -ox plots Box and)hisker plots

  • 8/6/2019 Representation of Data Note

    38/59

    .xample 1= ,iven smallest value 24 Q 1 = ,5 Q2 = ,0 Q 3 = , = .4 largest value 35.lot the box plot

    20 25 30 35 x

  • 8/6/2019 Representation of Data Note

    39/59

    .xample 2ind Q1 , 2 , 3 .nd IQR Plot

    .he box plot:ata 24 25 29 30 32 346 38

  • 8/6/2019 Representation of Data Note

    40/59

    Outliern outlier is an extreme value that

    ies outside the overall pattern of the.ata, >pper fence Q3 + .5IQR,

  • 8/6/2019 Representation of Data Note

    41/59

    1 ( )2 2 4 5 ( )3 5 6 6 6 7 7 88 ( ) 0 1 2 2 4 5 6 7 7 7 8 ( )2 0 1 1 2( )

    .| 1 means 2 1

    .xample 3ind Q1 , 2 , 3 .nd IQR Identify the.utliers

  • 8/6/2019 Representation of Data Note

    42/59

    10 20 30 40 50ime in minutes

    05

    10

    1520

    Cumulativefrequencies

    **

    **

    **

    2 = .5 5

    .xample 4 Find Q1 , 2 ,3 .nd IQR

    1 3

  • 8/6/2019 Representation of Data Note

    43/59

    ,ean variance and standarddeviation

    n

    xx =

    2

    2)(

    efiniti:n Variancen

    xx

    =ean

    tandard =eviation 2

    2

    2

    2 )(xn

    x=

  • 8/6/2019 Representation of Data Note

    44/59

    .xample 1

    ,ind the mean variance and standardeviation of, , ,5 9 11

  • 8/6/2019 Representation of Data Note

    45/59

    Coding ( )a

    n

    ax

    x +

    =

    ( ) ( )22

    2

    =

    n

    ax

    n

    ax

    = x a

    y

  • 8/6/2019 Representation of Data Note

    46/59

    .xample 1= , ( ) =iven n 80 x 160 240 and

    ( )160 2 = .720 .ind the mean and standard deviation

  • 8/6/2019 Representation of Data Note

    47/59

    ,ean variance and standarddeviationor rouped frequency

    distributionn

    fxx

    =ean

    tandard =eviation 2

    2

    2

    2 )(xn

    fx=

    = fn

  • 8/6/2019 Representation of Data Note

    48/59

    .xample 1,he following table summarizes the distance,o the nearest mile traveled to work by a.andom sample of commuters Estimate the mean.nd standard deviation

    Distances,

    miles

    Number of

    commuters, f

    0 - 9 15

    10 - 19 38

    20 - 29 2230 - 39 15

    40 - 49 8

    50 - 59 2

    Total 100

  • 8/6/2019 Representation of Data Note

    49/59

    .xample 2company runs two manufacturing lines A and B.hich make rods 2 cm in diameter Random

    .amples are taken and summarized belowDiameter( cm )

    Standarddeviation

    A 2 0.015

    B 2 0.05

    he company wishes to close one of the.ines down State which one and give a.eason

  • 8/6/2019 Representation of Data Note

    50/59

    .xample 3, ,he table summarizes the mark out of 75 for.wo different examinations

    Mean mark Standarddeviation

    Statistics 55 16

    Mechanics 55 4

    tate which paper is better for you to set.air grade boundaries

  • 8/6/2019 Representation of Data Note

    51/59

    hape of a datasetSkewness

  • 8/6/2019 Representation of Data Note

    52/59

    ositive skew

    +ail in the ve xdirectionodeMedian mean

    : <

  • 8/6/2019 Representation of Data Note

    53/59

    egative skew

    -ail in the vedirection

    modemedianmean

    : > >egative skew mode median mean

  • 8/6/2019 Representation of Data Note

    54/59

    ymmetrical distribution

    = =ode mean median

  • 8/6/2019 Representation of Data Note

    55/59

    .xample 1= , = ,iven mean 52 mode 56 median

    = , .4 comment on the skewness

  • 8/6/2019 Representation of Data Note

    56/59

    :ymmetrical Q2 Q1 = Q3 Q2 :ositive skew Q2 Q

    1 < Q3 Q2 :egative skew Q2 Q1 > Q3 Q2 Q1 Q2

    Q3

  • 8/6/2019 Representation of Data Note

    57/59

    .xample 1iven Q1 = ,4 Q2 = ,1 Q3 = ,3.omment on the skewness

  • 8/6/2019 Representation of Data Note

    58/59

    kewness= tandarddeviation( )mean median

  • 8/6/2019 Representation of Data Note

    59/59

    .xample 1= , =iven mean 46 standard deviation 17= , .nd median 53 comment on the skewness