ma-250 probability and statistics nazar khan pucit lecture 3
TRANSCRIPT
![Page 1: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/1.jpg)
MA-250 Probability and Statistics
Nazar KhanPUCIT
Lecture 3
![Page 2: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/2.jpg)
Average and Standard Deviation
• A histogram tries to summarize large amounts of data.
• An even more drastic summary can be given by the histogram’s– Center– Spread
![Page 3: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/3.jpg)
Average and Spread
![Page 4: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/4.jpg)
But not always…
![Page 5: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/5.jpg)
Average balances the histogram
Average
Average
Average
![Page 6: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/6.jpg)
Average balances the histogram
![Page 7: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/7.jpg)
Median
• Median of a histogram is the value with half the area to the left and half to the right.
![Page 8: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/8.jpg)
Median
Lies in the middle
Balances both sides
Median of a list is the value from which half or more values are larger and half or more are smaller.
![Page 9: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/9.jpg)
Median
• Compute median of– 2,6,8– 4,8,9,13– 1,2,2,7,8– 8,-3,5,0,1,4,-1– 800,-3,5,0,1,4,-1
![Page 10: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/10.jpg)
Average vs. Median
• Which estimate is better when data contains outliers?– Median since it is not
affected by outliers.
List 1 List 21 12 23 34 45 56 67 78 89 9
10 100Average 5.5 14.5Median 5.5 5.5
Outlier
![Page 11: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/11.jpg)
Measuring Spread – Standard Deviation
• It is usually quite helpful to see how a list of numbers spreads around the average value.
• This is measured by the standard deviation (SD).
• SD = r.m.s deviation from average• Compute SD of 20,10,10,15– Compute average– Compute deviations from average– Compute r.m.s of deviations
![Page 12: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/12.jpg)
Magic of Standard DeviationThe 68-95-99 Rule
![Page 13: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/13.jpg)
The 68-95-99 Rule
![Page 14: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/14.jpg)
The 68-95-99 Rule
![Page 15: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/15.jpg)
Not Always …
![Page 16: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/16.jpg)
Summary
• Usually a list of numbers can be well-summarized by its average and standard deviation
• Center of histogram– Average – balances the histogram– Median – divides histogram areas into half
• Standard deviation measures spread around the average• Usually
– 68% data lies within 1 SD of the average– 95% data lies within 2 SD of the average– 99% data lies within 3 SD of the average
![Page 17: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/17.jpg)
The Normal Curve
• An approximation to data distribution that is normally quite accurate– Normally data follows such a distribution
![Page 18: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/18.jpg)
The Normal Curve
![Page 19: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/19.jpg)
Standard Units
• Express the data in terms of standard deviation
• Converting a value X to standard units– (X-average)/SD
![Page 20: MA-250 Probability and Statistics Nazar Khan PUCIT Lecture 3](https://reader033.vdocument.in/reader033/viewer/2022052509/56649cea5503460f949b520c/html5/thumbnails/20.jpg)
Histogram to Standard Units