Chapter 2: Density Curves Chapter 2: Density Curves and Normal Distributionsand Normal Distributions
Density CurvesDensity Curves
What is the probability of scoring in one of these regions?What do all of the areas of the rectangles add up to be?What do all of the probabilities add up to be?
Density CurvesDensity CurvesImpose a curve over a relative
frequency histogram:
Density CurvesDensity CurvesThe area of the curve should
approximate the area of the histogram.
Both areas should = 1
Density Curves:Density Curves:Properties:
◦Is always on or above the horizontal axis.
◦Has area exactly equal to 1.◦Describes the overall pattern of the
distribution.◦The area under the curve and above
any range of values is the proportion of observations that fall in that range.
Density Curves:Density Curves:Come in many shapes just like
their counterparts histograms:◦Normal Curves: symmetric with the
median and mean at the highest point of the graph.
◦Skewed Left: mean is to the left of the median.
◦Skewed Right: mean is to the right of the median.
Density CurveDensity CurveAn approximately normal curve:
Density CurvesDensity CurvesA skewed left density curve and
skewed right density curve:
Skewed Right Density Curve Skewed Left Density Curve
Density CurvesDensity CurvesThe median is the equal areas point,
the point with half of the area under the curve to its left and the remaining half of the area to its right.
The quartiles do the same thing as the median but splits the data into ¼ s.
The mean is the point at which the curve would balance if it were made of solid material.
Density CurvesDensity CurvesSymmetric curves: the mean and
the median are the same for a symmetric curve.
The mean of a skewed curve is pulled away from the median in the direction of the long tail.
Skewed right – mean to the right of the median.
Skewed left – mean to the left of the median.
Density Curves – Normal Density Curves – Normal DistributionsDistributionsSymmetric, bell-shaped, and
single-peaked.Mean, median, and mode occur
at the peak.The distribution is in terms of
numbers of standard deviations from the mean.
The mean is zero standard deviations from the mean.
Normal DistributionsNormal Distributions
Centered at the mean.Positive standard deviations are data to the right of the mean, data greater than the mean.Negative standard deviations are data to the left of the mean, data less than the mean.
Normal DistributionsNormal Distributions68 – 95 – 99.7 Rule
Approximately 68% of the data falls within one standard deviation to the left and right of the mean.Approximately 95% of the data falls within two standard deviations to the left and right of the mean.Approximately 99.7% of the data falls within three standard deviations to the left and right of the mean,
Normal DistributionsNormal DistributionsThe larger the standard deviation is
the flatter the bell-shaped curve will be.
The smaller the standard deviation the tighter the data will be.
Inflection points - the first standard deviation to the left and right occur at the inflection points of the graph; the place where the graph switches from concave up to concave down.
Normal DistributionsNormal Distributions
Standard Normal Standard Normal DistributionsDistributions
We can standardize data set, so that each data point is represented by number of standard deviations from the mean.
Standard Normal Standard Normal CalculationsCalculations
Standardizing DataStandardizing Data
Standard Normal Standard Normal DistributionsDistributionsThe standard normal distribution
is the normal distribution N(0,1) with mean 0 and standard deviation 1.
Standard Normal Standard Normal DistributionsDistributionsThe standard normal table:
◦Provides the probability of falling within a certain region of the standard normal distribution.
◦The distribution provides the area under the curve from - ∞ to some z – score.
Standard Normal Standard Normal DistributionsDistributions
Standard Normal Standard Normal DistributionsDistributionsFind the area under the curve for
the following inequalities:◦z < 1.52◦z < -1.78◦z > 2.10◦z > -1.23◦-2.10 < z < 1.27◦-1.33 < z < 3.12
Finding Normal Finding Normal ProportionsProportionsStep 1 : State the problem in terms
of the observed value x. Draw a picture of the distribution and shade the area of interest under the curve.
Step 2 : Standardize x to restate the problem in in terms of standard normal variable z. Draw a picture to show the area of interest under the standard normal curve.
Finding Normal Finding Normal ProportionsProportionsStep 3 : Find the required area
under the standard normal curve, using Table and the fact that the total area under the curve is 1.
Step 4 : Write your conclusion in the context of the problem.
Example 2.8Example 2.8The level of cholesterol in the blood is
important because high cholesterol levels may increase the risk of heart disease. The distribution of blood cholesterol levels in a large population of people of the same age and sex is roughly normal. For 14 year old boys the mean is μ=170 mg of cholesterol per dl of blood (mg/dl)and the standard deviation is σ=30 mg/dl. Levels above the 240 mg/dl may require medical attention. What percent of 14 year old boys have more than 240 mg/dl of cholesterol?
Example 2.8Example 2.8Step 1 : State the problem
◦P(x>240) with μ=170 mg/dl and σ=30 mg/dl.
◦Sketch the distribution: mark points of interest and shade the appropriate area.
Step 2 : Standardize x and draw a picture.◦Use z conversion formula◦Sketch standard normal curve of
distribution
Example 2.8Example 2.8Step 3 : Use Table appropriately.
◦1 – P(z<2.33)Step 4 : Write your conclusion in
context of the problem.
Example 2.9Example 2.9What percent of 14 year old boys
have blood cholesterol between 170 and 240 mg/dl?
Example 2.9Example 2.9Step 1 : State the problem
◦P(170 < x< 240◦Sketch the curve
Step 2 : Standardize the data◦Use z conversion formula◦P(0 < z < 2.33)◦Sketch the standardized curve
Step 3 : Use the table appropriately◦ Area between 0 and 2.33 = area below 2.33 – area below 0
Step 4: State your conclusion in the context of the problem.
Example 2.10 SAT Verbal Example 2.10 SAT Verbal Scores: Working BackwardsScores: Working BackwardsScores on the recent SAT Verbal
test in the recent years follow approximately the N(505,110) distribution. How high must a student score in order to place in the top 10% of all students taking the SAT?
Example 2.10 SAT Verbal Example 2.10 SAT Verbal ScoresScoresStep 1: State the problem and draw
a sketch.◦We want a curve that is only .10 of the
total area to the right of a z value.◦Draw Sketch◦z > something, now remember the Table
does less than’s only.Step 2: Use the Table. Look in Table A for
the entry closest to .9 since that would be the area to the left of our .1 region.
z = 1.28 is our standardized score.
Example 2.10 SAT Verbal Example 2.10 SAT Verbal ScoresScores
Assessing NormalityAssessing Normality• How can we decide if a distribution is
normal or not?◦Method 1 : Construct a frequency
histogram or stem and leaf plot. See if the graph is approximately symmetrical and bell-shaped about the mean. Histograms can reveal nonnormal features such
as outliers, pronounced skewness, or gaps and clusters.
Compare the distribution to the 68 – 95 – 99.7 rule.
Small data sets will rarely be approximately normal due to chance variation.
Assessing NormalityAssessing NormalityMethod 2: Construct a normal
probability plot. Put data in from 2.27 page 108
Put data in List 1Use 1 VarStats to compare mean and
median.Construct a dot plot using the 6th option
in your StatPlotsPlot the data and zoom in with
ZoomStatsIf the data is approximately linear then
the data is approximately normal.
Calculator Commands Calculator Commands ReviewReviewnormalcdf(xmin, xmax, μ,σ)normalcdf(zmin,zmax)shadenorm(xmin,xmax,μ,σ)Shadenorm(zmin,zmax)invnorm(p,μ,σ) gives x value that
gives this area under the curveinvnorm(p) gives z value with area
under the curve.