ch. 10. we cannot use a linear model unless the relationship between the two variables is linear. ...
TRANSCRIPT
![Page 1: CH. 10. We cannot use a linear model unless the relationship between the two variables is linear. Often re-expression can save the day, straightening](https://reader036.vdocument.in/reader036/viewer/2022072006/56649d015503460f949d3a56/html5/thumbnails/1.jpg)
RE-EXPRESSING DATA
CH. 10
WHY RE-EXPRESSION?
We cannot use a linear model unless the relationship between the two variables is linear.
Often re-expression can save the day, straightening bent relationships so that we can fit and use a simple linear model.
GOALS OF RE-EXPRESSION
1. Make the distribution of a variable more symmetric.
2. Make the spread of several groups more alike, even if their centers differ.
3. Make the form of a scatterplot more nearly linear.
4. Make the scatter in a scatterplot spread out evenly rather than thickening at one end.
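As a rough illustration of the first goal, the sketch below measures skewness before and after a log re-expression. The sample values and the `skewness` helper are made up for this example and are not from the slides.

```python
import math

def skewness(values):
    """Population skewness: the mean cubed z-score (0 for a symmetric sample)."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return sum(((v - mean) / sd) ** 3 for v in values) / n

# Hypothetical right-skewed sample (illustrative values only).
raw = [1, 2, 4, 8, 16, 32, 64]

# Log re-expression pulls in the long right tail.
logged = [math.log10(v) for v in raw]

print("skewness of raw data:   ", round(skewness(raw), 2))
print("skewness of logged data:", round(abs(skewness(logged)), 2))
```

The raw values pile up at the low end with a long right tail, so their skewness is clearly positive. After taking logs the values are evenly spaced, and the skewness is essentially zero.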
LADDER OF POWERS
There is a family of simple re-expressions that move data toward our goals in a consistent way. This collection of re-expressions is called the Ladder of Powers.
The Ladder of Powers orders the effects that the re-expressions have on data.
LADDER OF POWERS
| Power | Name | Comment |
|---|---|---|
| 2 | Square of data values | Try with unimodal distributions that are skewed to the left. |
| 1 | Raw data | Data with positive and negative values and no bounds are less likely to benefit from re-expression. |
| ½ | Square root of data values | Counts often benefit from a square root re-expression. |
| “0” | We’ll use logarithms here | Measurements that cannot be negative often benefit from a log re-expression. |
| –1/2 | Reciprocal square root | An uncommon re-expression, but sometimes useful. |
| –1 | The reciprocal of the data | Ratios of two quantities (e.g., mph) often benefit from a reciprocal. |
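One way to try the rungs of the ladder in code is sketched below. The function name `re_express` and the `LADDER` list are assumptions made for this illustration. The sketch assumes positive data values and uses the base-10 logarithm for the "0" rung.

```python
import math

# Powers on the ladder, from top to bottom ("0" stands for the logarithm).
LADDER = [2, 1, 0.5, 0, -0.5, -1]

def re_express(values, power):
    """Re-express positive data values using one rung of the Ladder of Powers."""
    if power == 0:
        # The "0" rung is the logarithm, not the zeroth power.
        return [math.log10(v) for v in values]
    return [v ** power for v in values]

# Hypothetical data values (illustrative only).
data = [1, 4, 100]
for power in LADDER:
    print(power, re_express(data, power))
```

Moving down the ladder pulls in large values more and more strongly. Note that the plain negative powers used here reverse the order of the data.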
CLASS EXAMPLE
You are given the following costs to build a square deck for your house.
a. Use re-expressed data to create a model that predicts the cost of the deck based on width.
b. Which model did you choose? Why do you think this model is appropriate?
c. Find the predicted cost of a square deck whose width is 10.5 ft.
d. Is it reasonable to use this model to predict the cost of a square deck that is 20 ft wide? Explain.
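The cost table from the slide is not reproduced in this transcript, so the sketch below uses placeholder widths and costs to show only the workflow for parts a and c. It re-expresses cost, fits a least-squares line, and back-transforms the prediction. The square root is used as one candidate re-expression, on the assumption that cost grows roughly with the deck's area. It is not offered as the intended answer to part b, and `fit_line` and `predict_cost` are hypothetical helper names.

```python
import math

def fit_line(xs, ys):
    """Least-squares slope and intercept for predicting ys from xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# PLACEHOLDER data, not the table from the slide.
widths = [4, 6, 8, 10, 12]            # deck width in feet
costs = [196, 400, 676, 1024, 1444]   # cost in dollars

# Re-express the response: fit sqrt(cost) against width.
sqrt_costs = [math.sqrt(c) for c in costs]
slope, intercept = fit_line(widths, sqrt_costs)

def predict_cost(width):
    """Predict sqrt(cost) from the line, then square to return to dollars."""
    return (intercept + slope * width) ** 2

print("sqrt(cost) =", round(intercept, 2), "+", round(slope, 2), "* width")
print("predicted cost at 10.5 ft:", round(predict_cost(10.5), 2))
```

With the real table, the residual plot of the re-expressed model would show whether the choice is appropriate. For part d, a 20 ft prediction is an extrapolation if that width lies beyond the widths in the data.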