unit 11: regression modeling in practice

Unit 11: Regression modeling in practice

The S-030 roadmap: Where’s this unit in the big picture?

Unit 2:Correlation

and causality

Unit 3:Inference for the regression model

Unit 4:Regression assumptions:

Evaluating their tenability

Unit 5:Transformations

to achieve linearity

Unit 6:The basics of

multiple regression

Unit 7:Statistical control in

depth:Correlation and

collinearityUnit 10:

Interaction and quadratic effects

Unit 8:Categorical predictors I:

Dichotomies

Unit 9:Categorical predictors II:

Polychotomies

Unit 11:Regression modeling

in practice

Unit 1:Introduction to

simple linear regression

Building a solid

foundation

Mastering the

subtleties

Adding additional predictors

Generalizing to other types of

predictors and effects

Pulling it all

together

In this unit, we’re going to learn about…

• Distinguishing between question predictors, covariates, and rival hypothesis predictors

• Mapping your research questions onto an analytic strategy• What kinds of paths and feedback loops do you need?• Alternative analytic approaches—which are sound, which are

unwise?• Which kinds of rival explanations can you examine and rule out?• What caveats and limitations still remain?• Constructing informative tables and figures• Writing up your results

Automated model building strategies (& why you don’t want to use them)

Automated model building strategies

(that you may see in journal articles)1. All possible subsets: all

2k-1 regression models2. Forward selection: start

with no predictors and sequentially add them so that each maximally increases the R2 statistic at that step

3. Backwards elimination: start with all predictors and sequentially drop them so that each minimally decreases the R2 statistic at that step

4. Stepwise regression (forward selection with backwards glances)

All models are wrong, but some are useful George E.P. Box (1979)

Far better an approximate answer to the right question…than an exact answer to the wrong question John W. Tukey (1962)

The hallmark of good science is that it uses models and ‘theory’ but never believes them attributed to Martin Wilk in Tukey (1962)

Occam’s razor: entia non sunt multiplicanda praeter necessitatem If two competing theories lead to the same predictions, the simpler one is better William of Occam (14th century)

Introducing the case study

RQ: “to determine whether…[the sentences of young Black and White male inmates] depended both on race and within race, on the degree to which they manifested Afrocentric facial features, controlling for the seriousness of the crimes they had committed and their prior criminal histories.”

(2004) Volume 15, Number 10, pp 674-679

Hypothesis: “controlling for legally relevant factors, Black offenders as a group may not receive harsher sentences than White offenders, but members of both groups who have relatively more Afrocentric features may receive harsher sentences than group members with less Afrocentric features”

How the study was conducted: Online data and undergraduate raters

Name: SUMMERS, JAMES LRace: WHITE

Name: SUMMERS, JAMES LRace: BLACK

Current Prison Sentence History

Offense Date Offense Sentence Date

Prison Sentence Length

02/13/2002 AGG BATTERY INTENDED HARM

02/24/2005 10Y 0M 0D

02/13/2002 SEX BAT/ WPN. OR FORCE

02/24/2005 10Y 0M 0D

02/13/2002 FALS.IMPRSN-NO 787.01 INT

02/24/2005 10Y 0M 0D

Prior Prison History

Offense Date Offense Sentence Date

Prison Sentence Length

03/17/1997 AGG BATTERY/W/DEADLY WEAPON

03/16/1998 3Y 0M 0D

06/09/1997 RESISTING OFFICER W/VIOLEN

03/16/1998 3Y 0M 0D

OutcomeYears = Sentence length

216 randomly selected felons, ages 18-24,

who committed crimes between 1 Oct 1998 – 1 Oct 2002, stratified by

100 Black 116 White

Predictors describing facial features Photos were randomly placed into groups and rated

by ~35 undergraduates, using three 9 point scales (1=not at all; 9=very much)

•Attractiveness•Baby-faced-ness•“Afrocentric” features (Features)

Predictors describing criminal history•Primary offense level (FL rates the offenses)

•Secondary offenses: # & average level•Prior offenses: # & average level

A first look at the data

Prim Any N Sec Any N PriorID Years Black Features Lev Sec Second Lev Prior Prior Lev Attr Baby

1 3.83 0 1.94286 7 1 1 5.00 0 0 0.00 2.00000 2.77143 2 11.50 1 6.20000 9 1 1 8.00 0 0 0.00 2.22857 2.65714 3 4.33 1 5.00000 9 0 0 0.00 0 0 0.00 5.26471 4.26471 4 17.17 1 7.85714 10 1 3 4.33 0 0 0.00 3.54286 3.62857 5 14.17 1 5.50000 6 1 1 6.00 1 4 4.25 2.00000 4.94118 6 4.42 0 5.88235 7 1 3 7.00 1 4 7.00 2.08824 5.67647 7 99.00 0 2.94286 11 1 3 6.33 0 0 0.00 3.57143 4.22857 8 4.67 1 6.97143 5 1 7 3.86 0 0 0.00 2.20000 3.85714 9 9.17 1 6.32353 9 1 3 4.67 0 0 0.00 3.44118 5.5882410 9.17 1 5.40000 11 0 0 0.00 0 0 0.00 3.25714 6.4285711 23.75 0 4.38235 11 0 0 0.00 0 0 0.00 2.23529 5.4411812 9.00 1 4.23529 9 1 2 8.50 0 0 0.00 2.58824 4.6176513 3.75 0 3.28571 8 0 0 0.00 0 0 0.00 1.57143 4.6571414 13.33 0 6.64706 8 1 2 3.50 0 0 0.00 2.85294 3.6764715 2.92 1 4.08571 9 1 3 9.00 0 0 0.00 4.40000 3.5142916 3.67 1 7.02941 7 1 1 5.00 0 0 0.00 3.61765 4.3529417 1.25 0 5.08571 9 1 3 5.50 0 0 0.00 2.82857 5.8857118 3.42 0 1.88235 7 0 0 0.00 0 0 0.00 2.38235 6.3823519 2.33 1 6.65714 4 0 0 0.00 1 10 3.80 3.65714 3.4857120 1.92 1 4.32353 5 1 1 1.00 1 6 4.50 2.64706 2.79412

Years ranges from 0.42 – 99 (for life in

prison)(may need to deal with its very wide

range)

By design, sample is 46% Black

rBLACK,Features= 0.74*** leaving lots of

variation in Features within each group

PrimLev ranges from 1 – 11 (from

unauthorized driver’s license to murder)

75% of inmates had a secondary offense, some several (mean # = 3.2; max # = 41!) The average level of the

2nd offenses <= primary level

33% of inmates had a prior offense, some several (mean # = 2.9; max #

= 13!) This is somewhat low, probably due to the sample’s relative

Included because there’s previous evidence that attractiveness

and baby-faced-ness is correlated with judicial decisions

How should you proceed when you have so many predictors

What have you done for HW assignments (and have we done in class)?

1. Describe the distributions of the outcome and predictors

2. Examine scatterplots of the outcome vs. each predictor, transforming as necessary (with supplemental residual plots to guide transformation)

3. Examine estimated correlation matrix to see what it foreshadows for model building

4. If there is a clear predictor for which to control statistically, examine the estimated partial correlation matrix to further foreshadow model building

5. Thoughtfully fit a series of MR models

6. Examine the series to select a “final” model that you believe best summarizes your findings

But with > 3 or 4 predictors, model building (step 5) becomes unwieldy…

Advice: Before doing any analysis, place your predictors into up to four conceptual groups based on a combination of substance/theory and their role in your statistical analyses

Question predictor(s)

Key control predictor(s)

Additional control predictor(s)

Rival hypothesis predictor(s)

Challenge:11 predictors = 211-1 = 2,047 possible models (+

interactions!)

Developing a taxonomy of fitted models (using the predictor classifications)

Question predictorsBlack and Features

Key control predictorsPrimary offense level

Additional control predictorsSecondary offenses

Prior offenses

Rival hypothesis predictorsAttractiveness

Baby-faced-ness

Strategy 1: Question predictors first

1. Start with your question predictors: after all, those are the variables in which you’re most interested

2. Add key control predictors assessing whether the effects change—probably keep the key control predictors in the model regardless

3. Add additional control predictors, keeping them in the model only as necessary

4. Check rival hypothesis predictors to see whether the effects of the question predictors remain

Strategy 2: Build a control model first

1. Start with the key control predictors: after all, you’re pretty confident they have a major effect that you need to remove

2. Add additional control predictors, keeping them in the model only as necessary

3. Add the question predictors seeing whether they have an effect over and above the control predictors

4. Check rival hypothesis predictors to see whether the effects of the question predictors remainOften the approach of choice because

it focuses attention on the question predictors

Preferable when the effects of the control predictors are so well established that beyond a first “peek” it’s difficult to think about examining the question predictors

uncontrolled

Or some combinati

Don’t forget there’s a difference between how you do the analysis and how you report

the results

Let’s begin by examining the outcome: Length of prison sentence

Variable: Years Mean 6.835694 Std Deviation 15.54396

Histogram # Boxplot97.5+** 5 * . . . . . . . .* 1 * . . . . .* 1 * .* 2 * .* 4 * .* 3 * .** 7 0 .******* 28 0 2.5+****************************************** 165 +-----+ ----+----+----+----+----+----+----+----+-- * may represent up to 4 counts

“because sentence length was skewed, a log-transformation was performed on this variable prior to analysis”

Because the relationship between sentence length (in years) and key predictors was markedly nonlinear, we transformed the outcome (by taking natural logarithms) and fit a series of regression models predicting log sentence length.

Plots vs. question and key control predictors

Having logged the outcome, what should we do next?

Variable: LYears Mean 1.128249 Std Deviation 1.06197

Histogram # Boxplot 4.75+*** 5 0 .* 1 0 .* 1 0 .*** 6 0 .**** 8 | .*********** 21 | .********* 18 +-----+ .*********************** 45 | + | .************************* 49 *-----* .******************* 37 +-----+ .********** 19 | -0.75+*** 6 | ----+----+----+----+----+ * may represent up to 2 counts

r = 0.67***

Which model building strategy makes the most sense given that…

• The effect of the question predictor (Features) is statistically significant (p<0.05) but relatively modest (r=0.15)

• There’s no difference in sentence length by race (r=0.07, ns)

• But…the effect of the key control predictor (Primary Offense Level) is very strong (r=0.67, p<0.0001)

Decision:

r = 0.15*Plots vs. question and key control predictors

r = 0.07(ns)

What functional form should we use for Primary Offense Level?

Variable: PrimLev Mean 6.541667 Std Deviation 2.07154Median 7.000000 Variance 4.29128Mode 5.000000 Range 10.00000

Stem Leaf # Boxplot 11 0000000 7 | 10 | 10 00000 5 | 9 | 9 00000000000000000000000000000000000000 38 | 8 | 8 0000000000000000000 19 +-----+ 7 | | 7 0000000000000000000000000000000000000000 40 *-----* 6 5555 4 | + | 6 0000000000000000000000 22 | | 5 | | 5 0000000000000000000000000000000000000000000 43 +-----+ 4 | 4 000000000000000000000000000000 30 | 3 | 3 0000 4 | 2 | 2 000 3 | 1 | 1 0 1 | ----+----+----+----+----+----+----+----+---

“We also included quadratic terms for seriousness of the primary offense… because the Florida Criminal Punishment Code specifies that for more serious offenses, the length of the sentence ought to increase dramatically as the seriousness of the offense increases.”Linear Model: R2 = 0.4539

Parameter StandardVariable DF Estimate Error t Value Pr > |t|

Intercept 1 -1.13117 0.17766 -6.37 <.0001PrimLev 1 0.34539 0.02590 13.34 <.0001

Quadratic Model: R2 = 0.5030 Parameter StandardVariable DF Estimate Error t Value Pr > |t|

Intercept 1 0.74718 0.44331 1.69 0.0934PrimLev 1 -0.28180 0.13895 -2.03 0.0438PrimLevSq 1 0.04726 0.01030 4.59 <.0001

Pearson Correlation Coefficients, N = 216 Prob > |r| under H0: Rho=0

LYears PrimLev AnySec SecLev NSecond AnyPrior PriorLev LYears 1.00000 0.67374 0.03399 0.36529 0.19168 -0.20537 -0.15635 <.0001 0.6193 <.0001 0.0047 0.0024 0.0215

PrimLev 0.67374 1.00000 -0.04221 0.30743 -0.03511 -0.30738 -0.22443 <.0001 0.5373 <.0001 0.6078 <.0001 0.0009

AnySec 0.03399 -0.04221 1.00000 0.77439 0.35567 0.16014 0.10670 0.6193 0.5373 <.0001 <.0001 0.0185 0.1179

SecLev 0.36529 0.30743 0.77439 1.00000 0.30844 -0.05396 -0.03146 <.0001 <.0001 <.0001 <.0001 0.4301 0.6457

NSecond 0.19168 -0.03511 0.35567 0.30844 1.00000 -0.05159 -0.05633 0.0047 0.6078 <.0001 <.0001 0.4507 0.4101

AnyPrior -0.20537 -0.30738 0.16014 -0.05396 -0.05159 1.00000 0.89362 0.0024 <.0001 0.0185 0.4301 0.4507 <.0001

PriorLev -0.15635 -0.22443 0.10670 -0.03146 -0.05633 0.89362 1.00000 0.0215 0.0009 0.1179 0.6457 0.4101 <.0001

NPrior -0.11997 -0.20655 0.13546 0.01684 -0.00930 0.71445 0.54775 0.0785 0.0023 0.0468 0.8057 0.8919 <.0001 <.0001

A first look at the effects of the other control variables

Two analytic issues:1 Should we be controlling for

the effects of primary offense level?

2 Given Florida statutes, we might also ask about functional form—might the effects of some of these predictors also be non-linear?

Inmates with more severe secondary offenses have, on average, more severe primary offenses and longer sentences

(but there’s no effect of the presence of secondary offenses, and less effect of the #)

Inmates with prior offenses have, on average, less severe primary offenses and (?therefore?) shorter sentences

As expected, the three variables describing secondary and (especially) prior offenses

are positively related

Overall, there’s little relationship between prior offenses and secondary

offenses

Pearson Partial Correlation Coefficients, N = 216 Prob > |r| under H0: Partial Rho=0

LYears SecLev NSecond AnySec PriorLev NPrior

LYears 1.00000 0.22803 0.30637 0.11599 -0.00959 0.04104 0.0008 <.0001 0.0905 0.8890 0.5505

SecLev 0.22803 1.00000 0.33585 0.83363 0.04033 0.08743 0.0008 <.0001 <.0001 0.5574 0.2027

NSecond 0.30637 0.33585 1.00000 0.35583 -0.06592 -0.01704 <.0001 <.0001 <.0001 0.3372 0.8043

AnySec 0.11599 0.83363 0.35583 1.00000 0.10080 0.12664 0.0905 <.0001 <.0001 0.1416 0.0644

PriorLev -0.00959 0.04033 -0.06592 0.10080 1.00000 0.52661 0.8890 0.5574 0.3372 0.1416 <.0001

NPrior 0.04104 0.08743 -0.01704 0.12664 0.52661 1.00000 0.5505 0.2027 0.8043 0.0644 <.0001

AnyPrior -0.00512 0.04419 -0.06556 0.15752 0.88939 0.70100 0.9407 0.5202 0.3399 0.0212 <.0001 <.0001

Addressing issue 1: A second look at the other control predictors,

partialling out the effects of PrimLev & PrimLev2

Conclusion:

(should also note that we double-checked the functional form of

prior offenses and there was still no effect)

The positive effect of the number (and level) of secondary offenses persists (but ANYSEC still appears to have no effect)

The negative effect of prior offenses disappears upon control for severity of

the primary offense (thankfully)

The distinctiveness of prior and secondary offenses persists -- these sets of

predictors seem fairly distinct

Controlling for Primary offenses: R2 = 0.5363 Parameter StandardVariable DF Estimate Error t Value Pr > |t|

Intercept 1 0.64277 0.43118 1.49 0.1375PrimLev 1 -0.23931 0.13853 -1.73 0.0856PrimLevSq 1 0.04022 0.01055 3.81 0.0002SecLev 1 -0.03298 0.05942 -0.56 0.5795SecLevSq 1 0.01488 0.00809 1.84 0.0672

Addressing issue 2: Functional form for secondary offense predictors

Uncontrolled Model: R2 = 0.2911 Parameter StandardVariable DF Estimate Error t Value Pr > |t|

Intercept 1 1.00906 0.11647 8.66 <.0001SecLev 1 -0.27569 0.06638 -4.15 <.0001SecLevSq 1 0.05821 0.00846 6.88 <.0001

“We also included quadratic terms for … seriousness of additional offenses … because the Florida Criminal Punishment Code specifies that for more serious offenses, the length of the sentence ought to increase dramatically as the seriousness of the offense increases.”

LNSecond = ln(NSecond + 1)r = .19 (p=.0047) r = .17 (p=.0150)

“starting” a variable

Finalizing the “control” model for the effects of severity of crimes

“Final” control model: R2 = 0.5712 Parameter StandardVariable DF Estimate Error t Value Pr > |t|

Intercept 1 0.43362 0.41871 1.04 0.3016PrimLev 1 -0.18643 0.13415 -1.39 0.1661PrimLevSq 1 0.03650 0.01021 3.57 0.0004SecLev 1 -0.22487 0.07374 -3.05 0.0026SecLevSq 1 0.03220 0.00885 3.64 0.0003LNSecond 1 0.40356 0.09766 4.13 <.0001

Double checked effects of other crime predictors (AnyPrior, NPrior, PriorLev &

AnySec) in uncontrolled and controlled models and all were n.s.

“The results of the analysis showed, as expected, that criminal record accounted for a substantial amount of the variance (57%) in sentence length. The resulting unstandardized coefficients (and their standard errors and associated t statistics) are given in Table 1 (Model 1). Unsurprisingly, the seriousness of the primary offense (linear and quadratic effects) and both the seriousness (quadratic effect) and the number of additional offenses were significant predictors of sentence length. Neither the seriousness nor the number of prior offenses predicted sentence length. We attribute these null effects to the relative youthfulness of the inmates, who had relatively few prior felony offenses (mean=0.95, sd=1.90)”

Examining the effects of the question predictors: Uncontrolled & controlled

Results of fitting a taxonomy of multiple regression models predicting ln(sentence length) among a random sample of 216 Florida inmates

Uncontrolled modelsModels controlling for

severity and number of offensesa

Predictor A B C D E F G H

Black0.14(0.14)0.99

-0.20(0.21)-0.97

1.18(0.63)1.87

-0.07(0.10)-0.75

-0.30*(0.14)-2.15

-0.20(0.10)-0.46

Features

0.09*(0.04)2.25

0.13*(0.06)2.24

0.25**(0.25)3.22

0.03(0.03)1.00

0.09*(0.04)2.25

0.10~(0.05)1.88

Black*Features

-0.28*(0.12)-2.33

-0.02(0.08)-0.25

R2 0.5 2.3 2.7 5.2 57.2 57.3 58.3 58.3

F0.98(1,214)0.3226

5.08*(1,214)0.0253

3.00~(2,213)0.0517

3.86*(3,212)0.0102

46.41***(6,209)<0.0001

46.79***(6,209)<0.0001

41.45***(7, 208)<0.0001

36.12***(8,207)<0.0001

Cell entries are estimated regression coefficients, (standard errors) and t-statistics~ p<.10, *p<.05, **p<.01, ***p<.001aModels E-H control for 5 additional predictors: Primary offense level (linear and quadratic), Secondary offense level (linear and quadratic) and log(Number of Secondary offenses)

In uncontrolled models, there is a statistically significant interaction

between Race and Features (such that Whites with higher values get longer

sentences, while for Blacks, there’s no effect).

White Black

rBlack, Features = .74

In controlled models, there are main effects of Race and Features, but there

is no interaction between the two. Controlling for severity and number of

offenses: (1) Blacks and Whites with more “afrocentric” features receive longer

sentences; and (2) controlling for features, Whites receive longer sentences than Blacks

Features

What about the rival hypothesis predictors?: Attractiveness & Baby-faced-ness

Adding Attractiveness and Baby-faced-ness to the “final” model Parameter StandardVariable DF Estimate Error t Value Pr > |t|

Intercept 1 0.17736 0.49786 0.36 0.7220PrimLev 1 -0.19096 0.13371 -1.43 0.1548PrimLevSq 1 0.03649 0.01018 3.58 0.0004SecLev 1 -0.22544 0.07420 -3.04 0.0027SecLevSq 1 0.03244 0.00887 3.66 0.0003L2NSecond 1 0.39795 0.09812 4.06 <.0001Black 1 -0.30791 0.14194 -2.17 0.0312Features 1 0.09469 0.04100 2.31 0.0219baby 1 0.02305 0.04426 0.52 0.6031attr 1 -0.02805 0.05524 -0.51 0.6121

The rival hypothesis predictors have no

effect (we also tested these predictors

separately and found the same thing)

Question predictor effects remain even

controlling for the rival hypothesis predictors

“Finally, we examined the influence of facial attractiveness and babyish features on sentence length.

Controlling for criminal record, neither variable was a significant predictor of sentence length, t(206) = 0.05 and t(206) = 0.65, respectively. Moreover, Afrocentric

features continued to predict sentence length when these variables were controlled t(203) =2.32, p<0.025.”

Final check: We tested for statistical interactions between our two question predictors, Black and

Features, and all other predictors in the model: None were statistically significant

What predictors should we include in our “final” model???

What would have happened if we used an automated strategy?

AnySec*

Nsecond*

SecLev**

PrimLev***

AnySec*

Nsecond*

SecLev**

PrimLev***

AnySec*

Nsecond*

SecLev*

PrimLev***

All variables kept in raw form

Stepwise regression

Backward elimination

Forward selection

AnySec***

LNSecond***

SecLevSq***

PrimLevSq***

PrimLev~

AnySec***

LNSecond***

SecLevSq***

PrimLevSq***

PrimLev~

All variables transformed as we did in our analyses

Stepwise regression

Backward elimination

Forward selection

Black*

Features*

AnySec**

LNSecond***

SecLevSq***

PrimLevSq***

PrimLev~

Examining residuals from “final” modelStem Leaf # Boxplot 2 779 3 0 2 0144 4 | 1 5556667789 10 | 1 00011223334444 14 | 0 55555555566666666666667788888899999999 38 +-----+ 0 001111111222222233333333444444444 33 | + | -0 444444443333333322222222222211111111111100000 45 *-----* -0 999999988888887777777666666655555555555 39 +-----+ -1 443322211110 12 | -1 888877555555 12 | -2 4430 4 | -2 97 2 0 ----+----+----+----+----+----+----+----+----+

n = 13(6.0%) Reasonably symmetric

Contemplating a graph that displays the findings

Ln (sentence

length)

WhiteBlack

Very seriouscrime

Averagecrime

Features

WhiteBlack

Less serious crime

PrimLevStem Leaf # Boxplot 11 0000000 7 | 10 | 10 00000 5 | 9 | 9 00000000000000000000000000000000000000 38 | 8 | 8 0000000000000000000 19 +-----+ 7 | | 7 0000000000000000000000000000000000000000 40 *-----* 6 5555 4 | + | 6 0000000000000000000000 22 | | 5 | | 5 0000000000000000000000000000000000000000000 43 +-----+ 4 | 4 000000000000000000000000000000 30 | 3 | 3 0000 4 | 2 | 2 000 3 | 1 | 1 0 1 | ----+----+----+----+----+----+----+----+---

Mean 6.54167 Std Dev 2.07154

100% Max 1190% 975% Q3 850% Median 725% Q1 510% 4 0% Min 1

False driver’s license (1)

Possessing child pornography (5)

Aggravated child abuse (9)

Florida Criminal Punishment Code Severity Ranking ChartSection 921.0022, Florida Statutes

Aggravated battery on an officer (7)

Grand theft between 5-10K (3)

FeaturesBlackLNSecondSecLevSecLevPrimLevPrimLevYrsnL 0898.03032.03957.00322.02233.00366.1896.01850.0)(ˆ 22

White Black

Computing fitted values to create prototypical plots

Model G: Main effects of Black & Features, controlling for severity of offense

Parameter StandardVariable DF Estimate Error t Value Pr > |t|

Intercept 1 0.18504 0.43271 0.43 0.6694PrimLev 1 -0.18960 0.13302 -1.43 0.1556PrimLevSq 1 0.03663 0.01013 3.62 0.0004SecLev 1 -0.22333 0.07365 -3.03 0.0027SecLevSq 1 0.03216 0.00881 3.65 0.0003L2NSecond 1 0.39574 0.09770 4.05 <.0001Black 1 -0.30321 0.14127 -2.15 0.0330Features 1 0.08978 0.03990 2.25 0.0255

9326.0

4009.3

LNSecond

SecLev

5th%ile1.94

95th%ile6.23

5th%ile3.99

95th%ile7.59

FeaturesBlack

PrimLevPrimLevYrsnL

0898.03032.0)9326.0(3957.0)4009.3(0322.0)4009.3(2233.0

0366.1896.01850.0)(ˆ2

FeaturesBlackPrimLevPrimLevYrsnL

0898.03032.00366.1896.01671.0)(ˆ 2

Features Wh, Low Wh, Med Wh, Hi Bl, Low Bl, Med Bl, Hi12 0.3137 0.8129 1.60493 0.4035 0.9027 1.69474 0.4933 0.9925 1.7845 0.1901 0.6893 1.48135 0.5831 1.0823 1.8743 0.2799 0.7791 1.57116 0.6729 1.1721 1.9641 0.3697 0.8689 1.66097 0.4595 0.9587 1.75078 0.5493 1.0485 1.84059

Computing fitted values of ln(Sentence Length), by Features, Black, and Severity

of primary offense

Identifying reasonable plotting limits for Features for Blacks and Whites

White limits: 2 – 6Black limits: 4 - 8

Summarizing the effects of crime severity, race and features

1 2 3 4 5 6 7 8 9Features

Ln(Yrs)

WhiteBlack

Moderately serious crime

Less serious crimeWhite

Very serious crimeWhite

FeaturesThe more “afrocentric” one’s facial features, the longer his sentence (for individuals of both races)

Severity of crimeThe more serious the crime, the longer the sentence (and the more serious the crime, the larger the effect of seriousness)

RaceControlling for severity and facial features, Blacks have shorter sentences than Whites

Mean for Whites = 3.09Mean for Blacks = 5.92

“…when we examined the race difference in sentence length controlling for Afrocentric features, we were comparing White inmates with relatively high levels of Afrocentric features and Black inmates with relatively low levels.”“At the two within-group mean levels, there was no difference in sentence length between the groups.”

What tables and graphs might we present in a paper or presentation?

Four sets of evidence ina typical research

presentation1. Descriptive statistics: a

table summarizing distributions (often by interesting subgroups)

2. Correlation matrix summarizing relationships among variables (sometimes with partials as well)

3.3. SelectedSelected regression results documenting key findings from the analysis (not every model you fit)

4. Prototypical plots summarizing the major findings (probably the plot we just constructed)

Don’t forget to distinguish between how you do the analysis and how you report

the results Helpful hints about presenting

results1. Decide on your key points: Your

text, tables and displays (appropriately titled and organized) should support that argument

2. Think about your reader, not yourself: take the reader’s perspective and supply evidence that helps him/her evaluate your argument

3. Try out alternative displays and text: your first attempt is rarely your best

4. Writing up your results usually helps solidify—and often modify—your major argument, tables and graphs: Learn from writing; re-writing is essential

Table 1. Estimated means and sd’s by race (with t-statistics testing for differences in means by race)

Table 1. Estimated means and standard deviations of the sentence length and predictors, by race of felon(with t-test for difference in means)

VariableWhite

(n=116)

Black(n=100

) tSentence length (in years)

5.77(13.23)

8.08(17.85) -1.09

Ln(Sentence length)

1.06(1.01)

1.21(1.12) 0.99

Primary offense level

5.84(2.03)

6.93(2.07) -2.56*

Any secondary offenses?

0.65(0.44)

0.76(0.43) -0.46

N secondary offenses

1.84(4.65)

2.07(2.50) 1.21

Secondary offense level

3.14(2.47)

3.70(2.66) -1.61

Any prior offenses?

0.23(0.47)

0.34(0.48) -0.33

N prioroffenses

0.84(1.89)

1.07(1.92) -0.87

Prior offense level

0.91(2.16)

1.57(2.44) -0.84

Features 3.09(1.27)

5.92(1.11)

-15.86***

Attractiveness 2.90(0.98)

3.36(0.80) -2.27*

Baby-faced-ness

4.00(1.10)

4.09(1.10) -0.59

Cell entries are sample means and standard deviations*p<0.05; **p<0.01, ***P<0.001

Estimated mean sentence lengths are 5.77 years for Whites and 8.08 for Blacks; the difference is not statistically significant.

The mean primary offense level for Black felons (6.93) is significantly higher than the mean for White felons (5.84)

65% of White felons and 76% of Black felons had a secondary offense. This difference is not statistically significant nor is the difference between Black and White felons with respect to the number or severity of secondary offenses

23% of White felons and 34% of Black felons had a history of prior offenses. This difference is not statistically significant nor is the difference between Black and White felons with respect to the number or severity of prior offenses

On average, Black felons had significantly more afrocentric features than White felons (5.92 vs 3.09, t=-15.86, p<0.0001), but within both groups there is substantial variation (standard deviations of 1.11 and 1.27 respectively)

On average, Black felons were significantly more attractive than White felons (3.36 vs 2.90, t=-2.27, p<0.05), but the two groups were equivalent with respect to baby-faced-ness

Table 2. Correlation matrix and Partial Correlation Matrix (controlling for primary offense level (in linear and quadratic form) n=216)

Ln (Sentence Length)

Primary off

ense level

Any secondary off

Ln(N Secondary

offenses

Secondary offense

Any Prior offenses

Prior offense level

Features

Attractiveness

0.67***

Any secondary offenses?

0.03 0.12

Ln(N secondary offenses)

0.17* 0.28***

0.72*** 0.72***

Secondary offense level

0.37*** 0.23***

0.31***

0.77*** 0.83***

0.58*** 0.62***

Any prior offenses?

-0.21**-0.01

-0.31***

0.16* 0.16*

0.05 0.04

-0.05 0.04

Prior offense level

-0.16*-0.01

-0.22***

0.11 0.10

0.03 0.02

-0.03 0.04

0.89*** 0.89***

Black 0.07-0.07

0.17* 0.03 0.04

-0.05-0.05

0.11 0.06

0.02 0.08

0.06 0.10

Features 0.15* 0.05

0.18** 0.03 0.04

-0.03-0.02

0.11 0.06

0.01 0.07

0.05 0.10

0.74*** 0.73***

Attractiveness -0.02-0.00

-0.04 -0.05-0.05

-0.03 0.03

-0.03-0.02

0.00-0.01

-0.01-0.02

0.15* 0.16*

0.25*** 0.26***

Baby-faced-ness

0.12 0.01

0.15* -0.06-0.05

-0.06-0.06

-0.01-0.06

-0.07-0.03

-0.08-0.05

0.04 0.01

0.02-0.01

0.09 0.09

Table 3. Results of fitting a taxonomy of multiple regression models

Results of fitting a taxonomy of multiple regression models predicting ln(sentence length) among a random sample of 216 Florida inmatesPredictor Model A Model B Model C Model D Model EConstant 0.62**

(0.22)0.75

(0.44)0.43

(0.42)0.19

(0.43)0.18

(0.50)Black -0.20

(0.21)-0.30*(0.14)

-0.31*(0.14)

Features 0.13*(0.06)

0.09*(0.04)

-0.28*(0.14)

-0.19(0.13)

-0.19(0.12)

-0.19(0.13)

Primary offense level2

0.05***(0.01)

0.04***(0.01)

Secondary level

-0.22**(0.07)

-0.23**(0.07)

Secondary level2

0.03***(0.01)

Ln (N sec offenses)

0.40***(0.10)

Attractiveness

-0.03(0.06)

Baby-faced-ness

0.02(0.04)

R2 2.7 50.3 57.1 58.3 58.4Cell entries are estimated regression coefficients and standard errors*p<.05, **p<.01, ***p<.001

Here’s how they presented the MR results

“We turn next to the question of race differences in sentencing. We estimated a second model (Model 2) in which inmate race (-1 if White, +1 if Black) was

entered as a predictor along with the variables from the previous model. … The race of the offender did not account for a significant amount of variance in sentence length over and above the effects of seriousness and number of

offenses, t(206) = 0.90, p = .37.” [Notice that they refer to “Model 2” but they chose not to present it]

Two alternative graphic displays of findings

2 3 4 5 6 7 8

Features

Ln(Yrs)

WhiteBlack

Moderately serious crime

Less serious crime

WhiteBlack

Very serious crime

WhiteBlack

Another example of model building: The Father Presence study

“A hierarchical linear regression analysis was conducted to determine the effects of fathers’ antisocial behavior and fathers’ presence on child antisocial behavior. Fathers’ antisocial behavior (r=.30, p<.001) and fathers’ presence (r=-.16, p<.001) were significantly correlated with child behavior problems. …

DADHOMEDADASBCHILDASB 321

At the second step, we asked whether the effect of father presence was moderated by fathers’ antisocial behavior. Thus, the interaction between fathers’ antisocial behavior and father presence was entered and the model was estimated as:

DADHOMEDADASB

DADHOMEDADASBCHILDASB*4

The interaction was statistically significant, slope = .28, p<.001).

We conducted four additional analyses to test the robustness of the interaction between fathers’ antisocial behavior and father presence. First, we tested whether fathers’ antisocial behavior moderated the effect of father presence controlling for the presence of nonbiological father figures in the home. Second, we tested whether fathers’ antisocial behavior moderated the effect of father presence, controlling for maternal antisocial behavior. Third, we tested whether the interaction between fathers’ antisocial behavior and father presence predicted child behavior problems in the clinical range. Fourth, we tested whether fathers’ antisocial behavior moderated a more fine-grained measure of his involvement, such as his caretaking behavior.”

At the first step, we asked whether fathers’ antisocial behavior and father presence independently predicted child behavior problems. The model was estimated as:

Fathers’ antisocial behavior significantly predicted elevated levels of child antisocial behavior (slope = 0.32, p<0.001), but father presence did not when fathers’ antisocial behavior was controlled (slope = 1.80, p=.33).

These same model building strategies can apply to more complex models!

(n=51)(n=57)

RQ: Can narrative skills be ‘taught’ via TV to English

Language Learners?

Narrative development in bilingual kindergarteners: Can Arthur help? Yuuko Uchikoshi (2005)

Developmental Psychology

More complex residual term

Taxonomy of fitted models predicting narrative development

Multiple RegressionAnalysis 22110 XXY

Do your residuals meet the required assumptions?

Test for residual

normality

Use influence statistics to

detect atypical datapoints

Are the data longitudinal?

Use Individual

growth modeling

If your residuals are not independent,

replace OLS by GLS regression analysis

Specify a Multilevel

If time is a predictor, you need discrete-

time survival analysis…

If your outcome is categorical, you need to

use…

Discriminant Analysis

Multinomial logistic

regression analysis

(polychotomous outcome)

Binomial logistic

regression analysis

(dichotomous outcome)

If you have more predictors than you

can deal with,

Create taxonomies of fitted models and compare

Conduct a Principal Components Analysis

Form composites of the indicators of any common

construct.

Use Cluster Analysis

Transform the outcome or predictor

If your outcome vs. predictor relationship

is non-linear,

Use non-linear regression analysis.

Go to supplemental resources on course website

The S-052 Roadmap (Courtesy of John B. Willett)

What’s the big takeaway from this unit?

• Be guided by the research questions– Don’t go on fishing expeditions fitting all possible subsets and don’t rely

on computers to select models for fitting– No automated model selection routine can replace thoughtful model

building strategies– It’s wise to divide your predictors into substantive groupings and use

those groupings to guide the analysis• There is no single “right answer” or “right model”

– Different researchers may make different analytic decisions; hopefully, substantive findings about question predictors won’t change (but they can)

– Different researchers will choose to make different decisions about what information to present in a paper; hopefully, regardless of approach, there will be sufficient information to judge the soundness of the conclusions

• You can do data analysis!– Think back to the beginning of the semester; you’ve all come a long

way – You can judge the soundness of a research presentation; don’t believe

everything you read and be sure to read the methods section– No matter how much you learn about data analysis, there’s always

more to learn!

unit 11: regression modeling in practice

Documents

utica shale regression modeling 5_19 update

logistic regression for distribution modeling - clas...

regression and modeling

modeling scientific impact with topical influence regression

data modeling using symbolic regression

predicting modeling with sas (for health) group...

ordinal logistic regression (cumulative logit modeling...

structural equation modeling and regression: guidelines

credibility modeling via spline nonparametrie ...

variable selection in semiparametric regression modeling

boosted regression trees for ecological modeling

introduction to logistic regression modeling

finite element simulation and regression modeling of

handbook of regression and modeling -...

regression modeling and prediction by individual

regression discontinuity: a causal modeling approach to non...

spatial regression modeling - csiss.org · spatial...

beyond multilevel regression modeling: multilevel analysis...

regression modeling strategies for microarchitectural...

regression modeling - data...