lecture 8 matched pairs review –summary –the flow approach to problem solving –example

Lecture 8

• Matched Pairs

• Review– Summary– The Flow approach to problem solving– Example

Announcements

Extra office hours:– Today, after class – 1:00.– Sunday, 2-5– Monday 9-11:30. I also should be in my office

most of the afternoon after 2:15

• Example 13.04 – It was suspected that salary offers were affected by

students’ GPA, (which caused S12 and S2

2 to increase).

– To reduce this variability, the following procedure was used:

• 25 ranges of GPAs were predetermined.• Students from each major were randomly selected, one

from each GPA range.• The highest salary offer for each student was recorded.

– From the data presented can we conclude that Finance majors are offered higher salaries?

The Matched Pairs ExperimentThe Matched Pairs Experiment

Adminstrative Info for Midterm• Location: Steinberg Hall-Dietrich Hall 351

• Time: Monday (Feb. 10th), 6:00-8:00 p.m.

• Closed book.

• Allowed one double-sided 8.5 x 11 note sheet

• Bring calculator.

Matched Pairs => One-Sample Test• After taking differences of observations

within each pair, one can continue with a one-sample test with 0:H0 D

GPA Group Finance Marketing1 95171 893292 88009 927053 98089 992054 106322 990035 74566 748256 87089 770387 88664 782728 71200 594629 69367 5155510 82618 81591

Difference5842

-4696-11167319-259

100511039211738178121027

• Solution 13.04 (by hand)– The parameter tested is D (=1 – 2)

– The hypotheses:H0: D = 0H1: D > 0

– The t statistic:

Finance Marketing

The matched pairs hypothesis testThe matched pairs hypothesis test

Degrees of freedom = nD – 1

The rejection region is t > t.05,25-1 = 1.711

• Solution 13.04

– Calculate t

81.325664705065

Conclusion: There is sufficient evidence to infer at 5% significance level that the Finance MBAs’ highest salary offer is, on the average, higher than that ofthe Marketing MBAs.

Matched Pairs vs. Independent Samples

• Potential advantage of matched pairs: The variance of can be much less than the variance of if the variable(s) on which the pairs are matched account for much of the variation within each group.

• However, if the variance of is close to the variance of , the independent samples design will produce a more powerful test because it has almost twice as many degrees of freedom.

21 XX DX

Recognizing Matched Pairs Experiments

• Does there exist some natural relationship between the first pairs of observations that makes it more appropriate to compare the first pair than the first observation in group 1 and the second observation in group 2?

• Before and after designs• Example: A researcher for OSHA wants to see

whether cutbacks in enforcement of safety regulations coincided with an increase in work related accidents. For 20 industrial plants, she has number of accidents in 1980 and 1995.

Examples

• Example: A scientist claims that tomatoes will grow large if they are played soft, soothing music. To test the claim, 10 tomatoes grown without music and 10 tomatoes grown with music are randomly selected. The tomatoes are weighed in ounces.

• Example: Many Americans use tax preparation companies to prepare their taxes. In order to investigate whether there are are any differences between companies, an experiment was conducted in which two of the largest companies were asked to prepare the tax returns of a sample of 55 taxpayers. The amounts of tax payable for each taxpayer at the two companies were recorded.

Caveat

• Keep in mind the difference between observational and experimental data in terms of what can be inferred. – Does the study of finance/marketing majors in

Example 13.4 show that students educated in finance are more attractive to prospective employers?

Additional Example-Problem 13.75

Tire example contd. (Problem 13.49) Suppose

now we redo the experiment: On 20 randomly

selected cars, one of each type of tire is

installed on the rear wheels and, as before, the

cars are driven until the tires wear out. The

number of miles(in 1000s) is stored in Xr13-

75. Can we conclude that the new tire is superior?

A Review of Chapters 12 and 13

• Summary of techniques seen

• Here (Chapter 14) we build a framework that helps decide which technique (or techniques) should be used in solving a problem.

• Logical flow chart of techniques for Chapters 12 and 13 is presented next.

Summary of statistical inference:Chapters 12 and 13

• Problem objective: Describe a population.– Data type: Interval

• Descriptive measurement: Central location– Parameter:

– Test statistic:

– Interval estimator:

– Required condition: Normal population

Summary - continued• Problem objective: Describe a population.

– Data type: Interval• Descriptive measurement: Variability.

– Parameter: s2

– Test statistic:

– Required condition: normal population.

22 s)1n(

Summary - continued• Problem objective: Describe a population.

– Data type: Nominal

– Parameter: p

– Test statistic:

– Required condition:

n)p1(ppp̂

n)p̂1(p̂

zp̂ 2

estimatefor(5)p̂1(nand5p̂n

)testfor(5)p1(nand5np

Summary - continued

• Problem objective: Compare two populations.– Data type: Interval

• Descriptive measurement: Central location– Experimental design: Independent samples

» population variances:

» Parameter: m1 - m2

» Test statistic: Interval estimator:

» Required condition: Normal populations

)()xx(t

(stxx21

d.f. = n1 + n2 -2

Summary - continued

• Problem objective: Compare two populations.– Data type: Interval.

• Descriptive measurement: Central location– Experimental design: Independent samples

» population variances:

» Parameter: m1 - m2

» Required condition: Normal populations

2121 )()(

)/(d.f.

Summary - continued

• Problem objective: Compare two populations.– Data type: Interval.

• Descriptive measurement: Central location– Experimental design: Matched pairs

» Parameter: mD

» Required condition: Normal differences

stx 1,2/

d.f. = nD - 1

Problem objective?

Describe a population Compare two populations

Data type? Data type?

Interval Nominal Interval Nominal

Type of descriptivemeasurement?

Z test &estimator of p

Z test &estimator of p1-p2

Central location Variability Central location Variability

t- test &estimator of

- test &estimator of 2

F- test &estimator of

2/2Experimental design?

Continue Continue

t- test &estimator of 1-2

(Unequal variances)

Population variances? t- test &estimator of D

t- test &estimator of D

(Equal variances)

Independent samples Matched pairs

Experimental design?

UnequalEqual

Identifying the appropriate technique

• Example 14.1– Is the antilock braking system (ABS) really effective?– Two aspects of the effectiveness were examined:

• The number of accidents. • Cost of repair when accidents do occur.

– An experiment was conducted as follows:• 500 cars with ABS and 500 cars without ABS were randomly

selected.• For each car it was recorded whether the car was involved in an

accident.• If a car was involved with an accident, the cost of repair was

recorded.

• Example – continued– Data

• 42 cars without ABS had an accident,

• 38 cars equipped with ABS had an accident

• The costs of repairs were recorded (see Xm14-01).

– Can we conclude that ABS is effective?

• Solution– Question 1: Is there sufficient evidence to infer

that the cost of repairing accident damage in ABS equipped cars is less than that of cars without ABS?

– Question 2: How much cheaper is it to repair ABS equipped cars than cars without ABS?

Question 2: Compare the mean repair costs per accident

• Solution

Problem objective?

Describing a single population Compare two populations

Data type?

Interval Nominal

Type of descriptivemeasurements?

Central location Variability

Cost of repair per accident

• Solution - continued

Population variancesequal?

Independent samples Matched pairs

Unequal

Experimental design?

Central location

(Equal variances)

Run the F test for the ratio of two variances.

• Solution – continued– 1 = mean cost of repairing cars without ABS

2 = mean cost of repairing cars with ABS

– The hypotheses testedH0: 1 – 2 = 0H1: 1 – 2 > 0

– For the equal variance case we use

2nn.f.d

)()xx(t

• Solution – continued– To determine whether the population variances

differ we apply the F test– From JMP we have (Xm14-01)

Do not reject H0.There is insufficientevidence to concludethat the two variances areunequal.

F-Test Two-Sample for Variances

Cost 1991 Cost 1992Mean 2075 1714Variance 450343 390409Observations 42 38df 41 37F 1.15P(F<=f) one-tail 0.3313F Critical one-tail 1.7129

Mean 2075 1714Variance 450343 390409Observations 42 38Pooled Variance 421913Hypothesized Mean Difference0df 78t Stat 2.48P(T<=t) one-tail 0.0077t Critical one-tail 1.6646P(T<=t) two-tail 0.0153t Critical two-tail 1.9908

• Solution – continued – Assuming the variances are really equal we run

the equal-variances t-test of the difference between two means

At 5% significance levelthere is sufficient evidenceto infer that the cost of repairsafter accidents for cars with ABS is smaller than the cost of repairs for cars without ABS.

Checking required conditions

• The two populations should be normal (or at least not extremely nonnormal)

Cost without ABS

900 1400 1900 2400 2900 3400 More

Frequency

900 1400 1900 2400 2900 3400 3900 More

Question 3: Estimate the difference in repair costs

• Solution– Use Estimators Workbook: t-Test_2 Means

(Eq-Var) worksheett-Estimate of the Difference Between Two Means (Equal-Variances)

Sample 1 Sample 2 Confidence Interval EstimateMean 2075 1714 361 +/-' 290Variance 450343 390409 Lower confidence limit 71Sample size 42 38 Upper confidence limit 651Pooled Variance 421913Confidence level 0.95

We estimate that the cost of repairing a car not equipped with ABS is between $71 and $651 more expensive than to repair an ABS equipped car.

lecture 8 matched pairs review –summary –the flow approach to problem solving –example

matched pairs experimentsdoes

pairs of observations

matched account

tax preparation companies

students gpa

largest companies

finance majors

salary offers

Documents

practical applications of matched series analysis: sar...

wilcoxon matched-pairs signed-ranks test

bioisosteres in medicinal chemistry - estranky.sk ·...

automatic compound design by matched molecular pairs

comparison of locking compression plate versus … twelve...

assessment of depression in dementia - henry.olders.ca ·...

solving two-step equations. algebra tiles variable zero...

inference in experiments with matched...

chapter 9 matched pairs or dependent t- testnlucas/stat...

tapps (thinking aloud pairs -...

1 sta 617 – chp10 models for matched pairs 10.4 symmetry,...

beyond matched pairs - optibriumboyle... · beyond matched...

logistic regression for matched pairs datalogistic...

models for matched pairs (models for square tables) edpsy

package ‘estimatr’ · matched-pairs, block-clustered,...

matched pairs approach to set theoretic solutions …

visualization and manipulation of matched molecular series...

employment first...

david evans, eli-lilly, 'field-aligned matched pairs

practical applications of matched molecular pairs at...