ma in english linguistics experimental design and statistics sean wallis survey of english usage...

69
MA in English Linguistics MA in English Linguistics Experimental design and statistics Experimental design and statistics Sean Wallis Survey of English Usage University College London [email protected]

Upload: patrick-adams

Post on 28-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

MA in English LinguisticsMA in English LinguisticsExperimental design and statisticsExperimental design and statistics

Sean WallisSurvey of English Usage

University College London

[email protected]

Page 2: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

OutlineOutline

• What is a research question?

• Choice and baselines

• Making sense of probability

• Observing change in a corpus

• Drawing inferences to larger populations

• Estimating error in observations

• Testing results for significance

Page 3: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

What is a research question?What is a research question?

• You may have heard this phrase last term

• What do you think we mean by a “research question”?

• Can you think of any examples?

Page 4: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ExamplesExamples

• Some example research questions

Page 5: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ExamplesExamples

• Some example research questions

– smoking is good for you

Page 6: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ExamplesExamples

• Some example research questions

– smoking is good for you

– dropped objects accelerate toward the ground at 9.8 metres per second squared

Page 7: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ExamplesExamples

• Some example research questions

– smoking is good for you

– dropped objects accelerate toward the ground at 9.8 metres per second squared

– ’s is a clitic rather than a word

Page 8: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ExamplesExamples

• Some example research questions

– smoking is good for you

– dropped objects accelerate toward the ground at 9.8 metres per second squared

– ’s is a clitic rather than a word

– the word shall is used less often in recent

years

Page 9: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ExamplesExamples

• Some example research questions

– smoking is good for you

– dropped objects accelerate toward the ground at 9.8 metres per second squared

– ’s is a clitic rather than a word

– the word shall is used less often in recent years

– the degree of preference for shall rather than will has declined in British English over the period 1960s-1990s

Page 10: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Testable hypothesesTestable hypotheses

• An hypothesis = a testable research question

• Compare– the word shall is used less in recent years

to– the degree of preference for shall rather

than will has declined in British English over the period 1960s-1990s

• How could you test these hypotheses?

Page 11: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Questions of choiceQuestions of choice

• Suppose we wanted to test the following hypothesis using DCPSE

– the word shall is used less in recent years

• When we say the word shall is used less...

– ...less compared to what?• traditionally corpus linguists have “normalised” data as

a proportion of words (so we might say shall is used less frequently per million words)

• But what might this mean?

Page 12: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Questions of choiceQuestions of choice

• From the speaker’s perspective:– The probability of a speaker using a word like shall

depends on whether they had the opportunity to say it in the first place

– They were about to say will, but said shall instead

Page 13: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Questions of choiceQuestions of choice

• From the speaker’s perspective:– The probability of a speaker using a word like shall

depends on whether they had the opportunity to say it in the first place

– They were about to say will, but said shall instead

– Per million words might still be relevant from the hearer’s perspective

Page 14: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Questions of choiceQuestions of choice

• From the speaker’s perspective:– The probability of a speaker using a word like shall

depends on whether they had the opportunity to say it in the first place

– They were about to say will, but said shall instead

– Per million words might still be relevant from the hearer’s perspective

• If we can identify all points where the choice arose, we have an ideal baseline for studying linguistic choices made by speakers/writers.

Page 15: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Questions of choiceQuestions of choice

• From the speaker’s perspective:– The probability of a speaker using a word like shall depends

on whether they had the opportunity to say it in the first place

– They were about to say will, but said shall instead

– Per million words might still be relevant from the hearer’s perspective

• If we can identify all points where the choice arose, we have an ideal baseline for studying linguistic choices made by speakers/writers.

– Can all cases of will be replaced by shall ?– What about second or third person shall ?

Page 16: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

BaselinesBaselines

• The baseline is a central element of the hypothesis– Changes are always relative to something– You can get different results with different baselines– Different baselines imply different conclusions

• We have seen two different kinds of baselines– A word baseline

• shall per million words– A choice baseline (an “alternation experiment”)

• shall as a proportion of the choice shall vs. will (including’ll ), when the choice arises

Page 17: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

BaselinesBaselines

• In many cases it is very difficult to identify all cases where “the choice” arises– e.g. studying modal verbs

Page 18: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

BaselinesBaselines

• In many cases it is very difficult to identify all cases where “the choice” arises– e.g. studying modal verbs

• You may need to pick a different baseline– Be as specific as you can

• words VPs tensed VPs alternating modals

Page 19: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

BaselinesBaselines

• In many cases it is very difficult to identify all cases where “the choice” arises– e.g. studying modal verbs

• You may need to pick a different baseline– Be as specific as you can

• words VPs tensed VPs alternating modals

alternation = “different

words, same meaning”

Page 20: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

BaselinesBaselines

• In many cases it is very difficult to identify all cases where “the choice” arises– e.g. studying modal verbs

• You may need to pick a different baseline– Be as specific as you can

• words VPs tensed VPs alternating modals

• Other hypotheses imply different baselines:– Different meanings of the same word:

• e.g. uses of very, as a proportion of all cases of veryvery +N - the very personvery +ADJ - the very tall personvery +ADV - very slightly moving

alternation = “different

words, same meaning”

semasiological

variation}

Page 21: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• We are used to concepts like these being expressed as numbers:– length (distance, height)– area– volume– temperature – wealth (income, assets)

Page 22: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• We are used to concepts like these being expressed as numbers:– length (distance, height)– area– volume– temperature – wealth (income, assets)

• We are going to discuss another concept:– probability (proportion, percentage)

Page 23: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• Based on another, even simpler, idea:– probability p = x / n

Page 24: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• Based on another, even simpler, idea:– probability p = x / n – e.g. the probability that the

speaker says will instead of shall

Page 25: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• Based on another, even simpler, idea:– probability p = x / n

• where– frequency x (often, f )

• the number of times something actually happens• the number of hits in a search

– e.g. the probability that the speaker says will instead of shall

Page 26: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• Based on another, even simpler, idea:– probability p = x / n

• where– frequency x (often, f )

• the number of times something actually happens• the number of hits in a search

– cases of will

– e.g. the probability that the speaker says will instead of shall

Page 27: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• Based on another, even simpler, idea:– probability p = x / n

• where– frequency x (often, f )

• the number of times something actually happens• the number of hits in a search

– baseline n is• the number of times something could happen• the number of hits

– in a more general search – in several alternative patterns (‘alternate forms’)

– cases of will

– e.g. the probability that the speaker says will instead of shall

Page 28: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• Based on another, even simpler, idea:– probability p = x / n

• where– frequency x (often, f )

• the number of times something actually happens• the number of hits in a search

– baseline n is• the number of times something could happen• the number of hits

– in a more general search – in several alternative patterns (‘alternate forms’)

– cases of will

– total: will + shall

– e.g. the probability that the speaker says will instead of shall

Page 29: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ProbabilityProbability

• Based on another, even simpler, idea:– probability p = x / n

• where– frequency x (often, f )

• the number of times something actually happens• the number of hits in a search

– baseline n is• the number of times something could happen• the number of hits

– in a more general search – in several alternative patterns (‘alternate forms’)

• Probability can range from 0 to 1

– e.g. the probability that the speaker says will instead of shall– cases of will

– total: will + shall

Page 30: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

A simple research questionA simple research question

• What happens to modal shall vs. will over time in British English?– Does shall increase or decrease?

• What do you think?

• How might we find out?

Page 31: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Lets get some dataLets get some data

• Open DCPSE with ICECUP– FTF query for first person declarative shall:

• repeat for will

Page 32: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Lets get some dataLets get some data

• Open DCPSE with ICECUP– FTF query for first person declarative shall:

• repeat for will– Corpus Map:

• DATE Do the first set of queries and then drop into Corpus

Map}

Page 33: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Modal Modal shallshall vs. vs. willwill over time over time

• Plotting probability of speaker selecting modal shall out of shall/will over time (DCPSE)

0.0

0.2

0.4

0.6

0.8

1.0

1955 1960 1965 1970 1975 1980 1985 1990 1995

p(shall | {shall, will})

(Aarts et al., 2013)

shallshall = 100% = 100%

shallshall = 0% = 0%

Page 34: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Modal Modal shallshall vs. vs. willwill over time over time

• Plotting probability of speaker selecting modal shall out of shall/will over time (DCPSE)

0.0

0.2

0.4

0.6

0.8

1.0

1955 1960 1965 1970 1975 1980 1985 1990 1995

p(shall | {shall, will})

Is shall going up or down?

(Aarts et al., 2013)

shallshall = 100% = 100%

shallshall = 0% = 0%

Page 35: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Is Is shall shall going up or down? going up or down?

• Whenever we look at change, we must ask ourselves two things:

Page 36: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Is Is shall shall going up or down? going up or down? • Whenever we look at change, we must ask ourselves two things:

What is the change relative to?– What is our baseline for comparison? – In this case we ask

• Does shall decrease relative to shall +will ?

Page 37: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Is Is shall shall going up or down? going up or down? • Whenever we look at change, we must ask ourselves two things:

What is the change relative to?– What is our baseline for comparison? – In this case we ask

• Does shall decrease relative to shall +will ?

How confident are we in our results?– Is the change big enough to be reproducible?

Page 38: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The ‘sample’ and the The ‘sample’ and the ‘population’‘population’• The corpus is a sample

Page 39: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The ‘sample’ and the The ‘sample’ and the ‘population’‘population’• The corpus is a sample

• If we ask questions about the proportions of certain words in the corpus– We ask questions about the sample– Answers are statements of fact

Page 40: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The ‘sample’ and the The ‘sample’ and the ‘population’‘population’• The corpus is a sample

• If we ask questions about the proportions of certain words in the corpus– We ask questions about the sample– Answers are statements of fact

• Now we are asking about “British English”

?

Page 41: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The ‘sample’ and the The ‘sample’ and the ‘population’‘population’• The corpus is a sample

• If we ask questions about the proportions of certain words in the corpus– We ask questions about the sample– Answers are statements of fact

• Now we are asking about “British English”– We want to draw an inference

• from the sample (in this case, DCPSE)• to the population (similarly-sampled BrE utterances)

– This inference is a best guess– This process is called inferential statistics

Page 42: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Basic inferential Basic inferential statisticsstatistics

• Suppose we carry out an experiment– We toss a coin 10 times and get 5 heads– How confident are we in the results?

• Suppose we repeat the experiment• Will we get the same result again?

Page 43: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Basic inferential Basic inferential statisticsstatistics

• Suppose we carry out an experiment– We toss a coin 10 times and get 5 heads– How confident are we in the results?

• Suppose we repeat the experiment• Will we get the same result again?

• Let’s try…– You should have one coin– Toss it 10 times– Write down how many heads you get– Do you all get the same results?

Page 44: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Repeated sampling tends to form a Binomial distribution around the expected mean X

F

N = 1

x

531 7 9

• We toss a coin 10 times, and get 5 heads

X

Page 45: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Repeated sampling tends to form a Binomial distribution around the expected mean X

F

N = 4

x

531 7 9

• Due to chance, some samples will have a higher or lower score

X

Page 46: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Repeated sampling tends to form a Binomial distribution around the expected mean X

F

N = 8

x

531 7 9

• Due to chance, some samples will have a higher or lower score

X

Page 47: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Repeated sampling tends to form a Binomial distribution around the expected mean X

F

N = 12

x

531 7 9

• Due to chance, some samples will have a higher or lower score

X

Page 48: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Repeated sampling tends to form a Binomial distribution around the expected mean X

F

N = 16

x

531 7 9

• Due to chance, some samples will have a higher or lower score

X

Page 49: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Repeated sampling tends to form a Binomial distribution around the expected mean X

F

N = 20

x

531 7 9

• Due to chance, some samples will have a higher or lower score

X

Page 50: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Repeated sampling tends to form a Binomial distribution around the expected mean X

F

N = 26

x

531 7 9

• Due to chance, some samples will have a higher or lower score

X

Page 51: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution• It is helpful to express x as the probability of choosing a head, p, with expected mean P

• p = x / n– n = max. number of

possible heads (10)

• Probabilities are inthe range 0 to 1=percentages

(0 to 100%)

F

p

0.50.30.1 0.7 0.9

P

Page 52: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Take-home point:– A single observation, say x hits (or p as a

proportion of n possible hits) in the corpus, is not guaranteed to be correct ‘in the world’!

• Estimating the confidence you have in your results is essential

F

p

P

0.50.30.1 0.7 0.9

p

Page 53: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The Binomial distributionThe Binomial distribution

• Take-home point:– A single observation, say x hits (or p as a

proportion of n possible hits) in the corpus, is not guaranteed to be correct ‘in the world’!

• Estimating the confidence you have in your results is essential

– We want to makepredictions about future runs of the same experiment

F

p

P

p

0.50.30.1 0.7 0.9

Page 54: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Binomial Binomial Normal Normal

• The Binomial (discrete) distribution is close to the Normal (continuous) distribution

x

F

0.50.30.1 0.7 0.9

Page 55: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Binomial Binomial Normal Normal

• Any Normal distribution can be defined by only two variables and the Normal function z

z . S z . S

F

– With more data in the experiment, S will be smaller

p0.50.30.1 0.7

population

mean P

standard deviationS = P(1 – P) / n

Page 56: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Binomial Binomial Normal Normal

• Any Normal distribution can be defined by only two variables and the Normal function z

z . S z . S

F

2.5% 2.5%

population

mean P

– 95% of the curve is within ~2 standard deviations of the expected mean

standard deviationS = P(1 – P) / n

p0.50.30.1 0.7

95%

– the correct figure is 1.95996!

= the critical value of z for an error level of 0.05.

Page 57: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

The single-sample The single-sample zz test...test...

• Is an observation p > z standard deviations from the expected (population) mean P?

z . S z . S

F

P0.25% 0.25%

p0.50.30.1 0.7

observation p• If yes, p is

significantly different from P

Page 58: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

...gives us a “confidence ...gives us a “confidence interval”interval”• P ± z . S is the confidence interval for P

– We want to plot the interval about p

z . S z . S

F

P0.25% 0.25%

p0.50.30.1 0.7

Page 59: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

...gives us a “confidence ...gives us a “confidence interval”interval”• P ± z . S is the confidence interval for P

– We want to plot the interval about p

w+

F

P0.25% 0.25%

p0.50.30.1 0.7

observation p

w–

Page 60: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

...gives us a “confidence ...gives us a “confidence interval”interval”• The interval about p is called the

Wilson score interval

• This interval reflects the Normal interval about P:

• If P is at the upper limit of p,p is at the lower limit of P

(Wallis, 2013)

F

P0.25% 0.25%

p

w+

observation p

w–

0.50.30.1 0.7

Page 61: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Modal Modal shallshall vs. vs. willwill over time over time

• Simple test: – Compare p for

• all LLC texts in DCPSE (1956-77) with• all ICE-GB texts (early 1990s)

– We get the following data

– We may plot the probabilityof shall being selected,with Wilson intervals

LLC ICE-GB totalshall 110 40 150will 78 58 136total 188 98 286

0.0

0.2

0.4

0.6

0.8

1.0

LLC ICE-GB

p(shall | {shall, will})

Page 62: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Modal Modal shallshall vs. vs. willwill over time over time

• Simple test: – Compare p for

• all LLC texts in DCPSE (1956-77) with• all ICE-GB texts (early 1990s)

– We get the following data

– We may plot the probabilityof shall being selected,with Wilson intervals

0.0

0.2

0.4

0.6

0.8

1.0

LLC ICE-GB

p(shall | {shall, will})LLC ICE-GB total

shall 110 40 150will 78 58 136total 188 98 286

May be input in a

2 x 2 chi-square test

- or you can check Wilson intervals

Page 63: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

0.0

0.2

0.4

0.6

0.8

1.0

1955 1960 1965 1970 1975 1980 1985 1990 1995

p(shall | {shall, will})

Modal Modal shallshall vs. vs. willwill over time over time

• Plotting modal shall/will over time (DCPSE)

• Small amounts of data / year

Page 64: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Modal Modal shallshall vs. vs. willwill over time over time

• Plotting modal shall/will over time (DCPSE)

0.0

0.2

0.4

0.6

0.8

1.0

1955 1960 1965 1970 1975 1980 1985 1990 1995

p(shall | {shall, will})• Small amounts

of data / year

• Confidence intervals identify the degree of certainty in our results

Page 65: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Modal Modal shallshall vs. vs. willwill over time over time

• Plotting modal shall/will over time (DCPSE)

0.0

0.2

0.4

0.6

0.8

1.0

1955 1960 1965 1970 1975 1980 1985 1990 1995

p(shall | {shall, will})

• Small amounts of data / year

• Confidence intervals identify the degree of certainty in our results

• Highly skewed p in some cases

– p = 0 or 1 (circled)

Page 66: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Modal Modal shallshall vs. vs. willwill over time over time

• Plotting modal shall/will over time (DCPSE)

0.0

0.2

0.4

0.6

0.8

1.0

1955 1960 1965 1970 1975 1980 1985 1990 1995

p(shall | {shall, will})

• Small amounts of data / year

• Confidence intervals identify the degree of certainty in our results

• We can now estimate an approximate downwards curve

(Aarts et al., 2013)

Page 67: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

Recap Recap • Whenever we look at change, we must ask ourselves two things:

What is the change relative to?– Is our observation higher or lower than we might expect

• In this case we ask • Does shall decrease relative to shall +will ?

How confident are we in our results?– Is the change big enough to be reproducible?

Page 68: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ConclusionsConclusions

• An observation is not the actual value – Repeating the experiment might get different results

• The basic idea of inferential statistics is – Predict range of future results if experiment was

repeated• ‘Significant’ = effect > 0 (e.g. 19 times out of 20)

• Based on the Binomial distribution– Approximated by Normal distribution – many uses

• Plotting confidence intervals• Use goodness of fit or single-sample z tests to compare

an observation with an expected baseline• Use 22 tests or independent-sample z tests to compare

two observed samples

Page 69: MA in English Linguistics Experimental design and statistics Sean Wallis Survey of English Usage University College London s.wallis@ucl.ac.uk

ReferencesReferences

• Aarts, B., Close, J., and Wallis, S.A. 2013. Choices over time: methodological issues in investigating current change. Chapter 2 in Aarts, B. Close, J., Leech G., and Wallis, S.A. (eds.) The Verb Phrase in English. Cambridge University Press.

• Wallis, S.A. 2013. Binomial confidence intervals and contingency tests. Journal of Quantitative Linguistics 20:3, 178-208.

• Wilson, E.B. 1927. Probable inference, the law of succession, and statistical inference. Journal of the American Statistical Association 22: 209-212

• NOTE: Statistics papers, more explanation, spreadsheets etc. are published on corp.ling.stats blog: http://corplingstats.wordpress.com