computational morphology and its implications for theoretical morphology richard sproat university...

52
Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge Venice April 12, 2006

Upload: patrick-gaines

Post on 20-Jan-2016

224 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology and its Implications for Theoretical Morphology

Richard SproatUniversity of Illinois at Urbana-Champaign

PASCAL MorphoChallenge

Venice

April 12, 2006

Page 2: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 2

“Item-and-arrangement” versus “Item and process”

• Charles Hockett (1954) “Two models of grammatical description”:– Item-and-arrangement: words are

composed of morphemes that are put together by a kind of “word syntax”

– Item-and-process: words are built up via the application of rules that add phonological and morphosyntactic information

Page 3: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 3

Stump’s classification

Lexical Inferential

Incremental Lieber Steele

Realizational Halle&Marantz Stump,

Beard’s LMBM

hoot+s[3sg] Ø’s / hoot[3sg]

hoots = 3sg because of -s

-s is introduced due to 3sg

Affix is a lexical entry that introduces morphosyntactic features

Affix introduced because of morphosyntactic features

Page 4: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 4

Computational morphology• Nearly all morphological operations can be expressed

in terms of regular relations.– Only possible exception is reduplication

• Regular relations are relations over pairs of strings that can be constructed solely by the operations of:– Concatenation: if R, S are regular relations then so is R• S– Union: if R, S are regular relations then so is RUS– Kleene closure: if R is a regular relation then so is R* (0 or

more instances of R concatenated with itself)

• Regular relations are closed under composition: if R, S are regular relations, then so is R○S

• Implemented with finite-state transducers

Page 5: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 5

Transducers and composition(Johnson, 1972; Koskenniemi, 1983; Kaplan & Kay, 1994; Mohri & Sproat, 1996)

•Consider 3-letter alphabet {a,b,c}•Given a rule a b, the equivalent transducer is:

abbca

bbbcb

Page 6: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 6

Another rule

bc / _ b

Page 7: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 7

The two rules composed

abbca

ccbcb

bc / _ bab

Page 8: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 8

Composition and morphology

• Composition is the most general computational mechanism that handles morphological operations (Roark and Sproat, 2006)

• Affixation (which is more typically handled using concatenation) can also be handled using composition

• Composition, and other closure properties of regular relations imply that there is no fundamental difference between morphological theories.

Page 9: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 9

Affixation as composition

Any string over the alphabet Insert

Page 10: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 10

Is this Rube-Goldbergesque?

• No! Because many affixes either impose requirements on their base or modify their base.

• Cf. Yowlumne (aka Yawelmani) (Archangeli, 1984)

Page 11: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 11

Yowlumne gerundial -inay

• -inay requires the template CVC(C)

Composing the base with 1 will modifythe base and add [+GER]

Page 12: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 12

CVC(C)

Page 13: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 13

Some morphological operations

• Subsegmental morphology• Truncation• Infixation• Root-and-pattern morphology• Reduplication• Morphomic requirements (Aronoff, 1994)

• All of these can be handled using composition

Page 14: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 14

German diminutives

Page 15: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 15

Koasati truncation (Lombardi & McCarthy, 1991)

Page 16: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 16

Two kinds of infixation

• Extrametrical infixation– E.g. Bontoc

• Positively circumscribed infixation– E.g. Ulwa

Page 17: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 17

Bontoc infixation (Seidenadel, 1907)

Page 18: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 18

Ulwa infixation (CODIUL, 1989)

Page 19: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 19

Root & pattern morphology (McCarthy 1979)

k t b

Page 20: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 20

Root & pattern morphology

Page 21: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 21

Root & pattern morphology: related approaches

• Beesley & Karttunen (2000) propose an approach using compile-replace plus merge

• Kiraz (2000) proposes a multitape solution

• But all of these are equivalent to composition

d V V r V Sd u u r i sSurface form is a regular expression

Page 22: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 22

Reduplication: Gothic (Wright 1910)

• Prefix a syllable of the form (A)Cai to the stem, where C is a consonant position and A is an optional appendix

• Copy the onset of the stem to the C position. If there is a pre-onset appendix /s/, copy this to the appendix position

Page 23: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 24

Bambara reduplication (Culy, 1985)

This is apparently beyond the power offinite-state methods.

Page 24: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 25

Factoring reduplication

• Prosodic constraints

• Copy verification transducer C

Page 25: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 26

Gothic index transducer

Page 26: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 27

Factoring reduplication

• Then reduplication in Gothic can be modeled as:

α o C• More generally, one can model reduplication

as the following composition, where P implements the prosodic constraints, C the copy constraints, and A optional phonological adjustments:

P o C o A

Page 27: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 28

Other approaches

• Walther (2000a, 2000b) proposes a special kind of transducer involving– Repeat arcs: move backwards in a string and

repeat– Skip arcs: skip over portions of the string

• Cohen-Sygal & Wintner (forthcoming) introduce finite state registered automata, extending FSA’s with registers

• These methods generally seem to presume exact copies

Page 28: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 29

Non-exact copies

• Dakota (Inkelas & Zoll, 1999):

Page 29: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 30

Non-exact copies

• Basic and modified stems in Sye (Inkelas & Zoll, 1999):

“they will fall all over”

Page 30: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 31

Morphological Doubling Theory(Inkelas & Zoll, 1999)

• In contradistinction to the more common “correspondence” theory:– Reduplication involves doubling at the

morphosyntactic level– Phonological doubling is thus expected,

but not required

Page 31: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 32

Gothic reduplication under Morphological Doubling Theory

Page 32: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 33

More

• Composition also elegantly accounts for other phenomena such as prosodic circumscription (McCarthy and Prince, 1990) or morphomic requirements (Aronoff, 1994).

• Composition of regular relations can model rules

• It can also model affixation• It doesn’t matter if you describe affixation as

lexical-incremental or inferential-realizational

Page 33: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 34

Morphomic requirements (Aronoff, 1994)

Latin 3rd Stem

Page 34: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 35

So?• 3rd stem is not morphologically uniform:

– It differs across different verb classes and some verbs have idiosyncratic third stems

• It is not semantically coherent:– Forms that require the 3rd stem are a motley crew

• Yet there is clearly a notion of 3rd stem:– If you tell me the 3rd stem of a verb, I can tell you how

the agentive noun, the supine, the perfect participle … are formed

• 3rd stem has a purely morphological function

Page 35: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 36

3rd stem is just prosodically induced affixation

• Assume we have a transducer T that forms the 3rd stem of a verb:– of course, T will have to allow for a lot of

idiosyncratic changes

Σ* >3st:ε Σ*

Page 36: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 38

Summary so far

• Most or all morphological operations can be handled with composition

• We wish to show next that this fact, along with general properties of regular languages and relations, allows us to dispense with distinctions between morphological theories.

Page 37: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 39

Return to Stump (2001)

• In (Roark & Sproat, 2006) we reanalyze Stump’s analyses of:– Sanskrit nominal declensions– Swahili verbal declensions– Breton double plurals

• All of which purport to show the need for an realizational-inferential account.

• Here we will consider:– A simple example from Beard & Volpe’s analysis

of English agentive nominals– A quick overview of the Sanskrit case.

Page 38: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 40

English Agentive Nominals (cf. Beard & Volpe, 2005)

• read-er, stand-ee, correspond-ent, record-ist, cook

• ent / [+ent][+noun,+agentive] __ $

• Call the set of all agentive rules R• We can define a new ‘metarule’ R′ that is the

union of all rules in R:

Page 39: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 41

Feature [+noun,+agentive]

• Presumably this is also introduced by rule: call this rule M

• Then given a base B, the base with that feature specification added is given by B○M

• Then the appropriate suffixed form is given by [B○M]○R′

• But this can be written, by associativity, as B○[M○R′]

• Finally, [M○R′] can be precomposed; call this R′′

Page 40: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 42

So what?

• R′′:– Introduces the morphosyntactic feature

[+noun,+agentive]– Introduces the affixal morphology as

appropriate to the base

• In short, R′′ encodes a lexical-incremental model of morphology.

Page 41: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 43

Sanskrit declensions

Page 42: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 44

Sanskrit declensions

Page 43: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 45

Issues with Sanskrit

• Nouns have two or three stems – strong, middle and (optionally) weakest

• A different series of stem alternations cross-cuts this: guna, vrddhi, and zero:– “foot”: pād-, pad-, pd-

– strong stems may be guna or vrddhi

– middle stems may be zero, or a lexeme-specific stem

– weakest stems may be zero or lexeme-specific stem

Page 44: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 46

Sanskrit declensions

guna zero

Page 45: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 47

Sanskrit declensions

vrddhi lexeme-class particularlexeme-classparticular

Page 46: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 48

Further issues

• Stump argues for Indexing Autonomy Hypothesis:– A stem’s index is independent of the form used for the stem– Sanskrit nominal declensions are morphomic in Aronoff’s

sense

• Also involved are rules of referral whereby a particular form is systematically used to represent more than one slot in the paradigm.– For example, in Latin the ablative and dative plural in nominal

paradigms are identical no matter what form is used for the particular paradigm

• So we have several layers of complexity here, which would seem to make an “item-and-arrangement” approach impossible

Page 47: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 49

Computational analysis

Page 48: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 50

Refactoring

But this is just an item-and-arrangement analysis

Page 49: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 51

Summary

• Theoretical distinctions between different approaches to morphology seem to the issue of how cleanly one can describe a given phenomenon.

• But it is not clear that they relate to important differences in underlying mechanisms.

Page 50: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 52

Why morphological theory?

• Morphology has tended to develop highly articulated theories that are (often) intended to represent the morphological component of some putative ‘language faculty’.

• Need a set of mechanisms to account for complex morphological systems – e.g. Sanskrit.

• Need to account for observed universals– These might related to built-in predispositions, but equally

well might relate to historical change; cf. Blevins (2004)

• Linguistic phenomena are complex: how can children learn them?– Clearly relates to learning mechanisms

Page 51: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 53

Whither morphological theory?• Assumptions underlying linguistic theory have not

changed much in the last 50 years– Arguments against statistical learning methods are

based on antiquated notions of what statistical methods are capable of

• Meanwhile there have been significant advances in machine learning over the past 10-20 years.

• Some of this has made it into computational linguistics in the form of grammar induction methods (cf. Klein and Manning, 2004; Smith 2006)

Page 52: Computational Morphology and its Implications for Theoretical Morphology Richard Sproat University of Illinois at Urbana-Champaign PASCAL MorphoChallenge

Computational Morphology/Theoretical Morphology 54

Morphological theory redux

• Computational arguments (above) suggest there may not be as much difference between morphological theories as people like to think

• Recent work on induction of morphology suggests that we need to revisit our assumptions.

• Issues of the future will likely be:– What historical mechanisms explain the observed

patterns across the world’s languages?– What general learning mechanisms can account for

children’s learning of morphology?