natural language processing >> syntax · syntax - from transformation to unification 2...

32
Natural Language Processing >> Syntax << >> from transformation to unification << winter / fall 2011/2012 41.4268 Prof. Dr. Bettina Harriehausen-Mühlbauer Univ. of Applied Sciences, Darmstadt, Germany https://www.fbi.h-da.de/organisation/personen/harriehausen- muehlbauer-bettina.html [email protected]

Upload: dinhduong

Post on 25-Jul-2018

233 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Natural Language Processing>> Syntax <<

>> from transformation to unification <<

winter / fall 2011/201241.4268

Prof. Dr. Bettina Harriehausen-MühlbauerUniv. of Applied Sciences, Darmstadt, Germanyhttps://www.fbi.h-da.de/organisation/personen/harriehausen-muehlbauer-bettina.html

[email protected]

Page 2: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 2

content

1 What is syntax ?

2 Grammar theories and formalisms• Dependency Grammar• Transformational Grammar• Phrase Structure Grammar• Case Grammar• Unification Based Grammar

Page 3: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 3

What is Syntax?

Syntax deals with:

• the analysis of NLP input on sentence level

• the generation of NLP output on sentence level

• structural descriptions on sentence level, mostly

• in form of PS-(phrase structure) trees and/or

• unification-based formalisms

• structural rules on sentence level (this can vaguely becompared to how „grammar“ of a language is traditionally taught)

Syntax - introduction

Page 4: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 4

Acronyms used in structural descriptions of naturallanguage („vocabulary“) = the auxiliary dictionary for thenode descriptions:

S =sentence/clauseN =(a single) nounNP =noun phraseV =verbVP =verb phraseAUX =auxiliary verbAJ/ADJ=adjectiveADJP =adjective phraseADV =adverb

ADVP =adverb phraseDET =determiner

CONJ =conjunctionCOMP =complementizerPRO =pro-constituentPUNC =punctuation

Syntax - introduction

Page 5: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 5

Examples for the node names:

S =sentence/clause „Does the dog chase the cat?“N =(a single) noun „dog“NP =noun phrase „the old dog“V =verb „chase“VP =verb phrase „chase the cat“AUX =auxiliary verb „does“AJ/ADJ=adjective „old“ADJP =adjective phrase „old and gray“ADV =adverb „happily“ADVP =adverb phrase „once upon a time“DET =determiner „the“

Syntax - introduction

Page 6: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 6

CONJ = conjunction „and“COMP = complementizer „what“PRO = pro-constituent „he“PUNC = punctuation „?“

Syntax - introductionExamples for the node names:

Page 7: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 7

Genealogy of grammar theories and formalisms (and their influence on AI)

1. Dependency grammar (Tesnière 1953, 1959):The finite verb is the focal point of the sentential analysis, i.e. thevalency of the verb determines the structure of the sentence.

e.g. transitive verbs: valency of 2: Bob loves Mary.

intransitive verbs: valency of 1. The cat sleeps.

1. Dependency Grammar2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Grammar theories and formalisms

Page 8: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 8

Genealogy of grammar theories and formalisms (and their influence on AI)

2. Generative transformational grammar (TG) -> X-bar -> GB(Chomsky 1957, 1959)

The attempt to construct a formal model of the language competenceof an ideal speaker-hearer:

„I understand a generative grammar to simply be a rule system, whichassigns structural descriptions to sentences in an explicit and well defined manner“ (Chomsky 1965)

= formalization of natural language

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 9: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 9

Genealogy of grammar theories and formalisms (and their influence on AI)

deep structure <-> surface structure

transformation rules

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Grammar theories and formalisms

Page 10: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 10

surface structureThe dog barks at the cat.

The cat is barked at by the dog.

active

passive

dog = actively barking animal

cat= passively barked at animal

bark at = action

deep structure

Transformations

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 11: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 11

Deletion: A + B => B

C C

=>

A B B

Example: He didn‘t know that he should read the book.

He didn‘t know --- he should read the book.

Examples for transformations

Page 12: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 12

Permutation: X + Y + Z => X + Z + Y

A A

=>

X Y Z X Z Y

Examples for transformations

Page 13: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 13

Sample of a permutation rule:NP + VP + ADV => ADV + NP + VP

S => S

NP VP Adv Adv NP VP

Det N V NP Det N V NP

Det N Det N

a girl cut a flower yesterday. Yesterday a girl cut a flower.

Examples for transformations

Page 14: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 14

Substitution / Replacement: X + B => Y +B + X

A A

=>

X B Y B X

Example: (in combination with permutation)

That it is raining is too bad.

It is too bad that it is raining.

Examples for transformations

Page 15: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 15

Adjunction: B => A + B

C

B

initial structurebefore adjunction

C

A B

sister-adjoin Ato the left of B

Examples for transformations

Page 16: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 16

Adjunction: Example: I don‘t know what he will do.

S

NP AUX VP

PRO don‘t V S‘

I know COMP S

+WH NP AUX VP

e PRO will V NP

he do PRO

what

Tree Structure of sentence

Examples for transformations

Page 17: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 17

Adjunction: Example: I don‘t know what he will do.

S

NP AUX VP

PRO don‘t V S‘

I know COMP S

NP +WH NP AUX VP

what e PRO will V NP

he do what

Sister adjunction of wh-phrase to COMP

Examples for transformations

Page 18: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 18

Binary rules

Top-down:

S -> VP + PUNCVP -> NP + VPNP -> DET + NPNP-> ADJ + NPVP -> V + NP...

Bottom-up:

DET + NP -> NPADJ + NP -> NPV + NP -> VPNP + VP -> VPVP + PUNC -> S...

Parser

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 19: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 19

Binary rules

Top-down:

S -> VP + PUNCVP -> NP + VPNP -> DET + NPNP-> NVP -> VP + NPVP-> V...

S ->The dog chases the cat (VP) + . (PUNC)VP-> the dog (NP) + chases the cat (VP)NP-> the(DET) + dog (NP)NP-> dog (N)VP-> chases (VP) + the cat (NP)VP-> chases (V)NP-> the (DET) + cat (NP)NP-> cat (N)

The dog chases the cat.

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Grammar theories and formalisms

Page 20: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 20

Binary rules

Bottom-up:

DET + NP -> NPN-> NPVP + NP -> VPV->VPNP + VP -> VPVP + PUNC -> S...

The dog chases the cat.

dog (N) -> NPthe (DET) + dog (NP) -> NPchases (V)-> VPcat (N)-> NPthe (DET) + cat (NP) -> NPchases (VP) + the cat (NP) -> VPthe dog (NP) + chases the cat (VP) -> VPthe dog chases the cat (VP) + .(PUNC) -> S

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 21: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 21

constituent structure / PS-tree:

A girl handed a baby a toy. S -> NP + VP

S(=VP) NP->Det + N

VP-> V + NP + NP

NP VP better: VP-> V + NP (recursive!)

Det N VP NP

VP NP Det N

V Det N

a girl handed a baby a toy

Page 22: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 22

Why binary rules ?Claim: „ the set of well-formed sentences in any language is infinite“

(Chomsky)Claim: „ there is no (theoretical) upper limit on the length of

sentences in any language“ (though there are of courseperformance limitations)

Example 1: We can have indefinitely many attributive adjectivesqualifying a noun in English:

John is a handsome man.John is a dark, handsome man.John is a tall, dark, handsome man.John is a sensitive, tall, dark, handsome man.John is an intelligent, sensitive, tall, dark, handsome man.etc.

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 23: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 23

Why binary rules ?

Example 2: There is in principle no upper limit to the number of quantifying expressions we can use to modify an adjective in English:

Debbie Harry is very attractive.Debbie Harry is very, very attractive. Debbie Harry is very, very, very attractive.Debbie Harry is very , very, very, very attractive.Debbie Harry is very, very , very, very, very attractive.Debbie Harry is very, very, very , very, very, very attractive. etc.

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 24: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 24

Why binary rules ?

Example 3: There isn‘t any limit on the number of times that we canuse one clause as the complement of another in English:

John said that Mary was ill.Fred said that John said that Mary was ill.Harry said that Fred said that John said that Mary was ill. etc.

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Grammar theories and formalisms

Page 25: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 25

Why binary rules ?

Example 4: There is no limit on the number of phrases that we canconjoin together by and (or or) in English:

I met Debbie and Harry.I met Debbie, Noam, and Harry.I met Debbie, Noam, the dustman, and Harry.I met Debbie, Noam, the dustman, the president, and Harry. etc.

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 26: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 26

Why binary rules ?

Example 5: There is no upper limit on the number of relative clausesa sentence can contain in English:

I chased the dog.I chased the dog that chased the cat.I chased the dog that chased the cat that chased the rat.I chased the dog that chased the cat that chased the rat that chasedthe mouse. etc.

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Grammar theories and formalisms

Page 27: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 27

Why binary rules ?

In order to reach the goal of writing a set of finite rules to describean infinite set of well-formed sentences, we need to think „binary“ in order to be able to program rules efficiently – otherwise we wouldhave an infinite set of rules (...and our task of writing a grammarwould be an endless story...)

That is why „binary rules“ !

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 28: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 28

References

Jensen, Karen. 1987. Binary rules and non-binary trees: Breaking down theconcept of phrase structure. In Mathematics of language, ed.Alexis Manaster-Ramer, pages 65-86. Amsterdam, John Benjamins Publishing Company

Jensen, Karen. Issues in Parsing.

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 29: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 29

Genealogy of grammar theories and formalisms (and their influence on AI)

3. Phrase structure grammar (PS-grammar)

= a finite set of PS-rules

A PS-grammar consists of:

• an auxiliary dictionary (the node names)

• the final vocabulary (the lexicon)

• a start node (= S)

• a set of PS-rules

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Grammar theories and formalisms

Page 30: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 30

Genealogy of grammar theories and formalisms (and their influence on AI)

3. Phrase structure grammar

contextcontext sensitivesensitive contextcontext freefree

X -> Y/w_z X -> Y

V_trans/_obj S -> NP + VP

VP NP -> Det + NP

V_intrans/_# VP -> V_trans + NP

VP -> V_intrans

Grammar theories and formalisms

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar

4. Case Grammar

5. Unification Based Grammar

Page 31: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 31

(a) PS-Rules

S -> NP + VP

NP -> PRO RULESVP -> V + (S)

(b) Lexical Insertion Rules

PRO -> I, she INTERFACE :V -> wonder, snores RULES-LEXICONCOMP -> whether

(c) I -> (PRO, +1sg, +human, +animate,...)

she -> (PRO, +3sg, +human, +animate,...) LEXICONwonder -> (VERB, +1sg, -3sg, +wh-clause, +cognitive,...) snores -> (VERB, +3sg, +intrans, +active, +animate,...)

................................................................................................

Lexicon: parts of speech, subcategorisational features (morpho-syntactic), selectional features (semantic)

1. Dependency Grammar

2. Transformational Grammar

3. Phrase Structure Grammar4. Case Grammar

5. Unification Based Grammar

Page 32: Natural Language Processing >> Syntax · Syntax - from transformation to unification 2 content 1 What is syntax ? 2 Grammar theories and formalisms • Dependency Grammar • Transformational

Syntax - from transformation to unification 32