improving dependency parsing using sentence clause...

61
Vincent Kríž Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays, 10.10.2016 MFF UK Institute of Formal and Applied Linguistics Faculty of Mathematics and Physics Charles University in Prague Czech Republic [email protected]ff.cuni.cz http://ufal.mff.cuni.cz/vincent-kriz

Upload: others

Post on 26-Jan-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Vincent Kríž

Improving Dependency Parsing Using Sentence Clause Charts

Linguistic Mondays, 10.10.2016MFF UK

Institute of Formal and Applied LinguisticsFaculty of Mathematics and PhysicsCharles University in PragueCzech Republic

[email protected]://ufal.mff.cuni.cz/vincent-kriz

Page 2: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Motivation

● large collections of documents● efficient browsing & querying● typical approaches

– full-text search– meta-data search

no semantics

Page 3: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

INTLIB

● Intelligent Library (INTLIB)– founded by

– 2012–2015– partners

Page 4: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

INTLIB

● New search approach– semantic interpretation of documents– suitable DB & query language– user-friendly browsing & querying

Page 5: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

INTLIB

● New search approach– semantic interpretation of documents– suitable DB & query language– user-friendly browsing & querying

● Knowledge base– set of entities and relations between them

Page 6: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

INTLIB

● New search approach– semantic interpretation of documents– suitable DB & query language– user-friendly browsing & querying

● Knowledge base– set of entities and relations between them

● RExtractor– information extraction system

Page 7: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

RExtractor

● entity and relation extraction from plain-texts

● server architecture– process client's requests– REST API– web interface (~ demo)

http://quest.ms.mff.cuni.cz:14280

Page 8: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

RExtractor

● extract entities and relations– queries over dependency trees– domain and language independent

● real use-case defined by INTLIB– definitions, rights and obligations in Czech laws– Czech extraction strategy

Page 9: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

RExtractor

● extract entities and relations– queries over dependency trees– domain and language independent

● real use-case defined by INTLIB– definitions, rights and obligations in Czech laws– Czech extraction strategy– English extraction strategy

Page 10: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Evaluation

Czech Legal Text Treebank 1.0 (CLTT)– Accounting Act (563/1991 Coll.)– Decree on Double-entry Accounting for

undertakers (500/2002 Coll.)– automatically parsed, then manually checked

● 1,133 manually annotated dependency trees● 35,085 tokens

Czech LegalText Treebank

Page 11: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Evaluation

Czech Legal Text Treebank 1.0 (CLTT)– Kríž Vincent, Hladká Barbora, Urešová Zdeňka: Czech Legal

Text Treebank 1.0. In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), Copyright © European Language Resources Association, Paris, France, ISBN 978-2-9517408-9-1, pp. 2387-2392, 2016

Page 12: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Evaluation

Error analysis

Error # of errors Ratio

Parser 145 59.7%

Query 93 38.3%

Entity 5 2.1%

Error # of errors Ratio

Parser 145 59.7%

Query 93 38.3%

Entity 5 2.1% Parser60%

Query38%

Entity2%

Page 13: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Baseline

Automatic parser for Czech– MST parser

● Ryan McDonald, Fernando Pereira, Kiril Ribarov, Jan Hajič (2005): Non-projective Dependency Parsing using Spanning Tree Algorithms. In: Proceedings of HLT/EMNLP, Vancouver, British Columbia.

– trained on newspaper texts– long sentences still problematic

Page 14: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Baseline

Long sentences dependency parsing– as the sentence length increases,

the unlabeled attachment score (UAS) decreases

1-10 11-20 21-30 31-40 41-50 51+70.00%

75.00%

80.00%

85.00%

90.00%

95.00%

PDT dtest

PDT etest

CAC

Sentence length

UAS

Page 15: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● segmentation of complex sentences– Kuboň (2001), Kuboň et al. (2007)– segments – easily detectable and linguistically

motivated units– may be combined into clauses– provide a structure of a complex sentence with

regard to the mutual relationship of individual clauses

Page 16: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● segmentation of complex sentences– Lopatková and Holan (2009)– a new module between morphological and

syntactic analysis– determine the overall sentence structure– segmentation chart

● relationship among segments● especially relations of coordination, apposition and

subordination

Page 17: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

S tím byly trochu problémy, protože starosta v řeči rád zdůrazňoval své vzdělání.

Credits: Lopatková and Holan (2009)

Page 18: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

S tím byly trochu problémy, protože starosta v řeči rád zdůrazňoval své vzdělání.

● split sentence into segments– rule-based boundaries identification

● punctuation marks, coordinating conjunctions, brackets, …

Credits: Lopatková and Holan (2009)

Page 19: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● determine mutual relations– manually designed rules

● finite verb● subordinating expression● opening bracket

Credits: Lopatková and Holan (2009)

S tím byly trochu problémy , protože starosta … vzdělání .

Page 20: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

Credits: Lopatková and Holan (2009)

S tím byly trochu problémy

protože starosta … vzdělání

Page 21: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● segmentation chart– captures the layer of embedding for individual segments

Credits: Lopatková and Holan (2009)

S tím byly trochu problémy

protože starosta … vzdělání

Page 22: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● segmentation chart principles– main segments belong to layer 0

– segments that depend on segment on layer k belong to k+1

– coordinated segments have the same layer

– segments in parenthesis/brackets belong to k+1 layer

Credits: Lopatková and Holan (2009)

S tím byly trochu problémy

protože starosta … vzdělání

0 1

Page 23: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● sentence clause structure– Lopatková et al. (2012)– manual clause structure annotation based on the

concept of segments– 2,699 annotated sentences

Page 24: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● sentence clause structure– Krůza and Kuboň (2014)– automatic procedure for recognizing clauses and

their mutual relationship from plain-texts

– Bejček et al. (2013)– automatic procedure for recognizing clauses and

their mutual relationship from dependency trees

– used for clause annotation in PDT 3.0

Page 25: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Related Work

● clause annotation in PDT 3.0

Page 26: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

● analogous to a segmentation chart– Lopatková and Holan (2009)– two differences

● subordinating conjunctions at the beginning of each clause are considered as boundaries

● clauses split into two parts (by an embedded clause) are considered as two different clauses

Page 27: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

● example

While failure is usually an orphan, the success tends to have many fathers, claiming eagerly that particularly they were present at its conception.

failure is usually an orphan

the success tends to have many fathers

claiming eagerly

particularly they were present at its conception

while , ,

that

B 1 B 0 B 1 B 2

1

0

1

2

2

3

4

Credits: Kuboň et al. (2007)

Page 28: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

● generating clause charts– from dependency trees with the clause

annotation– a layer of embedding number of different →

clauses on the path from the clause to the root in the dependency tree

Page 29: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo

rozdělit molekuly dekaboranu na části,

nepodařilo se zmíněným vědcům

zatím určit, jaké produkty přitom

vznikly.

Page 30: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo rozdělit molekuly

dekaboranu na části, nepodařilo se zmíněným vědcům zatím určit, jaké produkty přitom vznikly.

0

1

2

Page 31: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo rozdělit molekuly

dekaboranu na části, nepodařilo se zmíněným vědcům zatím určit, jaké produkty přitom vznikly.

1

0

1

2

Page 32: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo rozdělit molekuly

dekaboranu na části, nepodařilo se zmíněným vědcům zatím určit, jaké produkty přitom vznikly.

1

0

1

2 2

Page 33: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo rozdělit molekuly

dekaboranu na části, nepodařilo se zmíněným vědcům zatím určit, jaké produkty přitom vznikly.

1

0

1

2 2

3

Page 34: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo rozdělit molekuly

dekaboranu na části, nepodařilo se zmíněným vědcům zatím určit, jaké produkty přitom vznikly.

1

0

1

2 2

3

4

Page 35: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo rozdělit molekuly

dekaboranu na části, nepodařilo se zmíněným vědcům zatím určit, jaké produkty přitom vznikly.

1

0

1

2 2

3

4

Page 36: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

Ačkoliv jsou si jisti, že se jim podařilo rozdělit molekuly

dekaboranu na části, nepodařilo se zmíněným vědcům zatím určit, jaké produkty přitom vznikly.

1

0

1

2 2

3

4

B 1 B 2 B 0 B 1 B

Page 37: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

● exploring clause charts

Page 38: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

● exploring clause charts

0 0B1 0B0 0B1B0 0B1B270

75

80

85

90

95

100

0

10

20

30

40

50

60

PDT train PDT dtest PDT etest CAC 2.0 Rel. freq.

UA

S

Page 39: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Charts

● exploring clause charts– sentence with 36 clauses– sentence with 7 layers of embedding

● 0B1B2B3B4B5B6

Page 40: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● new method for dependency parsing– exploit an existing dependency parser

● trained on complete sentences

– exploit gold-standard clause charts– Kríž Vincent, Hladká Barbora: Improving Dependency Parsing Using

Sentence Clause Charts. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics – Student Research Workshop

● two specific strategies– parsing coordinated clauses– parsing subordinated clauses

Page 41: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing coordinated clauses– let's explore the most

simple sentences with coordinated clauses – 0B0

– how good is the full-scale parser on individual clauses from 0B0?

Full-scale Clauses76

78

80

82

84

86

88

PDT dtest PDT etest CAC 2.0

UA

S

Page 42: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing coordinated clauses– let's explore the most

simple sentences with coordinated clauses – 0B0

– how good is the full-scale parser on individual clauses from 0B0?

Full-scale Clauses76

78

80

82

84

86

88

PDT dtest PDT etest CAC 2.0

UA

S

+4% of UAS in average

Page 43: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing coordinated clauses

– C1, C2, …, Cn ● neighboring coordinated clauses● on the same layer

– parse Ci individuallyobtain dependency tree Ti with root node ri

– create a sequence of tokensS = r1 B1,2 r2 B2,3 … rn

– parse S, obtain Ts

– build a final dependency tree using Ti and Ts

Page 44: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing coordinated clauses

– John loves Mary and Linda hates Peter.

– C1 = {John loves Mary}, C2 = {Linda hates Peter}– parse individual clauses

● C1 → T1, r1 = loves● C2 → T2, r2 = hates

– create a sequence of tokensS = {loves and hates}

– parse S → Ts

– build a final dependency tree

Page 45: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing coordinated clauses

Full-scale Clauses CCP76

78

80

82

84

86

88

PDT dtest PDT etest CAC 2.0

Page 46: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing coordinated clauses

Full-scale Clauses CCP76

78

80

82

84

86

88

PDT dtest PDT etest CAC 2.0

+1.4% of UAS in average

Page 47: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing subordinated clauses– exploring 0B1 sentences– almost no improvement

when parse individual clauses

– UAS is significantly higher then overall UAS Full-scale Clauses

84

84.5

85

85.5

86

86.5

87

87.5

88

88.5

89

PDT dtest PDT etest CAC 2.0U

AS

Page 48: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing subordinated clauses– exploring 0B1 sentences– almost no improvement

when parse individual clauses

– UAS is significantly higher then overall UAS Full-scale Clauses

84

84.5

85

85.5

86

86.5

87

87.5

88

88.5

89

PDT dtest PDT etest CAC 2.0U

AS

0 0B1 0B0 0B1B0 0B1B270

75

80

85

90

95

100

0

10

20

30

40

50

60

PDT train PDT dtest PDT etest CAC 2.0 Rel. freq.

UA

S

Page 49: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing subordinated clauses

– C1, C2, …, Cn ● the longest sequence of neighboring subordinated

clauses● layer(Ci+1) = layer(Ci) + 1

– create a sequence of tokensS = C1 B1,2 C2 B2,3 … Cn

– parse S, obtain Ts

Page 50: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing subordinated clauses

– evaluation on 0B1B0 sentences

● parse 0B1● parse 0B0

Full-scale CCP77

78

79

80

81

82

83

84

85

PDT dtest PDT etest CAC 2.0

Page 51: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● parsing subordinated clauses

– evaluation on 0B1B0 sentences

● parse 0B1● parse 0B0

Full-scale CCP77

78

79

80

81

82

83

84

85

PDT dtest PDT etest CAC 2.0

+1.6% of UAS in average

Page 52: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● CCP as full-scale parsing– work in cycles– check the deepest layer

● if there are coordinated clauses apply 0B0 strategy→● otherwise identify the longest sequence of

subordinated clauses apply 0B1 strategy→

– use standard full-scale parsing as a fall-back

Page 53: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● final evaluation (excl. 0 sentences)

Full-scale CCP80.5

81

81.5

82

82.5

83

83.5

84

84.5

85

PDT dtest PDT etest CAC 2.0

Page 54: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● final evaluation (excl. 0 sentences)

Full-scale CCP80.5

81

81.5

82

82.5

83

83.5

84

84.5

85

PDT dtest PDT etest CAC 2.0

+1.0% of UAS in average

Page 55: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● final evaluation

Full-scale CCP81.5

82

82.5

83

83.5

84

84.5

85

85.5

PDT dtest PDT etest CAC 2.0

Page 56: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Clause Chart Parsing

● final evaluation

Full-scale CCP81.5

82

82.5

83

83.5

84

84.5

85

85.5

PDT dtest PDT etest CAC 2.0

+0.7% of UAS in average

Page 57: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Future Work

● evaluation– Czech Legal Text Treebank 1.0– relation extraction in RExtractor

● clause charts– extraction from plain-text

● special parsers– train on individual clauses

Page 58: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Conclusion

● sentence clause structure helps with dependency parsing

● 1% increase of UAS on complex sentences

Page 59: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Conclusion

● sentence clause structure helps with dependency parsing

● 1% increase of UAS on complex sentences

in the real parsing task, automatically detected clause structures must be used, not gold-standard

we can train specialized clause-parsers – for main clauses, subordinated clauses, merge clauses, …

we can find out better strategies for parsing sequences of subordinated clauses

Page 60: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Conclusion

● sentence clause structure helps with dependency parsing

● 1% increase of UAS on complex sentences

in the real parsing task, automatically detected clause structures must be used, not gold-standard

we can train specialized clause-parsers – for main clauses, subordinated clauses, merge clauses, …

we can find out better strategies for parsing sequences of subordinated clauses

Page 61: Improving Dependency Parsing Using Sentence Clause Chartsufal.mff.cuni.cz/~kriz/talks/presentation_2016-10-10.pdf · Kríž: Improving Dependency Parsing Using Sentence Clause Charts

Kríž: Improving Dependency Parsing Using Sentence Clause Charts Linguistic Mondays

Conclusion

● sentence clause structure helps with dependency parsing

● 1% increase of UAS on complex sentences

in the real parsing task, automatically detected clause structures must be used, not gold-standard

we can train specialized clause-parsers – for main clauses, subordinated clauses, merge clauses, …

we can find out better strategies for parsing sequences of subordinated clauses