algorithms for nlpdemo.clab.cs.cmu.edu/algo4nlp20/slides/sp20 iitp lecture 10 -- hm… ·...
TRANSCRIPT
1
Yulia Tsvetkov
Algorithms for NLP
IITP, Spring 2020
HMMs, POS tagging, NER
2
▪ POS tagging recap▪ HMMs, Viterbi ▪ HMMs+
▪ dealing with UNKs▪ 3gram HMMs▪ multilingual POS tagging
▪ Featurizing HMMs▪ MEMM, CRF
▪ NER ▪ HMMs is speech recognition
Plan
https://universaldependencies.org
●
▪▪
▪▪ → →
▪▪▪
15
Levels of linguistic knowledge
Slide credit: Noah Smith
16
▪ map a sequence of words to a sequence of labels
▪ Part-of-speech tagging (Church, 1988; Brants, 2000) ▪ Named entity recognition (Bikel et al., 1999)▪ Text chunking and shallow parsing (Ramshaw and Marcus,
1995) ▪ Word alignment of parallel text (Vogel et al., 1996) ▪ Compression (Conroy and O’Leary, 2001) ▪ Acoustic models, discourse segmentation, etc.
Sequence Labeling
17
Sequence labeling as classification
the future is independent of the past given the present
the future is independent of the past given the present
o1 o2 on
▪
▪
▪
...
▪
▪▪
▪
▪▪
▪
▪▪▪
▪▪ →▪ →▪ →▪ →▪ →▪ →
▪▪ →
▪▪▪
▪▪▪▪
▪
▪
▪
▪
▪
▪▪
▪
▪▪
▪▪▪
▪▪
▪
▪
▪
▪ ⇒▪
▪
▪
▪
▪▪
▪
▪▪
▪▪
ssssssssppppeeeeeeetshshshshllllaeaeaebbbbb
“speech lab”