lightweight multilingual entity extraction and linking slide

Speaker: Shih-Han Lo

Advisor: Professor Jia-Ling Koh

Authors: Aasish Pappu, Roi Blanco, Yashar Mehdad, Amanda Stent, Kapil Thadani

Date: 2017/09/19

Source: WSDM ’17


Lightweight Multilingual Entity Extraction and Linking

Outline


Introduction

Method

Experiment

Conclusion

Introduction


Key tasks for text analytic systems:

Named Entity Recognition (NER)

Named Entity Linking (NEL)

Some systems perform NER and NEL jointly.

Introduction


Most approaches involve (some of) the following steps:

Mention detection

Mention normalization

Candidate entity retrieval for each mention

Entity disambiguation for mentions with multiple candidate entities

Mention clustering for mentions that do not link to any entity

Motivation

Outline


Introduction

Method

Experiment

Conclusion

Mention Detection


Typically consists of running an NER system over input text.

We use simple CRFs and only a few lexical, syntactic and semantic features.

https://en.wikipedia.org/wiki/Conditional_random_field
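To make the "few features" idea concrete, here is a minimal sketch of per-token feature extraction of the kind a linear-chain CRF tagger consumes. The feature names and the helper functions `token_features` and `sentence_features` are hypothetical, and only lexical features are shown; the actual feature set used by the authors is not listed on the slide.

```python
def token_features(tokens, i):
    """Return a feature dict for tokens[i]; feature names are illustrative only."""
    word = tokens[i]
    feats = {
        "lower": word.lower(),        # normalized surface form
        "is_title": word.istitle(),   # capitalized like a proper noun
        "is_upper": word.isupper(),   # all caps, e.g. acronyms
        "is_digit": word.isdigit(),   # purely numeric token
        "suffix3": word[-3:],         # short suffix as a cheap morphology cue
    }
    # Neighboring words give the CRF local context.
    if i > 0:
        feats["prev_lower"] = tokens[i - 1].lower()
    else:
        feats["BOS"] = True           # beginning of sentence
    if i < len(tokens) - 1:
        feats["next_lower"] = tokens[i + 1].lower()
    else:
        feats["EOS"] = True           # end of sentence
    return feats


def sentence_features(tokens):
    """Feature dicts for every token in a sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


features = sentence_features(["Barack", "Obama", "visited", "Paris"])
```

A list of such dicts per sentence, paired with BIO-style labels, is the usual input shape for CRF toolkits; syntactic and semantic features would be added as further keys.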

System Description


Candidate Entity Retrieval


Entity Embeddings

We aim to simultaneously learn D-dimensional representations of entities (Ent) and words (W) in a common vector space.

The embedding model is trained with continuous skip-grams, using 300 dimensions and a window size of 10.

https://kknews.cc/zh-tw/news/3j6yj2g.html
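As an illustration of what a skip-gram window means, the sketch below generates (center, context) training pairs from a token sequence in which entity identifiers and ordinary words appear side by side. The `ENT:` prefix and the function `skipgram_pairs` are assumptions made for this example; the slide does not say how entities are mixed into the training text.

```python
def skipgram_pairs(sequence, window=10):
    """Return (center, context) pairs for every token within `window` positions."""
    pairs = []
    for i, center in enumerate(sequence):
        lo = max(0, i - window)
        hi = min(len(sequence), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a token is not its own context
                pairs.append((center, sequence[j]))
    return pairs


# Hypothetical sequence: entity tokens share the vocabulary with words,
# so both end up in the same vector space after training.
tokens = ["ENT:Paris", "is", "the", "capital", "of", "ENT:France"]
pairs = skipgram_pairs(tokens, window=2)  # small window for readability
```

With the window size of 10 quoted above, each token would be paired with up to 20 neighbors; a skip-gram model then learns vectors that predict those neighbors.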



Entity Embeddings



Fast Entity Linking

Fast Entity Linker (FEL) is an unsupervised approach.

FEL imposes contextual dependencies by calculating the cosine distance between two entities.

Candidates are generated from the substrings of the input string.
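The cosine distance mentioned above can be sketched in a few lines. This is a generic implementation over plain Python lists, not FEL's own code; the function names and the toy vectors are assumptions for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


def cosine_distance(u, v):
    """1 - cosine similarity: 0 for same direction, 2 for opposite."""
    return 1.0 - cosine_similarity(u, v)


# Toy 3-dimensional "entity embeddings" (made-up values).
paris = [0.9, 0.1, 0.0]
france = [0.8, 0.2, 0.1]
distance = cosine_distance(paris, france)
```

A small distance between two candidate entities suggests they fit the same context, which is the signal used to prefer mutually coherent candidates.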

Minimal perfect hash function

Elias-Fano integer coding

http://cmph.sourceforge.net/concepts.html

https://en.wikipedia.org/wiki/Shannon%E2%80%93Fano%E2%80%93Elias_coding
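For readers unfamiliar with Elias-Fano integer coding, the following is a minimal sketch of the scheme for a sorted list of non-negative integers: each value is split into low bits stored verbatim and high bits stored in unary. It uses Python lists instead of packed bit arrays, and the names `ef_encode` and `ef_decode` are hypothetical; the slide does not show how FEL applies the coding internally.

```python
def ef_encode(values):
    """Elias-Fano encode a sorted list of non-negative integers."""
    n = len(values)
    if n == 0:
        return {"n": 0, "l": 0, "low": [], "high": []}
    universe = values[-1] + 1
    # Number of low bits per value: floor(log2(universe / n)), at least 0.
    l = max(0, (universe // n).bit_length() - 1)
    mask = (1 << l) - 1
    low = [v & mask for v in values]
    # High parts in unary: the i-th value sets the bit at (v >> l) + i.
    high = [0] * (n + (values[-1] >> l) + 1)
    for i, v in enumerate(values):
        high[(v >> l) + i] = 1
    return {"n": n, "l": l, "low": low, "high": high}


def ef_decode(enc):
    """Recover the original sorted list from an Elias-Fano encoding."""
    values = []
    i = 0  # index of the next value to decode
    for pos, bit in enumerate(enc["high"]):
        if bit:
            high_part = pos - i  # number of zeros seen before this one-bit
            values.append((high_part << enc["l"]) | enc["low"][i])
            i += 1
    return values


encoded = ef_encode([3, 4, 7, 13, 14, 15, 21, 43])
decoded = ef_decode(encoded)
```

A production implementation would pack `low` and `high` into bit vectors and add a select structure for random access; the list-based form here only shows the layout.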

Entity Disambiguation


The task of determining which candidate entity a mention refers to.

The task is complex because a mention may refer to different entities depending on local context.



Forward-Backward Algorithm (FwBw)
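As background, here is a sketch of the textbook forward-backward algorithm applied to a chain of mentions, where each mention has a set of candidate entities. The emission scores, the `transition` function, and the entity names are made-up assumptions; the slide does not give the authors' scoring functions or say how λ enters them.

```python
def forward_backward(emissions, transition):
    """Posterior over candidates for each mention in a chain.

    emissions: list of dicts {candidate: positive score}, one per mention.
    transition(a, b): non-negative compatibility of adjacent candidates.
    """
    n = len(emissions)
    fwd = [dict() for _ in range(n)]
    bwd = [dict() for _ in range(n)]
    # Forward pass: accumulate scores from the left.
    for c, e in emissions[0].items():
        fwd[0][c] = e
    for t in range(1, n):
        for c, e in emissions[t].items():
            fwd[t][c] = e * sum(fwd[t - 1][p] * transition(p, c)
                                for p in emissions[t - 1])
    # Backward pass: accumulate scores from the right.
    for c in emissions[-1]:
        bwd[-1][c] = 1.0
    for t in range(n - 2, -1, -1):
        for c in emissions[t]:
            bwd[t][c] = sum(transition(c, nx) * emissions[t + 1][nx] * bwd[t + 1][nx]
                            for nx in emissions[t + 1])
    # Combine and normalize into per-mention posteriors.
    posteriors = []
    for t in range(n):
        unnorm = {c: fwd[t][c] * bwd[t][c] for c in emissions[t]}
        z = sum(unnorm.values())
        posteriors.append({c: v / z for c, v in unnorm.items()})
    return posteriors


# Hypothetical example: "Paris" is ambiguous, "France" is not.
related = {("Paris_France", "France")}
compat = lambda a, b: 0.9 if (a, b) in related else 0.1
posts = forward_backward(
    [{"Paris_France": 0.5, "Paris_Hilton": 0.5}, {"France": 1.0}],
    compat,
)
```

In this toy case the unambiguous neighbor pulls the ambiguous mention toward the compatible candidate, which is the effect a sequence model is meant to provide for disambiguation.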



Exemplar (Clustering)



Label Propagation (LabelProp)

Modified adsorption (MAD)

For , we inject seed labels L on a few nodes.

For nodes V’, we assign a label distribution:

Along with , MAD takes three hyper-parameters as input.

We pick the highest ranked label for each node in V as the final candidate.
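The seed-and-propagate idea can be sketched with a much simpler scheme than MAD: plain iterative label propagation with clamped seeds, where each unlabeled node repeatedly takes the weighted average of its neighbors' label distributions. This is explicitly not modified adsorption and has none of its three hyper-parameters; the graph, weights, and function name are assumptions for illustration.

```python
def propagate_labels(edges, seeds, iterations=20):
    """Simple label propagation (not MAD).

    edges: dict node -> {neighbor: weight}; every neighbor must also be a key.
    seeds: dict node -> label for the few nodes with injected labels.
    Returns the highest-ranked label for every node.
    """
    labels = sorted(set(seeds.values()))
    dist = {}
    for node in edges:
        if node in seeds:
            # Seed nodes start (and stay) fully committed to their label.
            dist[node] = {lab: 1.0 if lab == seeds[node] else 0.0 for lab in labels}
        else:
            # Unlabeled nodes start with a uniform distribution.
            dist[node] = {lab: 1.0 / len(labels) for lab in labels}
    for _ in range(iterations):
        new = {}
        for node, nbrs in edges.items():
            total = sum(nbrs.values())
            if node in seeds or total == 0:
                new[node] = dist[node]  # clamp seeds, skip isolated nodes
                continue
            new[node] = {
                lab: sum(w * dist[nb][lab] for nb, w in nbrs.items()) / total
                for lab in labels
            }
        dist = new
    return {node: max(d, key=d.get) for node, d in dist.items()}


# Hypothetical chain a - b - c - d with a weak link between b and c.
graph = {
    "a": {"b": 1.0},
    "b": {"a": 1.0, "c": 0.1},
    "c": {"b": 0.1, "d": 1.0},
    "d": {"c": 1.0},
}
result = propagate_labels(graph, {"a": "X", "d": "Y"})
```

The final step mirrors the slide: after propagation, the highest-ranked label at each node is taken as its candidate.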

Outline


Introduction

Method

Experiment

Conclusion

Experiment


Datasets:

Cross-lingual TAC KBP 2013

Mono-lingual AIDA-CoNLL 2003

Experiment


Setup

N-best: N = 10

FwBw: λ = 0.5

Exemplar: max_iterations = 300, λ = 0.5

LabelProp: μ1 = 1, μ2 = 1e-2, μ3 = 1e-2

Experiment


TAC KBP Evaluation Results

Experiment


Analysis

Experiment


Analysis

Experiment


AIDA Evaluation

Experiment


Runtime Performance

Outline


Introduction

Method

Experiment

Conclusion

Conclusion


Our NER implementation is outperformed only by NER systems that use much more complex feature engineering and/or modeling methods.

In future work, we plan to improve the performance of our system for other languages, by expanding the pool of entities for which we have information.

Candidate retrieval in Spanish is relatively poor compared to English and Chinese.
