Contextualization / Decontextualization (WP6 ForgetIT 1st Year Review)

Concise Preservation by combining Managed Forgetting and Contextualized Remembering

Upload: forgetit-project

Post on 08-Aug-2015



Mark A. Greenwood

The University of Sheffield

WP 6 Presentation

Contextualization / Decontextualization

ForgetIT 1st Review Meeting, April 29-30, 2014 Kaiserslautern, Germany

ForgetIT Project GA600826, 1st Review Meeting, Kaiserslautern, April 2014

WP Objectives

• Determine the context associated with an information object

• Extract an object from its natural context and store it with the required context in a Submission Information Package (SIP)

• Reintegrate a previously archived information object into active user

Focus of Year 1

• Review the current state-of-the-art

• Generate a formal, ForgetIT specific, model of contextualization

• Develop prototype contextualization components

Objectives of WP and Year 1 Focus


What is Context? Why Do We Need Context?

The new Quarry Hunslet arrived today.

Quarry Hunslet

Elidir was built in 1889 as works number 493 for the Dinorwic Quarry in Wales. It can now be seen at the Llanberis Lake Railway.


Formal Model of Contextualization

• Context, C(t): the information that surrounds one or more information pieces at time t and influences their interpretation

• Interpretation, I(i,C(t),B(t,u)): the interpretation of information piece i made with context C and with the background knowledge B of user u at time t

• Contextualization: for an information piece i, the process of providing additional context c+(i,t0) ⊆ C(t0), where I(i,C(t0),B(t0,u0)) is referred to as the intended interpretation

Contextualization Model


Operative model for Contextualization

• Information pieces: items to be contextualized, e.g. text documents, images, document collections

• Contextualization hooks: elements within the information piece, e.g. entities, topics, temporal references, other metadata

• Contextualization source: source of the information (made of contextualization units) exploited for contextualization, e.g. knowledge bases, document collections

• Contextualization method: it provides a set of contextualization units that help in understanding and interpreting the information piece
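The four elements of the operative model can be sketched as simple data structures. This is a minimal illustration only: the class names, the `lookup_method` function and the matching rule (exact match on a hook value) are assumptions made for the example and are not taken from the ForgetIT components.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Hook:
    """Contextualization hook: an element found inside the information piece."""
    kind: str   # e.g. "entity", "topic", "temporal"
    value: str


@dataclass
class InformationPiece:
    """Item to be contextualized (here: a short text)."""
    content: str
    hooks: List[Hook] = field(default_factory=list)


@dataclass
class ContextUnit:
    """One contextualization unit held by a contextualization source."""
    key: str
    text: str


def lookup_method(piece: InformationPiece, source: List[ContextUnit]) -> List[ContextUnit]:
    """Hypothetical contextualization method: return the units whose key
    matches one of the hooks of the piece (case-insensitive)."""
    wanted = {hook.value.lower() for hook in piece.hooks}
    return [unit for unit in source if unit.key.lower() in wanted]


# Toy source with two units; the texts are illustrative.
source = [
    ContextUnit("Quarry Hunslet", "Built in 1889 as works number 493 for the Dinorwic Quarry in Wales."),
    ContextUnit("Paris", "Illustrative unit that should not be selected."),
]
piece = InformationPiece(
    "The new Quarry Hunslet arrived today.",
    hooks=[Hook("entity", "Quarry Hunslet")],
)
selected = lookup_method(piece, source)
print([unit.key for unit in selected])
```

A real contextualization method would rank and filter units rather than match keys exactly; the sketch only shows how pieces, hooks, sources and methods relate to each other.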

Contextualization Model


Example Contextualization Source


Image Contextualization

Step 1: find similar collections

Step 2: add contextual information

[Figure, Step 1: a Seed Image Collection is compared with the Archived Image Collections by a Distance Calculation; the Similar collections provide Additional images.]
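Step 1 can be sketched as a nearest-neighbour search over collections. The slides do not say which image features or which distance are used, so the sketch below assumes that every image is already described by a numeric feature vector, summarises a collection by the mean of its vectors, and uses Euclidean distance. All names and numbers are hypothetical.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def centroid(vectors: Sequence[Vector]) -> List[float]:
    """Mean feature vector of a collection (assumed collection summary)."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def euclidean(a: Vector, b: Vector) -> float:
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def similar_collections(seed: Sequence[Vector],
                        archive: Dict[str, Sequence[Vector]],
                        k: int = 1) -> List[str]:
    """Return the names of the k archived collections closest to the seed."""
    seed_centre = centroid(seed)
    ranked = sorted(
        archive,
        key=lambda name: euclidean(seed_centre, centroid(archive[name])),
    )
    return ranked[:k]


# Toy two-dimensional features; real image descriptors would be much larger.
seed = [[0.9, 0.1], [0.8, 0.2]]
archive = {
    "railway_trip": [[0.85, 0.15], [0.95, 0.05]],
    "beach_holiday": [[0.1, 0.9], [0.2, 0.8]],
}
nearest = similar_collections(seed, archive, k=1)
print(nearest)
```

The images of the returned collections would then be the candidates for Step 2, where contextual information is added to the seed collection.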


Text Contextualization via Disambiguation

I might visit Paris while travelling through Texas to Oklahoma.

While information extraction would stop at this point, we collect the surrounding context from the ontology which we store along with the item being archived.

• Currently we store all concepts and relations within 3 relations of the disambiguated concepts
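Collecting everything within 3 relations of a disambiguated concept amounts to a bounded breadth-first traversal of the ontology graph. The sketch below assumes the ontology is available as (subject, relation, object) triples and that relations are followed in both directions; the identifiers in the toy ontology and the function name are hypothetical.

```python
from collections import deque
from typing import Dict, Iterable, List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)


def surrounding_context(triples: List[Triple],
                        seeds: Iterable[str],
                        max_hops: int = 3) -> Tuple[Set[str], Set[Triple]]:
    """Return the concepts and relations reachable within max_hops of the seeds."""
    # Index every triple under both of its end points (undirected traversal).
    neighbours: Dict[str, List[Tuple[Triple, str]]] = {}
    for s, p, o in triples:
        neighbours.setdefault(s, []).append(((s, p, o), o))
        neighbours.setdefault(o, []).append(((s, p, o), s))

    depth = {seed: 0 for seed in seeds}
    queue = deque(depth)
    kept: Set[Triple] = set()
    while queue:
        node = queue.popleft()
        if depth[node] == max_hops:
            continue  # do not expand beyond the hop limit
        for triple, other in neighbours.get(node, []):
            kept.add(triple)
            if other not in depth:
                depth[other] = depth[node] + 1
                queue.append(other)
    return set(depth), kept


# Toy ontology; "Paris_Texas" stands for the disambiguated mention of Paris.
triples = [
    ("Paris_Texas", "locatedIn", "Texas"),
    ("Texas", "partOf", "USA"),
    ("USA", "partOf", "North_America"),
    ("North_America", "partOf", "Earth"),
    ("Paris_France", "locatedIn", "France"),
]
concepts, relations = surrounding_context(triples, ["Paris_Texas"], max_hops=3)
print(sorted(concepts))
```

In this toy graph the traversal stops at "North_America", three relations away from the seed, so "Earth" and the unrelated "Paris_France" are not stored with the archived item.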


Text Re-Contextualization

[Figure: text re-contextualization pipeline. Labels recoverable from the diagram:

• KB annotation: Wikipedia pages, Annotated sentences, Sentence index

• Article annotation: Document, Annotated document

• Annotation steps (shown for both KB annotation and Article annotation): Sentence Extraction, Named Entity Recognition, Temporal Expression Extraction, Topic Extraction, Term Extraction

• Retrieval: Query Formulation, Queries, Sentence Ranking, Sentences

• Output: Contextualizing Sentences, Contextualized document]
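The query formulation and sentence ranking stages can be illustrated with a deliberately simple stand-in. The slides do not describe how queries are built or how sentences are scored, so the sketch assumes that a query is the set of all annotations of the document and that an indexed sentence is scored by how many annotations it shares with the query. The function names, the index layout and the placeholder sentences are all hypothetical.

```python
from typing import Dict, List, Set


def formulate_query(annotations: Dict[str, Set[str]]) -> Set[str]:
    """Assumed query formulation: the union of all annotation values, lowercased."""
    return {value.lower() for values in annotations.values() for value in values}


def rank_sentences(query: Set[str], index: List[dict], top_k: int = 2) -> List[str]:
    """Assumed ranking: order indexed sentences by annotation overlap with the query."""
    def score(entry: dict) -> int:
        return len(query & {a.lower() for a in entry["annotations"]})

    ranked = sorted(index, key=score, reverse=True)
    return [entry["text"] for entry in ranked[:top_k] if score(entry) > 0]


# Annotations of the document to be re-contextualized (illustrative values).
document_annotations = {
    "entities": {"Quarry Hunslet"},
    "terms": {"locomotive"},
}

# Toy sentence index; in the pipeline it would be built from annotated Wikipedia pages.
sentence_index = [
    {"text": "Placeholder sentence about Paris.",
     "annotations": {"Paris"}},
    {"text": "Placeholder sentence about a Quarry Hunslet locomotive.",
     "annotations": {"Quarry Hunslet", "locomotive"}},
    {"text": "Placeholder sentence about locomotives in general.",
     "annotations": {"locomotive"}},
]

query = formulate_query(document_annotations)
top = rank_sentences(query, sentence_index, top_k=2)
print(top)
```

The selected sentences play the role of the contextualizing sentences attached to the document; a time-aware ranking, as suggested by the temporal expression extraction step, would need a scoring function that also takes dates into account.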


Role in Preserve-or-Forget Architecture


Andrea Ceroni, Nam Khanh Tran, Nattiya Kanhabua and Claudia Niederée, Bridging Temporal Context Gaps using Time-Aware Re-Contextualization, (to appear) Proceedings of the 37th Annual ACM SIGIR Conference (SIGIR'2014), Gold Coast, Australia, July 2014.

Publications

Thank you for your attention!