semantic assistants wiki (saw) in the context of the etc project

Post on 14-Dec-2015


Semantic Assistants Wiki (SAW)

In the Context of the ETC Project

SAW Intro

• Wiki
  – Users collaboratively develop and organize content
  – Information analysis is up to the user
• Goal:
  – “Self-aware wiki that can develop and organize its content”
  – Support users in information analysis
• Requires NLP to handle the majority of content
• Semantic Assistants Wiki: integration of NLP with wikis

SAW in Action

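[Figure: components, left to right: Wiki, Wiki-NLP Integration, Semantic Assistants, GATE NLP Pipeline (Named Entity Recognition). The text of the page wiki.org/Mary, “… Mary won the first prize ...”, is passed along the chain, the components exchange XML, and the page is updated to “… [[hasType::Person|Mary]] won …”.]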
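The write-back step in the figure can be sketched as follows. This is a minimal illustration, not the Semantic Assistants implementation: the `Entity` record and the `annotate` function are hypothetical names, and the character offsets stand in for whatever the NLP pipeline actually returns.

```python
from dataclasses import dataclass


@dataclass
class Entity:
    """Hypothetical NER result: a character span plus an entity type."""
    start: int   # offset where the mention begins
    end: int     # offset just past the mention
    label: str   # entity type, e.g. "Person"


def annotate(text: str, entities: list[Entity]) -> str:
    """Wrap each mention as [[hasType::<label>|<mention>]]."""
    out = text
    # Process right to left so earlier offsets stay valid after each edit.
    for e in sorted(entities, key=lambda e: e.start, reverse=True):
        mention = out[e.start:e.end]
        out = f"{out[:e.start]}[[hasType::{e.label}|{mention}]]{out[e.end:]}"
    return out


print(annotate("Mary won the first prize.", [Entity(0, 4, "Person")]))
# prints: [[hasType::Person|Mary]] won the first prize.
```

Rewriting from the end of the text backwards avoids recomputing offsets when several entities occur on the same page.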

Example: Biomedical Literature Curation

• GenWiki: filled with full-text research papers
• Entity recognition: enzymes, organisms
• Additional semantic information
  – Systematic name
  – Link to enzyme database entry
• Wiki is enriched with additional data, e.g. using Semantic MediaWiki markup such as [[hasType::Enzyme]]
• Time required to curate research papers is reduced

Example: Wiktionary

• Automatically populate the wiki using computational linguistics

• Cross-link between different language entries
• Manual work can be reduced

SAW for ETC

• Charaparser

[Figure: components, left to right: Wiki, Wiki-NLP Integration, Semantic Assistants, GATE NLP Pipeline (Charaparser). The text of the page etc-project.org/wiki/fna19, “… abaxial faces, without septate trichomes ...”, is passed along the chain to Charaparser.]

SAW for ETC

• Charaparser output (XML):

  <structure id="o0" name="face" constraint="abaxial"/>
  <structure id="o1" name="trichome">
    <character name="architecture" value="septate"/>
  </structure>
  <relation id="r0" name="without" negation="false" from="o0" to="o1"/>

• Resulting wiki pages:

  Faces
  * [[hasConstraint::Abaxial]]
  * [[without::Trichome]]

  Trichome
  * [[hasArchitecture::septate]]

[Figure: components: Wiki, Wiki-NLP Integration, Semantic Assistants, Charaparser NLP Pipeline. The pipeline's XML output is written to the wiki pages etc-project.org/wiki/fna19/superstructure/faces and etc-project.org/wiki/fna19/superstructure/trichome.]
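The mapping from Charaparser XML to wiki markup shown on this slide could be sketched as below. This is a rough illustration under stated assumptions, not the project's translation module: the `<description>` wrapper element and the function name `to_wiki_pages` are hypothetical, page titles are taken verbatim from the `name` attribute (the slide pluralizes and capitalizes some of them), and negated relations are simply skipped because the slides do not say how they should be encoded.

```python
import xml.etree.ElementTree as ET

# The slide's fragment, wrapped in a hypothetical root so it parses as one document.
CHARAPARSER_XML = """
<description>
  <structure id="o0" name="face" constraint="abaxial"/>
  <structure id="o1" name="trichome">
    <character name="architecture" value="septate"/>
  </structure>
  <relation id="r0" name="without" negation="false" from="o0" to="o1"/>
</description>
"""


def to_wiki_pages(xml_text: str) -> dict[str, list[str]]:
    """Return {page title: list of wiki markup lines} for each structure."""
    root = ET.fromstring(xml_text.strip())
    structures = {s.get("id"): s for s in root.iter("structure")}
    pages: dict[str, list[str]] = {}

    for s in structures.values():
        # Structures sharing a name end up on the same page (an open question
        # in the slides); setdefault merges them rather than overwriting.
        lines = pages.setdefault(s.get("name"), [])
        if s.get("constraint"):
            lines.append(f"* [[hasConstraint::{s.get('constraint')}]]")
        for c in s.findall("character"):
            prop = "has" + c.get("name").capitalize()
            lines.append(f"* [[{prop}::{c.get('value')}]]")

    for r in root.iter("relation"):
        if r.get("negation") == "true":
            continue  # assumption: encoding of negated relations is left open
        source = structures[r.get("from")].get("name")
        target = structures[r.get("to")].get("name")
        pages[source].append(f"* [[{r.get('name')}::{target}]]")

    return pages


for title, lines in to_wiki_pages(CHARAPARSER_XML).items():
    print(title)
    print("\n".join(lines))
```

Resolving relations through the `id` attributes rather than names keeps the `from`/`to` references unambiguous even when two structures share a name.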

SAW for ETC

• Charaparser
  – GATE-compliant NLP pipeline
  – Translation module from Charaparser output to wiki markup
  – Wiki as “user interface” of Charaparser
• Logic reasoning, information theory, and ontology building can
  – Read and query Charaparser results from the wiki (RDF triples)
  – Possibly be integrated into the wiki
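To illustrate what reading Charaparser results as triples might look like, the sketch below extracts (page, property, value) tuples from `[[property::value]]` annotations and runs a simple query over them. It is only an illustration of the triple shape: Semantic MediaWiki provides its own query and RDF export facilities, and the regular expression, the function name `extract_triples`, and the in-memory `pages` dictionary are assumptions made for this example.

```python
import re

# Matches [[property::value]] and [[property::value|display text]].
ANNOTATION = re.compile(r"\[\[([^:\[\]|]+)::([^\[\]|]+)(?:\|[^\[\]]*)?\]\]")


def extract_triples(page_title: str, wikitext: str) -> list[tuple[str, str, str]]:
    """Treat the page as subject, the property as predicate, the value as object."""
    return [
        (page_title, m.group(1).strip(), m.group(2).strip())
        for m in ANNOTATION.finditer(wikitext)
    ]


# Hypothetical page contents, copied from the markup on the earlier slide.
pages = {
    "Faces": "* [[hasConstraint::Abaxial]]\n* [[without::Trichome]]",
    "Trichome": "* [[hasArchitecture::septate]]",
}

triples = [t for title, text in pages.items() for t in extract_triples(title, text)]

# Example query: which structures are described as being without trichomes?
print([s for s, p, o in triples if p == "without" and o == "Trichome"])
# prints: ['Faces']
```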

Open Questions

• Charaparser as a GATE-compliant pipeline: effort?
• Alternatives to Semantic Assistants Wiki? Apache Stanbol?
• Representation of structure, character, and relation in the wiki (e.g. duplicate structure names)

References

Bahar Sateli and René Witte. Natural Language Processing for MediaWiki: The Semantic Assistants Approach. WikiSym 2012.

Bahar Sateli, Marie-Jean Meurs, Greg Butler, Justin Powlowski, Adrian Tsang, René Witte. IntelliGenWiki: An Intelligent Semantic Wiki for Life Sciences. NETTAB 2012.

René Witte and Thomas Gitzinger. Connecting Wikis and Natural Language Processing Systems. WikiSym 2007.

http://www.semanticsoftware.info/first-open-source-release-semantic-assistants-wiki-nlp

Architecture
