STELLAR Introduction Douglas Tudhope Hypermedia Research Unit, University of Glamorgan

Post on 14-Mar-2016

Page 1

STELLAR Introduction

Douglas Tudhope

Hypermedia Research Unit, University of Glamorgan

Page 2

STELLAR

12-month AHRC-funded project
– Hypermedia Research Unit, University of Glamorgan
– Archaeology Data Service, University of York
– English Heritage Centre for Archaeology, Portsmouth

Builds on the previous 3-year AHRC-funded STAR Project

Acknowledgments
Ceri Binding (University of Glamorgan)
Andreas Vlachidis (University of Glamorgan)
Keith May, English Heritage (EH)
Stuart Jeffrey, Julian Richards, Michael Charno, Tim Evans, Holly Wright
Archaeology Data Service (ADS)
Archaeology Department, University of York

Page 3

STAR – Aims and background

• Investigate semantic technologies for integrating and cross searching datasets and associated grey literature

• Current situation - fragmented datasets with different terminology

• Lack of semantic interoperability and cross search

• Need for an integrative metadata framework
– CIDOC CRM (ISO standard) as high-level core ontology
– together with the CRM-EH archaeological extension of the CRM
– along with relevant EH thesauri and glossaries

Page 4

STAR Project - General Architecture

[Architecture diagram; components as labelled on the slide]

• Data sources: RRAD, RPRE, LEAP, STAN, MoLAS; grey literature; EH thesauri and glossaries
• Data mapping / normalisation, conversion, indexing
• RDF-based semantic layer (CRM / CRM-EH / SKOS)
• Web services, SQL, SPARQL
• Applications: server side, rich client, browser

Page 5

Natural Language Processing (NLP) of archaeological grey literature

Extract key concepts in the same semantic representation as for data.

Allows unified searching of different datasets and grey literature in terms of the same underlying CRM-based conceptual structure.

Output as RDF triples in the Demonstrator and as XML with the grey literature.

“ditch containing prehistoric pottery dating to the Late Bronze Age”

Page 6

STAR Demonstrator – search for a conceptual pattern

An Internet Archaeology publication on one of the datasets we used in STAR (Silchester Roman) discusses the finding of a coin within a hearth. Does the same thing occur in any of the grey literature reports?

Requires comparison of extracted data with NLP indexing in terms of the ontology.
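The kind of pattern search described above can be sketched in a few lines. This is only an illustration, not the STAR Demonstrator's implementation: the triples, identifiers and predicate names (`type`, `found_in`) are hypothetical placeholders rather than CRM or CRM-EH properties. The point is that once database records and NLP annotations share one representation, a single query covers both.

```python
# Hypothetical, simplified triples: (subject, predicate, object).
# Predicate names are illustrative placeholders, not CRM / CRM-EH properties.
TRIPLES = [
    # as if extracted from an excavation database
    ("find:1", "type", "coin"),
    ("find:1", "found_in", "context:10"),
    ("context:10", "type", "hearth"),
    ("find:2", "type", "coin"),
    ("find:2", "found_in", "context:11"),
    ("context:11", "type", "ditch"),
    # as if produced by NLP indexing of a grey literature report
    ("report7:find:3", "type", "coin"),
    ("report7:find:3", "found_in", "report7:context:4"),
    ("report7:context:4", "type", "hearth"),
]


def objects(triples, subject, predicate):
    """All objects asserted for the given subject and predicate."""
    return {o for s, p, o in triples if s == subject and p == predicate}


def find_pattern(triples, find_type, context_type):
    """Sorted (find, context) pairs: a find of find_type within a context of context_type."""
    matches = []
    for s, p, o in triples:
        if p != "found_in":
            continue
        if (find_type in objects(triples, s, "type")
                and context_type in objects(triples, o, "type")):
            matches.append((s, o))
    return sorted(matches)


for find, context in find_pattern(TRIPLES, "coin", "hearth"):
    print(find, "in", context)
```

A real system would express the same pattern as a query against a triple store; the list scan here just keeps the sketch self-contained.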

Page 7

STAR Demonstrator – search for a conceptual pattern

Research paper reports finding a coin in a hearth: does this exist elsewhere?

Page 8

Stratigraphic query

Page 9

STELLAR aims and outcomes

• Make it easier to map and extract datasets to the CIDOC CRM ontology in a consistent manner

• Generalise the data extraction tools produced by STAR so third party data providers can use them

• Develop methods for mapping and extraction of archaeological datasets into RDF/XML conforming to the CIDOC CRM-EH ontology, with unique global identifiers for entities and concepts (HTTP URIs), for publication as linked data

• Freely available tools and guidelines/tutorials

Page 10

STELLAR background

• In practice mapping to CRM has tended to require specialist knowledge of the ontology and been resource intensive

• Given the wide scope of the CRM, it is possible to make multiple valid mappings, depending on the intended purpose and focus of the mappings

• STELLAR tools convert archaeological data to CRM/RDF in a consistent manner, without requiring detailed knowledge of the underlying ontology

• User chooses a template for a particular data pattern and supplies the corresponding input from their database (combination of optional elements with a mandatory ID)

• STELLAR templates for
– CRM-EH archaeological extension to the CIDOC CRM
– Some more general CIDOC CRM templates conforming to the CLAROS Project format
– SKOSifying a glossary/thesaurus connected with the dataset
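The template idea (a mandatory ID plus optional elements) can be illustrated with a minimal sketch. It does not reproduce STELLAR's actual template format: the dictionary layout, the column names, the `http://example.org/crmeh#` namespace and the choice of `rdfs:label` / `rdfs:comment` as predicates are all assumptions made for illustration. Only the entity type EHE0007 (Context) and the `http://stellar/silchester/` prefix come from the slides.

```python
import csv
import io
from urllib.parse import quote

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
CLASS_NS = "http://example.org/crmeh#"  # placeholder namespace (assumption)

# Hypothetical template: one mandatory ID column, any number of optional columns.
CONTEXT_TEMPLATE = {
    "id_column": "context_id",
    "entity_type": "EHE0007",  # Context
    "optional": {              # optional column -> predicate (assumed choices)
        "label": "http://www.w3.org/2000/01/rdf-schema#label",
        "description": "http://www.w3.org/2000/01/rdf-schema#comment",
    },
}


def escape_literal(text):
    """Escape backslashes and double quotes for an N-Triples-style literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def apply_template(template, rows, prefix):
    """Turn input rows into N-Triples-style lines; the user never sees the ontology."""
    lines = []
    for row in rows:
        ident = (row.get(template["id_column"]) or "").strip()
        if not ident:
            raise ValueError("row is missing the mandatory ID")
        entity = template["entity_type"]
        subject = f"<{prefix}{entity}_{quote(ident, safe='')}>"
        lines.append(f"{subject} <{RDF_TYPE}> <{CLASS_NS}{entity}> .")
        for column, predicate in template["optional"].items():
            value = (row.get(column) or "").strip()
            if value:  # optional elements are simply skipped when empty
                lines.append(f'{subject} <{predicate}> "{escape_literal(value)}" .')
    return lines


CSV_TEXT = "context_id,label,description\n1015,Hearth,\n1016,,Ditch fill\n"
ROWS = list(csv.DictReader(io.StringIO(CSV_TEXT)))

for line in apply_template(CONTEXT_TEMPLATE, ROWS, "http://stellar/silchester/"):
    print(line)
```

Because the mapping decisions live in the template, every dataset converted with it produces the same RDF pattern, which is the consistency the slide refers to.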

Page 11

STELLAR templates – Finds example

Page 12

STELLAR applications

STELLAR.Console

STELLAR.Web

http://hypermedia.research.glam.ac.uk/resources/STELLAR-applications/

Page 13

STELLAR.Console

Command line application

Freely downloadable

Data import / export / conversion

Page 14

STELLAR.Web

Subset of STELLAR.Console functionality
– CSV2RDF
– RDFStats

Using same templates

Input CSV from any external application

No registration or login

Page 15

Data Conversions

[Diagram: conversions between TAB, CSV, Database and RDF, with a Data Template]

– CSV2DB, TAB2DB
– SQL2CSV, SQL2TAB
– CSV2RDF, TAB2RDF, SQL2RDF

Page 16

Consistent URI Construction

Namespace prefix (user defined): http://stellar/silchester/

Entity type (generated by template): EHE0007 (i.e. Context)

Value (from data, URL encoded if necessary): 1015

URI pattern: {prefix}{entity type}_{value}

http://stellar/silchester/EHE0007_1015
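The URI pattern can be written as a one-line function. This is a sketch of the pattern as stated on the slide, not STELLAR's code: the function name `build_uri` is hypothetical, and encoding every reserved character in the value (including `/`) is an assumption, since the slide only says "URL encoded if necessary".

```python
from urllib.parse import quote


def build_uri(prefix, entity_type, value):
    """Compose {prefix}{entity type}_{value}, percent-encoding the value."""
    # safe="" also encodes "/" so a value can never add path segments (assumption)
    return f"{prefix}{entity_type}_{quote(str(value), safe='')}"


print(build_uri("http://stellar/silchester/", "EHE0007", 1015))
# http://stellar/silchester/EHE0007_1015
```

Building every identifier through one function like this is what keeps URIs for the same entity identical across templates and conversion runs.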

Page 17

Using the RDF data

[Diagram: RDF data output from STELLAR, an RDF application / triple store, and three ways of using the data]

– SPARQL queries
– RDF enabled applications
– Linked data browsers

Page 18

Linked data publication by ADS

• Selected range of archived archaeological excavation datasets (academic and commercial sectors) converted to RDF using STELLAR tools and ingested into a repository (triple store)

• The SPARQL endpoint allows consumption by semantic technologies including Pubby (an open source linked data front end) used for publishing linked data

• Content negotiation presents data in formats appropriate for the requesting application (e.g. RDF/XML, or HTML for browsers).

• Effort devoted to ensuring URI construction is appropriate for the domain. For ADS archives this includes use of existing DOI identifier codes in the target URI. For external datasets (not already archived with the ADS, e.g. from commercial contractors), site naming conventions validated by the ADS are adopted.

• The linked data outputs (and the front end) are available from the ADS website: http://data.archaeologydataservice.ac.uk
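Content negotiation, mentioned in the list above, can be sketched generically as follows. This is not how Pubby or the ADS service is implemented; the `REPRESENTATIONS` table and function names are hypothetical, and the parsing is deliberately simplified (it honours `q` values but ignores the media-range specificity rules of the HTTP specification).

```python
# Hypothetical mapping from requested media type to the representation served.
REPRESENTATIONS = {
    "application/rdf+xml": "rdf-xml",
    "text/html": "html",
}


def parse_accept(header):
    """Return (media_type, q) pairs from an Accept header, highest q first."""
    entries = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0
        entries.append((media_type, q))
    # sorted() is stable, so entries with equal q keep the client's order
    return sorted(entries, key=lambda e: e[1], reverse=True)


def choose_representation(accept_header, default="html"):
    """Pick the representation for the first acceptable media type we can serve."""
    for media_type, q in parse_accept(accept_header):
        if q <= 0:
            continue
        if media_type in REPRESENTATIONS:
            return REPRESENTATIONS[media_type]
        if media_type == "*/*":
            return default
    return default


print(choose_representation("application/rdf+xml"))
print(choose_representation("text/html,application/xhtml+xml,*/*;q=0.8"))
```

The same URI thus serves an RDF client and a person in a web browser, each in the format it asks for.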

Page 19

Contact Information

Douglas Tudhope
Faculty of Advanced Technology
University of Glamorgan
Pontypridd CF37 1DL
Wales, [email protected]

http://hypermedia.research.glam.ac.uk/kos/STAR/
http://hypermedia.research.glam.ac.uk/resources/star-demonstrator/ (STAR Research Demonstrator)
http://intarch.ac.uk/journal/issue30/tudhope_index.html (STAR Internet Archaeology paper, open access)
http://andronikos.kyklos.co.uk/aboutus.php (NLP work: see reports with CRM and CRM-EH composite annotations in Sample Documents)

http://hypermedia.research.glam.ac.uk/kos/STELLAR/
http://hypermedia.research.glam.ac.uk/resources/STELLAR-applications/ (STELLAR tools, templates and documentation)
http://data.archaeologydataservice.ac.uk (STELLAR linked data)