openup! creating a cross- domain pipeline walter g. berendsohn & anton güntsch botanic garden...

19
OpenUp! Creating a cross-domain pipeline Walter G. Berendsohn & Anton Güntsch Botanic Garden & Botanical Museum Berlin- Dahlem, Germany open-up.eu

Upload: rosanna-cooper

Post on 18-Dec-2015

216 views

Category:

Documents


1 download

TRANSCRIPT

OpenUp! Creating a cross-domain pipeline

Walter G. Berendsohn & Anton GüntschBotanic Garden & Botanical Museum Berlin-Dahlem, Germany

open-up.eu

Connecting the cultural and the natural history domainsis the central idea behind OpenUp!

(“Opening up Europe’s natural history heritage to EUROPEANA”).

Tapestry called „Krokus“, B. Rendahl, 1976. © Upplandsmuseet, Uppsala, Sweden. www.europeana.eu

Herbarium specimen Crocus vernus L., BGBM Collection, Berlin.

open-up.eu

open-up.eu

www.europeana.eu/

Europeana

A cross-domain portal to Europe’s cultural and scientific heritage.

To-date 15 million digital images, text and sound files, and videos with a focus on cultural history.

1500 institutions, currently 29 funded projects Natural history:

BHL-Europe, Natural Europe, OpenUp!

Bronze Cat Coffin. © The Oriental Museum, University of Durham, Durham, UK. www.europeana.eu

OpenUp! Project Details

open-up.eu

3-years (03.2011 - 02.2014), 4.2 Million Euro 80% co-funded by the European Commission

An initiative of CETAF (Consortium of European Taxonomic Facilities) and several European GBIF Nodes.

Task 1: Bringing Content to Europeana (and GBIF)

open-up.eu

• OpenUp! is making multimedia content in the BioCASE network accessible to EUROPEANA.

• Committed to serve at least 1.1 million objects by the end of the project (Feb. 2014).

• Most of these records are presently not accessible to GBIF• All content will automatically be served to GBIF when

mobilised by OpenUp.

A drawer with tropical butterflies collected by Alfred Russel Wallace (Natural History Museum, London)

Task 2: Linking BioCASE / GBIF with Europeana

open-up.eu

GBIF Index-based Portals

Provider Database

GBIF IndexPy-Wrapper ABCD

Py-Wrapper DwC-A

Task 2: Linking BioCASE / GBIF with Europeana

open-up.eu

ABCD data records with linked multimedia content are transferred to Europeana standards and offered to Europeana (and others) for harvesting via OAI-PMH

Task 3: Enhancing Data Quality

open-up.eu

• 275 Person-months reserved for local data cleaning

• Data Quality Toolkit uses webservices, reports problems

• Starting with names and integrity rules

Task 3: Enhancing Data QualityExample: Data Quality Toolkit - Integrity Rules

1 Atomized Genus element1 Atomized Genus elements should start with a single uppercase character followed ny a non-empty sequence of lower-case charactersABCD elements:/DataSets/DataSet/Units/Unit/Identifications/Identification/Result/TaxonIdentified/ScientificName/NameAtomised/Zoological/GenusOrMonomial/DataSets/DataSet/Units/Unit/Identifications/Identification/Result/TaxonIdentified/ScientificName/NameAtomised/Botanical/GenusOrMonomialRegular expression:[A-Z][a-z]+2 Collection date fields3 Site coordinate latitude4 Site coordinate longitude5 Syntax of email elements6 ISO country element7 Scientific name (zoology)8 Scientific name (botany)9 Mime type for multimedia objects10 Check whether multimedia object file is available

Task 4: Extending the Network

open-up.eu

• OpenUp! is actively promoting participation beyond the initial consortium

• Concerted Helpdesk activity• Outreach and mobilisation activities supported by

participating GBIF Nodes• Dissemination activities

• New providers will be new GBIF/BioCASE providers (also for non-multimedia content)

The type specimen of a goliath beetle Goliathus atlas Nickerl, 1887 deposited in the collections of the National Museum, Prague, and its original type labels

Task 5: Enhancing the Metadata

open-up.eu

• ABCD specimen data records contain many elements that are useful for semantic linking

• Multicultural context • Enhance scientific names by multilingual common

names – see next talk• Enhance names with synonyms (using same services as

data quality toolkit)• ? Multilingual terms for specific areas

E.g. transliteration of collector names?

Problem #1: Data Access Rights for Metadata

open-up.eu

BioCASE & GBIF networks: GBIF Data Use Agreement

OpenUp!: CC-by

Attribution!

EUROPEANA now requires CC-0

For Europeana, OpenUp! Provides

open-up.eu

• A single access point to distributed non-bibliographic multimedia content in the natural history domain

• Validation mechanisms to ensure compliance with EUROPEANA standards.

• Sustained item-level access by integration with existing networks in the domain (i.e. GBIF, BioCASE and CETAF).

• Metadata enrichment by means of multilingual metadata vocabularies and thesauri for natural history data (e.g. names) to enhance cross-linking of Europeana content.

• A mechanism to extend participation in content provision.

For GBIF and the NH-Collections OpenUp! Provides

open-up.eu

• 80% funding for 275 person months of qualified staff time for data cleaning and quality control.

• Increased relevance of our data by inclusion in EUROPEANA.• Tools for quality control of species names and other data.• Help to extend data provision for GBIF and BioCASE.• Funding for a multilingual index of common names.• Technical solutions for shared but distributed information

infrastructures.

Kionidella moravicensis, Miocene bryozoans from Moravian part of Carpathain Foredeep, old about 14 milions years. Collection of National museum Prague.

BioCASE - Biological Collection Access Service – www.biocase.org

BHL-Europe - Biodiversity Heritage Library Europe - http://www.bhl-europe.eu/

CETAF – Consortium of European Taxonomic Facilities – www.cetaf.org

EUROPEANA – www.europeana.eu/portal/aboutus.html

GBIF - Global Biodiversity Information Facility – www.gbif.org

OpenUp! – Opening up Europe’s natural history heritage for Europeana – www.open-up.eu

SYNTHESYS – A Synthesis of Systematics Ressources - http://www.synthesys.info/

Thank you for your attention!

open-up.eu

Thank you for your attention!

open-up.eu

Lateral view of Epimetopus mendeli from Peru, which is currently under description as new for science in a collaborative paper by the scientists of National Museum in Prague and the Museum of Natural History in London.

Thank you for your attention!

open-up.eu

Lateral view of Epimetopus mendeli from Peru, which is currently under description as new for science in a collaborative paper by the scientists of National Museum in Prague and the Museum of Natural History in London.