service networks, service sets and biodiversity catalogue

18
the biodiversity informatics community at the European level and be Service Networks, Service Sets and Biodiversity Catalogue Alex Hardisty Cardiff University

Upload: alex-hardisty

Post on 27-Jun-2015

93 views

Category:

Technology


2 download

DESCRIPTION

Presentation given at Biodiversity Informatics Horizons 2013 (BIH2013) on 5th September 2013. Provision of data and processing of information are both “services”. How can we establish a landscape in which services are both discoverable and interoperable? What is a Service Network for biodiversity science and what are Service Sets? Why do we need them and how do we build them? What is the role of the Biodiversity Catalogue? Where is this all going?

TRANSCRIPT

Page 1: Service Networks, Service Sets and Biodiversity Catalogue

Structuring the biodiversity informatics community at the European level and beyond

Service Networks, Service Sets and Biodiversity Catalogue

Alex HardistyCardiff University

Page 2: Service Networks, Service Sets and Biodiversity Catalogue

Outline

Provision of data and processing of information are both “services”. How can we establish a landscape in which services are both discoverable and interoperable?

• What is a Service Network and what are Service Sets?

• Why do we need them and how do we build them?

• What is the role of the Biodiversity Catalogue?

• Where is this all going?

Page 3: Service Networks, Service Sets and Biodiversity Catalogue

What is a Service Network?

• A set of Web service (WS) instances that interact together to perform an application objective– In our case: multiple objectives, varying over time and

from one user to another

• Usage and hence composition needs to be dynamic

• In a Service Network:– Instances may join and leave– Instances are discoverable– Managed to a greater or lesser extent

WS1 WS2WS4

WS5

WS3

Biodiversity Service Network

Page 4: Service Networks, Service Sets and Biodiversity Catalogue

Workflow and experiments to study the ecological niche of the Horseshoe crab (Limulus polyphemus)

Workflows, pipelines and other applications are built from “services”

• Workflows allow to run studies and experiments to process vast amounts of data, repeatedly– Select and apply successive “services”

(data analysis and processing steps)– Import data from own research and/or

from existing public sources– Choose input parameters

• Service nesting– Services can themselves be made up

of combinations of other services or of workflow fragments

Page 5: Service Networks, Service Sets and Biodiversity Catalogue

A grouping of Web services having related functionality is called a ‘Service Set’

Taxonomy Metagenomics and metagenetics

Ecological niche and population modelling

Ecosystem functioning and valuation

Mapping, visualization, transformation

Catalogue of Life name lookup

QIIME ENM (openModeller)

Get meteor-ological data

Spatio-temporal visualization

GBIF occurrence data retrieval

BOLD PopBio Weather to Biome-BGC data

GeoServer WMS/WFS/WCS

GBIF ChecklistBank

BlastX Biome-BGCmonte carlo

Raster Diff

WoRMS aphia name

Sequence (OTU) clustering

Biome-BGC sensitivity anal.

ISO Country Code

PESI name Functional diversity

Data-Model harmonization

DwC-A to JSON shim

Checklist Cross-mapping

Taxonomic diversity

Biome-BGC CARBON

DwC-A to CSV shim

?

Taxonomy &Systematics

Ecological niche andpopulation modelling

Ecosystem functioningand valuation

?

Genes-Species-Specimens(multi-scale linkages)

Citizen Science &Observations

Mapping, visualization andtransformation services

Page 6: Service Networks, Service Sets and Biodiversity Catalogue

Service sets driven by science and policy needs

• CO2 emissions continuously increasing– 10 GtC in 2010; Sequestration is the sustainable

process to mitigate the effects

• Over the past 50 years, humans have changed ecosystems– resulting in a substantial and largely irreversible loss

of biodiversity

• Invasions of alien species– A leading cause of biodiversity loss and related

economic damages. They degrade ecosystem services, generate human health problems and impact outdoor recreation.

“transportation with ships is a high risk to

spread the species to these spots”

Stelzer et al 2013

Source: NOAA

Page 7: Service Networks, Service Sets and Biodiversity Catalogue

Service sets driven by science and policy needs

• CO2 emissions continuously increasing– 10 GtC in 2010; Sequestration is the sustainable

process to mitigate the effects

• Over the past 50 years, humans have changed ecosystems– resulting in a substantial and largely irreversible loss

of biodiversity

• Invasions of alien species– A leading cause of biodiversity loss and related

economic damages. They degrade ecosystem services, generate human health problems and impact outdoor recreation.

“transportation with ships is a high risk to

spread the species to these spots”

Stelzer et al 2013

Source: NOAA

Modellingecosystem services

ModellingCO2 sequestration

Calculating measures of genetic diversity

Assessing adaptationto changing conditions

Supporting processesof conservation

Assisting invasivespecies management

Page 8: Service Networks, Service Sets and Biodiversity Catalogue

• Connecting biology and IT communities– Distinct languages, different understandings– Service Network approach connects them

• Supporting use cases we know today ...– … and use cases in the future that we cannot

yet imagine

• Different Service Providers are good (competent) at different things

• Deals with multiple jurisdictions and supports a business model– Leading to sustainability

Why do we need them?

Scientists’ perspectives

Info

rmati

on T

echn

olog

ists

’pe

rspe

ctive

s

Biodiversity studies & experiments

Services for biodiversity science

compose to support

ICT Technical Capabilities

ICT Technical Elements

combine to deliver

combine to support

Page 9: Service Networks, Service Sets and Biodiversity Catalogue

How to build Service Networks and Service Sets?Discipline

Scientists

Scientific PAL

Technical PAL

Scientific and Technical Service Providers

ScientificRequirements

Translation

TechnicalRequirements

TechnicalCapabilities

ScientificCapabilities

ApplicationServices Team

Prioritisation

Support Centre

Training &Issue Resolution

Service LevelRequirements

Sustainability

Community

CommunitySource: M.Obst

Page 10: Service Networks, Service Sets and Biodiversity Catalogue

Objectives in building Service Networks

1. Services MUST be secure, scalable, reliable, and well-documented

2. Services CAN be deployed on standard cloud configurations

3. Services SHOULD be implemented according to recognised ‘best practices’

4. Services MUST be discoverable

Page 11: Service Networks, Service Sets and Biodiversity Catalogue

Services MUST be secure, scalable, reliable, and well-documented

Users’ workflows and applications

Sustained Service and Data ProvidersGBIF, CoL, ITIS, OBIS, WoRMS,EBI, BGBM, CRIA, EoL, BHL, ALA, etc. + many many more

Recognised and stable Resource ProvidersNational, EGI.eu, PRACE, commercial, etc.

Page 12: Service Networks, Service Sets and Biodiversity Catalogue

Services CAN be deployed on standard cloud configurations

• Deployment on standard cloud platforms for better availability and scalability

– 2-tier trust relations

• Pilot service for ‘e-Infrastructure Commons’– Private / public collaboration– Builds in 2014

?

Page 13: Service Networks, Service Sets and Biodiversity Catalogue

Services SHOULD be implemented according to recognised ‘best practices’

• Best practices for improving ease of use and scalability– “The perfect API” : What would be its characteristics?– Wider adoption of standard data and parameter formats to promote

interoperability

• Classes of “shims”– Utility, Format handling, Data from sources

• Founded on standard infrastructure– Use of standard approaches

• to e.g., authentication, authorization– For data storage and staging– For persistent identifiers– For metadata generation– Etc.

?

Page 14: Service Networks, Service Sets and Biodiversity Catalogue

Services MUST be discoverable

www.biodiversitycatalogue.orgA fully curated, well-founded catalogue of

Web services for biodiversity science

Page 15: Service Networks, Service Sets and Biodiversity Catalogue

• GEO BON– Biodiversity Observation

Network– By 2015

• Functional infrastructure“from observations to Essential

Biodiversity Variables (EBV) and derived indicators”

• Using a plug-and-play, service-oriented approach

• Coordinated through a registry system

• Linked to the GEOSS Common Infrastructure

Where’s this all going?

Page 16: Service Networks, Service Sets and Biodiversity Catalogue

• ALA• SiBBr• SpeciesLink• GBoWS• DataONE• GBIF• GEOSS• LifeWatch• SANBI

Towards interoperability guidelines for an Integrated Virtual Environment (IVE) for Biodiversity Science at the

international level

Page 17: Service Networks, Service Sets and Biodiversity Catalogue

What’s trending?

• Software is a key enabling technology– In a distributed and mobile world, this means Web apps

• Web apps means APIs. APIs give access to services– How to create connections between APIs that don’t usually

talk to each other?

• WebApp automation like:– IFTTT – “IF this THEN that”– Zapier

• API / service management & monetization like:– 3Scale

Page 18: Service Networks, Service Sets and Biodiversity Catalogue

Take home messages

• Separate:– Services from underlying infrastructure– Applications, workflows and VREs from the services

• Ensure Service Networks are built using standard Web 2.0 technologies

• BiodiversityCatalogue is the well-founded place to register and discover services

• There’s much more to do!