aginfra science gateway for workflows and integrated services 07/02/2012 robert lovas...

26
agINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas [email protected] MTA SZTAKI

Upload: clifford-booth

Post on 04-Jan-2016

215 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

agINFRA science gateway for workflows and integrated services

07/02/2012

Robert Lovas

[email protected]

MTA SZTAKI

Page 2: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Why workflows?

• ‘Orchestration of Tasks’ (not only in sequence; organized in Direct Acyclic Graphs)

• Data-driven• To exploit large computational resources / process large data sets• To make the complex applications run faster

– By applying paralellization techniques on them

• Paralellization Techniques:– Indepent tasks can be executed concurrently– Execute tasks against LARGE datasets parameter study, domain

decomposition, etc.

Page 3: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Parameter study / domain decomposition

GEN

SEQ

COLL

SEQSEQSEQ

Generates input

parameter space

Evaluates the results of the

simulation

Parameter sweep jobs

3

Page 4: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Liferay-based WS-GRADE/gUSE portal for agINFRA:– http://aginfra-portal.lpds.sztaki.hu/liferay-portal-6.0.5/– Open Registration for project participants– X509 Certificate authentication required to be able to submit jobs NEW: robot certs– agINFRA VO is accessible (ca. 5000 CPU cores, >50 TB storage)

Page 5: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Access modes

5

WS-PGRADEWF

DeveloperUI

gUSE DCI Bridge

DCI 1

DCI 2

DCI n

ApplicationSpecific

User Interface

ExistingApplicationSpecific UI

WS-PGRADEEnd-User

UI

Remote API

BES interface

ASM API

A

B

C

D

E

BES interface

ASM API

WS-PGRADE UI

Customized UI

Other, existing UI

gUSE Workflow engine

agINFRA VOagINFRA VO

Volunteers’ computersVolunteers’ computers

Page 6: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Workflow building blocks (glossary)• “Jobs” operating on data• Jobs can be:

– Grid-enabled applications (e.g. AgrovocTagging)

– Web-services

– NEW: REST services

– Another workflows (embedded)

• “Ports” representing inputs and outputs for the jobs– Available port types:

• Value• Local file e.g. from the scientists laptop• Remote File (gsiftp, lfc) in the Grid• Database Queries (SQL)• ….

– Extensions or improvement might be required (e.g. Drupal / Dublin Core / SPARQL / CIARD RING support?) 6

Page 7: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

agINFRA overview

Page 8: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Services to be integrated

Harvesting, validation, transformation

Page 9: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

The Organic.Edunet Ingest Workflow

Page 10: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Schematic Representation of the AGRIS workflow

Page 11: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Cross-community workflows identified at the Athens

Page 12: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

DEMONSTRATION I.

Page 13: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

AgroTagger

Page 14: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

First demo application

Page 15: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

First demo application - details

Page 16: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Job details

Page 17: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Inputs and outputs

Page 18: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Monitoring of execution

18

Page 19: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Successful execution of AgrovocTagging application

Page 20: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

DEMONSTRATION II.

Harvesting workflow

Page 21: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

ARIADNE aggregation panel

- Select schedulingLink to the workflow interface

-Invoke the workflow-Check the status of the workflow

-Stop the workflow-Add metadata for the

aggregation

ARIADNE aggregation panel

- Select schedulingLink to the workflow interface

-Invoke the workflow-Check the status of the workflow

-Stop the workflow-Add metadata for the

aggregation

gUSE WS-PGRADEgUSE WS-PGRADE

Harvesting- Add parameters of

agDataHarvesters web service

Harvesting- Add parameters of

agDataHarvesters web service

Metadata Validation vs target

schema - Add parameters of

agMetadataValidation web service

Metadata Validation vs target

schema - Add parameters of

agMetadataValidation web service

Target Validation- Add parameters of

agTargetValidation web service

Target Validation- Add parameters of

agTargetValidation web service

Target schema?

Target schema?

No

• Stop the process for the specific target• send message or store logs and send them

through the gUSE API

Yes

Transformation- Get parameters for the

transformation and invoke agMetadataTransformation

web service

Transformation- Get parameters for the

transformation and invoke agMetadataTransformation

web serviceNo

Store metadata on the GRID (agINFRA VO)

Store metadata on the GRID (agINFRA VO)

YesValid?Valid?

starting a pre-defined procedure as an agINFRA workflow

Page 22: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Integration of multiple components/services

Page 23: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Details in demo…

Page 24: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Plan: Continue the development of aggregation workflow

Page 25: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

Plan: Further integration

+ volunteers…

Page 26: AgINFRA science gateway for workflows and integrated services 07/02/2012 Robert Lovas robert.lovas@sztaki.mta.hu MTA SZTAKI

07/02/2012

Questions?

26