sequence services phase 2 webinar series: constellation technology and genestack

11
Constellation Technologies & GeneStack Development of Sequence Services 2 in the Constellation Framework 1

Upload: pistoia-alliance

Post on 11-May-2015

1.107 views

Category:

Technology


0 download

DESCRIPTION

The presentation given by Constellation and Genestack about their response to the Pistoia Alliance Sequence Services Phase 2 RFP.

TRANSCRIPT

Page 1: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

Constellation Technologies& GeneStack

Development of Sequence Services 2 in the Constellation

Framework

1

Page 2: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

ConstellationExperts in big data and bioinformatics

2

• Spin out from STFC (Science and Technology Facilities Council)– Largest research facility in UK specialising in large data computing

• CERN, European physics and astronomy science• Supporting all UK disciplines in computing

• Strong IT & Bioinformatics expertise– Strong Bioinformatics delivery expertise– Strong connections into European academia– Excellent access to newly developed applications, tools and algorithms

• Supplier of cloud computing services to large Pharma.• Partners for Pistoia SS2

– Microsoft Azure– STFC

Page 3: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

Service ServiceService

Constellation’s “Roadmap”

Service

Core

Text Mining/Search

GenomeAnalysis

Data Integration

“Workflow Management”

Seamless Integration with Client systems

API

“AppMarket”

Page 4: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

IT

• IT– Platform Design– Support– Maintenance– Testing– Stability / Scalability– Security

• Bioinformatics– Novel Algorithms– Research– Scientific support– Discovery– Analysis– Value Added

4

BioinformaticsIT

Page 5: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

• Hosted– Single Vendor– Hardware limitations– Restricted storage– Limited cost models– “Lock in”

• Cloud– Vendor Agnostic– As required– Selectable storage– Best model available– “Flexible”

5

HostingCloud

Page 6: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

Vendor Agnostic

Cloud Vision

6

Flexible Storage

Flexible Compute

TrueCloud

Client Business

Logic

MinimiseSupport

“Bioinformatics Marketplace”

Virtual Organisation

ClientApplications

Academic or bespoke solutions

Your Informatics

Page 7: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

High Level Architecture

Distributed Storage

Distributed Compute

BioinformaticsSystems

Workflow Tools

Portal

Workflow UIDeployed

Workflow (Apps)Bioinformatics

UIs

Bioinformatics Applications

Page 8: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

Our goal for SS2• We believed the end goal was a flexible platform where ALL the

application described in SS2 scope could be deployed for individual clients as required.

• Platform should be scalable where security, support and maintenance can be easily managed.– Reducing support costs allows for more focus on research

• Bioinformatics applications added as required:– GeneStack (Analysis Portal)– VIB (Arctix) (Workflow) (in discussion)– EBI (Services) (in discussion)

• Workflow delivered as a fundamental development principle• Development of the “AppMarket” for Bioinformatics

8

Page 9: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

9

CompanySpecific

Integrating3rd PartySystems

SecureScalableStorage

WorkflowCore

IntegrationWith otherSystems

FutureDevelopment

Page 10: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

Deliverables achieved• Portal with access to all the “Must Have” Web Services described in the SS2 documentation

– Constellation Managed Administration Interface to allow organisational mapping of users to Programs / Projects / Applications

• “Tool Box” of Integrated Applications– Galaxy– Secure Ensembl– Secure CellProfiler– Content Search (New development)

• Galaxy workflow engine with integrating applications deployed as a secure web application to cover “Must Have” tools– Restricted set of apps based on feedback from “testing pool” (Restrictions based on Need/Security)– Tools can be added on request

• Scalable storage and compute (dependant on need and security)– Structured Program - Project – User mapping– Cost effective data storage and compute

• Initial Integration with another Bioinformatics Vendor (GeneStack)

10

Page 11: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

Other Available SAAS tools

• Secure EnsEMBL– Private copy of EnsEMBL (Rackspace)– Secure UI and API Access– Ability to map DAS (secure or Public)

• Parallelised CellProfiler– Private scalable version of CellProfiler on Azure

11