sequence services phase 2 webinar series: constellation technology and genestack

Post on 11-May-2015

1.108 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

The presentation given by Constellation and Genestack about their response to the Pistoia Alliance Sequence Services Phase 2 RFP.

TRANSCRIPT

Constellation Technologies& GeneStack

Development of Sequence Services 2 in the Constellation

Framework

1

ConstellationExperts in big data and bioinformatics

2

• Spin out from STFC (Science and Technology Facilities Council)– Largest research facility in UK specialising in large data computing

• CERN, European physics and astronomy science• Supporting all UK disciplines in computing

• Strong IT & Bioinformatics expertise– Strong Bioinformatics delivery expertise– Strong connections into European academia– Excellent access to newly developed applications, tools and algorithms

• Supplier of cloud computing services to large Pharma.• Partners for Pistoia SS2

– Microsoft Azure– STFC

Service ServiceService

Constellation’s “Roadmap”

Service

Core

Text Mining/Search

GenomeAnalysis

Data Integration

“Workflow Management”

Seamless Integration with Client systems

API

“AppMarket”

IT

• IT– Platform Design– Support– Maintenance– Testing– Stability / Scalability– Security

• Bioinformatics– Novel Algorithms– Research– Scientific support– Discovery– Analysis– Value Added

4

BioinformaticsIT

• Hosted– Single Vendor– Hardware limitations– Restricted storage– Limited cost models– “Lock in”

• Cloud– Vendor Agnostic– As required– Selectable storage– Best model available– “Flexible”

5

HostingCloud

Vendor Agnostic

Cloud Vision

6

Flexible Storage

Flexible Compute

TrueCloud

Client Business

Logic

MinimiseSupport

“Bioinformatics Marketplace”

Virtual Organisation

ClientApplications

Academic or bespoke solutions

Your Informatics

High Level Architecture

Distributed Storage

Distributed Compute

BioinformaticsSystems

Workflow Tools

Portal

Workflow UIDeployed

Workflow (Apps)Bioinformatics

UIs

Bioinformatics Applications

Our goal for SS2• We believed the end goal was a flexible platform where ALL the

application described in SS2 scope could be deployed for individual clients as required.

• Platform should be scalable where security, support and maintenance can be easily managed.– Reducing support costs allows for more focus on research

• Bioinformatics applications added as required:– GeneStack (Analysis Portal)– VIB (Arctix) (Workflow) (in discussion)– EBI (Services) (in discussion)

• Workflow delivered as a fundamental development principle• Development of the “AppMarket” for Bioinformatics

8

9

CompanySpecific

Integrating3rd PartySystems

SecureScalableStorage

WorkflowCore

IntegrationWith otherSystems

FutureDevelopment

Deliverables achieved• Portal with access to all the “Must Have” Web Services described in the SS2 documentation

– Constellation Managed Administration Interface to allow organisational mapping of users to Programs / Projects / Applications

• “Tool Box” of Integrated Applications– Galaxy– Secure Ensembl– Secure CellProfiler– Content Search (New development)

• Galaxy workflow engine with integrating applications deployed as a secure web application to cover “Must Have” tools– Restricted set of apps based on feedback from “testing pool” (Restrictions based on Need/Security)– Tools can be added on request

• Scalable storage and compute (dependant on need and security)– Structured Program - Project – User mapping– Cost effective data storage and compute

• Initial Integration with another Bioinformatics Vendor (GeneStack)

10

Other Available SAAS tools

• Secure EnsEMBL– Private copy of EnsEMBL (Rackspace)– Secure UI and API Access– Ability to map DAS (secure or Public)

• Parallelised CellProfiler– Private scalable version of CellProfiler on Azure

11

top related