visualization tools for the refinery platform - supporting reproducible research with provenance...

80
Visualization Tools for the Refinery Platform Nils Gehlenborg, PhD HARVARD MEDICAL SCHOOLCENTER FOR BIOMEDICAL INFORMATICS SUPPORTING REPRODUCIBLE RESEARCH WITH PROVENANCE VISUALIZATION REPRODUCIBLE RESEARCH

Upload: ngehlenborg

Post on 16-Jul-2015

196 views

Category:

Science


1 download

TRANSCRIPT

Page 1: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

Visualization Tools for the Refinery Platform

Nils Gehlenborg, PhDHARVARD MEDICAL SCHOOL・CENTER FOR BIOMEDICAL INFORMATICS

SUPPORTING REPRODUCIBLE RESEARCH WITH PROVENANCE VISUALIZATION

REPRODUCIBLE RESEARCH

Page 2: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

REPRODUCIBLE RESEARCH

Page 3: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

Health & Science

The new scientific revolution: Reproducibility at lastBy By Joel AchenbachJoel Achenbach January 27January 27

Diederik Stapel, a professor of social psychology in the Netherlands, had been a rock-star scientist — regularlyDiederik Stapel, a professor of social psychology in the Netherlands, had been a rock-star scientist — regularly

appearing on television and publishing in top journals. Among his striking discoveries was that people exposed toappearing on television and publishing in top journals. Among his striking discoveries was that people exposed to

litter and abandoned objects are more likely to be bigoted.litter and abandoned objects are more likely to be bigoted.

And yet there was often something odd about Stapel’s research. When students asked to see the data behind hisAnd yet there was often something odd about Stapel’s research. When students asked to see the data behind his

work, he couldn’t produce it readily. And colleagues would sometimes look at his data and think: It’s beautiful.work, he couldn’t produce it readily. And colleagues would sometimes look at his data and think: It’s beautiful.

Too beautiful. Most scientists have messy data, contradictory data, incomplete data, ambiguous data. This dataToo beautiful. Most scientists have messy data, contradictory data, incomplete data, ambiguous data. This data

was was too good to be truetoo good to be true..

In late 2011, Stapel admitted that he’d been fabricating data for many years.In late 2011, Stapel admitted that he’d been fabricating data for many years.

The Stapel case was an outlier, an extreme example of scientific fraud. But this and several other high-profileThe Stapel case was an outlier, an extreme example of scientific fraud. But this and several other high-profile

cases of misconduct resonated in the scientific community because of a much broader, more pernicious problem:cases of misconduct resonated in the scientific community because of a much broader, more pernicious problem:

Too often, experimental results can’t be reproduced.Too often, experimental results can’t be reproduced.

That doesn’t mean the results are fraudulent or even wrong. But in science, a result is supposed to be verifiable byThat doesn’t mean the results are fraudulent or even wrong. But in science, a result is supposed to be verifiable by

a subsequent experiment. An irreproducible result is inherently squishy.a subsequent experiment. An irreproducible result is inherently squishy.

And so there’s a movement afoot, and building momentum rapidly. Roughly four centuries after the invention ofAnd so there’s a movement afoot, and building momentum rapidly. Roughly four centuries after the invention of

the scientific method, the leaders of the scientific community are recalibrating their requirements, pushing forthe scientific method, the leaders of the scientific community are recalibrating their requirements, pushing for

the sharing of data and greater experimental transparency.the sharing of data and greater experimental transparency.

Top-tier journals, such as Science and Nature, have Top-tier journals, such as Science and Nature, have announced new guidelinesannounced new guidelines for the research they publish. for the research they publish.

“We need to go back to basics,” said Ritu Dhand, the editorial director of the Nature group of journals. “We need“We need to go back to basics,” said Ritu Dhand, the editorial director of the Nature group of journals. “We need

to train our students over what is okay and what is not okay, and not assume that they know.”to train our students over what is okay and what is not okay, and not assume that they know.”

The pharmaceutical companies are part of this movement. Big Pharma has massive amounts of money at stakeThe pharmaceutical companies are part of this movement. Big Pharma has massive amounts of money at stake

and wants to see more rigorous pre-clinical results from outside laboratories. The academic laboratories act asand wants to see more rigorous pre-clinical results from outside laboratories. The academic laboratories act as

Page 4: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

OBITUARY Wylie Vale and an elusive stress hormone p.542

HISTORY OF SCIENCE Descartes’ lost letter tracked using Google p.540

EARTH SYSTEMS Past climates give valuable clues to future warming p.537

AVIAN INFLUENZA Shift expertise to track mutations where they emerge p.534

Raise standards for preclinical cancer research

C. Glenn Begley and Lee M. Ellis propose how methods, publications and incentives must change if patients are to benefit.

Efforts over the past decade to characterize the genetic alterations in human cancers have led to a better

understanding of molecular drivers of this complex set of diseases. Although we in the cancer field hoped that this would lead to more effective drugs, historically, our ability to translate cancer research to clinical suc-cess has been remarkably low1. Sadly, clinical

trials in oncology have the highest failure rate compared with other therapeutic areas. Given the high unmet need in oncology, it is understandable that barriers to clinical development may be lower than for other disease areas, and a larger number of drugs with suboptimal preclinical validation will enter oncology trials. However, this low suc-cess rate is not sustainable or acceptable, and

investigators must reassess their approach to translating discovery research into greater clinical success and impact.

Many factors are responsible for the high failure rate, notwithstanding the inher-ently difficult nature of this disease. Cer-tainly, the limitations of preclinical tools such as inadequate cancer-cell-line and mouse models2 make it difficult for even

Many landmark findings in preclinical oncology research are not reproducible, in part because of inadequate cell lines and animal models.

S. G

SCH

MEI

SSN

ER/S

PL

2 9 M A R C H 2 0 1 2 | V O L 4 8 3 | N A T U R E | 5 3 1

COMMENT

© 2012 Macmillan Publishers Limited. All rights reserved

L I N K TO O R I G I N A L A RT I C L E

A recent report by Arrowsmith noted that the success rates for new development projects in Phase II trials have fallen from 28% to 18% in recent years, with insufficient efficacy being the most frequent reason for failure (Phase II failures: 2008–2010. Nature Rev. Drug Discov. 10, 328–329 (2011))1. This indicates the limi-tations of the predictivity of disease models and also that the validity of the targets being investigated is frequently questionable, which is a crucial issue to address if success rates in clinical trials are to be improved.

Candidate drug targets in industry are derived from various sources, including in-house target identification campaigns, in-licensing and public sourcing, in particular based on reports published in the literature and presented at conferences. During the transfer of projects from an academic to a company setting, the focus changes from ‘interesting’

to ‘feasible/marketable’, and the financial costs of pursuing a full-blown drug discovery and development programme for a particular tar-get could ultimately be hundreds of millions of Euros. Even in the earlier stages, investments in activities such as high-throughput screen-ing programmes are substantial, and thus the validity of published data on potential targets is crucial for companies when deciding to start novel projects.

To mitigate some of the risks of such invest-ments ultimately being wasted, most phar-maceutical companies run in-house target validation programmes. However, validation projects that were started in our company based on exciting published data have often resulted in disillusionment when key data could not be reproduced. Talking to scien-tists, both in academia and in industry, there seems to be a general impression that many

results that are published are hard to repro-duce. However, there is an imbalance between this apparently widespread impression and its public recognition (for example, see REFS 2,3), and the surprisingly few scientific publica-tions dealing with this topic. Indeed, to our knowledge, so far there has been no published in-depth, systematic analysis that compares reproduced results with published results for wet-lab experiments related to target identifica-tion and validation.

Early research in the pharmaceutical indus-try, with a dedicated budget and scientists who mainly work on target validation to increase the confidence in a project, provides a unique opportunity to generate a broad data set on the reproducibility of published data. To substanti-ate our incidental observations that published reports are frequently not reproducible with quantitative data, we performed an analysis of our early (target identification and valida-tion) in-house projects in our strategic research fields of oncology, women’s health and cardio-vascular diseases that were performed over the past 4 years (FIG. 1a). We distributed a ques-tionnaire to all involved scientists from target discovery, and queried names, main relevant published data (including citations), in-house data obtained and their relationship to the pub-lished data, the impact of the results obtained for the outcome of the projects, and the models

Believe it or not: how much can we rely on published data on potential drug targets?Florian Prinz, Thomas Schlange and Khusru Asadullah

Figure 1 | Analysis of the reproducibility of published data in 67 in-house projects. a | This figure illustrates the distribution of projects within the oncology, women’s health and cardiovascular indications that were ana-lysed in this study. b | Several approaches were used to reproduce the pub-lished data. Models were either exactly copied, adapted to internal needs (for example, using other cell lines than those published, other assays and so on) or the published data was transferred to models for another indication. ‘Not applicable’ refers to projects in which general hypotheses could not be verified. c | Relationship of published data to in-house data. The proportion

of each of the following outcomes is shown: data were completely in line with published data; the main set was reproducible; some results (including the most relevant hypothesis) were reproducible; or the data showed incon-sistencies that led to project termination. ‘Not applicable’ refers to projects that were almost exclusively based on in-house data, such as gene expres-sion analysis. The number of projects and the percentage of projects within this study (a– c) are indicated. d | A comparison of model usage in the repro-ducible and irreproducible projects is shown. The respective numbers of projects and the percentages of the groups are indicated.

0CVWTG�4GXKGYU�^�&TWI�&KUEQXGT[

+PEQPUKUVGPEKGU0QV�CRRNKECDNG.KVGTCVWTG�FCVC�CTG�KP�NKPG�YKVJ�KP�JQWUG�FCVC/CKP�FCVC�UGV�YCU�TGRTQFWEKDNG5QOG�TGUWNVU�YGTG�TGRTQFWEKDNG

1PEQNQI[ /QFGN�CFCRVGF�VQ�KPVGTPCN�PGGFU�9QOGPoU�JGCNVJ%CTFKQXCUEWNCT

.KVGTCVWTG�FCVC�VTCPUHGTTGF�VQ�CPQVJGT�KPFKECVKQP�0QV�CRRNKECDNG��/QFGN�TGRTQFWEGF�����

������� ������� �������

������ �������

�����

�����

�������

����� �����

������������

+P�JQWUG�FCVC�KP�NKPG�YKVJ�RWDNKUJGF�TGUWNVU

+PEQPUKUVGPEKGU�VJCV�NGF�VQ�RTQLGEV�VGTOKPCVKQP

/QFGN�TGRTQFWEGF����

�����

�������

/QFGN�CFCRVGF�VQ�KPVGTPCN�PGGFU�EGNN�NKPG��CUUC[U�

�������

�������

.KVGTCVWTG�FCVC�VTCPUHGTTGF�VQ�CPQVJGT�KPFKECVKQP

�����

0QV�CRRNKECDNG

�����

�����

C

F

D E

CORRESPONDENCE

NATURE REVIEWS | DRUG DISCOVERY www.nature.com/reviews/drugdisc

© 2011 Macmillan Publishers Limited. All rights reserved

Page 5: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

1. Statistical issues2. No access to data3. No access to software4. Insufficient description of experimental protocols5. Insufficient description of data analysis process

CHALLENGES FOR REPRODUCIBILITY

Page 6: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

REPRODUCIBLE AND INTEGRATIVE ANALYSIS

Refinery Platform

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

Page 7: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

! DATA REPOSITORY

Page 8: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

! DATA REPOSITORY Meta Data

Page 9: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

! DATA REPOSITORY Meta DataTREATMENT

CELL LINE

TIME POINT

Page 10: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! DATA REPOSITORY

N Gehlenborg et al. , manuscript in preparation

RawData

DerivedData

DerivedData

Meta Data

Page 11: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

! DATA REPOSITORY

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

DerivedData

Meta Data

Page 12: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

! DATA REPOSITORY

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

DerivedData

ProvenanceMeta Data

PROTOCOLS

Page 13: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

! DATA REPOSITORY

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

DerivedData

ProvenanceMeta Data

PROTOCOLS

ALGORITHMS

Page 14: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

! DATA REPOSITORY

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

DerivedData

Experiment Graph

ProvenanceMeta Data

Page 15: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 16: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 17: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 18: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

REPRODUCIBLE AND INTEGRATIVE ANALYSIS

Refinery Platform

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

Page 19: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES!

Page 20: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

GALAXY!

Page 21: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

GALAXY!

REST API

Page 22: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

GALAXY!

REST API

Page 23: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

GALAXY!Tools

REST API

Page 24: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

GALAXY! ToolshedTools

REST API

Page 25: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

GALAXY! Toolshed

Workflow Editor

Tools

REST API

Page 26: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

! ANALYSIS PIPELINES

N Gehlenborg et al. , manuscript in preparation

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINES

GALAXY! Toolshed

Workflow Editor

Tools

REST API

Workflow Inputs

Workflow Outputs

Page 27: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

Page 28: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

Page 29: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

Page 30: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

DerivedData

Page 31: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

DerivedData

Page 32: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

WORKFLOW &

PARAMETERS

DerivedData

Page 33: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

DerivedData

Page 34: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

N Gehlenborg et al. , manuscript in preparation

AssaySampleSource RawData

DerivedData

Experiment Graph

! ANALYSIS PIPELINES

DerivedData

DerivedData

Page 35: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

! ANALYSIS PIPELINES

Page 36: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

REPRODUCIBLE AND INTEGRATIVE ANALYSIS

Refinery Platform

!

!

|

DATA REPOSITORY

VISUALIZATION TOOLS

ANALYSIS PIPELINESISA-TAB ISA-TAB

Page 37: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

| VISUALIZATION TOOLS

N Gehlenborg et al. , manuscript in preparation

Page 38: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

| VISUALIZATION TOOLS

N Gehlenborg et al. , manuscript in preparation

Page 39: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

REPRODUCIBLE AND INTEGRATIVE ANALYSIS

Page 40: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

REPRODUCIBLE AND INTEGRATIVE ANALYSIS

USE CASES

1. Collaboration between computational and experimental labs 2. Repository for large-scale, data-generating projects 3. Integration with existing repositories

Page 41: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 42: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

AssaySampleSource RawData

DerivedData

DerivedData

VISUALIZATION OF PROVENANCE INFORMATION

Page 43: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

AssaySampleSource RawData

DerivedData

DerivedData

VISUALIZATION OF PROVENANCE INFORMATION

Page 44: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

AssaySampleSource RawData

DerivedData

DerivedData

VISUALIZATION OF PROVENANCE INFORMATION

1. How do we represent provenance information?2. How do we make provenance information actionable?

Page 45: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

AssaySampleSource RawData

DerivedData

DerivedData

VISUALIZATION OF PROVENANCE INFORMATION

Stefan Luger, BSc JOHANNES KEPLER UNIVERSITY LINZ

Marc Streit, PhD JOHANNES KEPLER UNIVERSITY LINZ

Page 46: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

!

AssaySampleSource RawData

DerivedData

DerivedData

VISUALIZATION OF PROVENANCE INFORMATION

Page 47: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

VISUALIZATION OF PROVENANCE INFORMATION

breadth

depth

Page 48: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

BASIC APPROACH

Page 49: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 50: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 51: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 52: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 53: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 54: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

FILTERING: BASED ON META DATA

Page 55: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 56: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 57: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 58: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 59: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 60: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

DEALING WITH BREADTH: LAYERING

Page 61: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 62: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 63: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 64: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

CONTROLLING LEVEL OF DETAIL: DEGREE OF INTEREST (DOI)

Page 65: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

no lens fish-eye lens

Page 66: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 67: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 68: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 69: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 70: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 71: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 72: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 73: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 74: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 75: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

OUTLOOK: MAKE PROVENANCE ACTIONABLE

Page 76: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 77: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization
Page 78: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

N Gehlenborg et al. , manuscript in preparation

REPRODUCIBLE AND INTEGRATIVE ANALYSIS

Page 79: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

HARVARD MEDICAL SCHOOL

JOHANNES KEPLER UNIVERSITY LINZ Stefan Luger Samuel Gratzl Holger Stitz Marc Streit

HARVARD CHAN SCHOOL OF PUBLIC HEALTH

Funding NIH/NHGRI K99 HG007583 & Harvard Stem Cell Institute

Ilya Sytchev Shannan Ho Sui Winston Hide

AcknowledgementsRichard Park Psalm Haseley Anton Xue Peter J Park

Page 80: Visualization Tools for the Refinery Platform - Supporting reproducible research with provenance visualization

Methods to Enhance the Reproducibility of Precision Medicine

Pacific Symposium on Biocomputing The Big Island of Hawaii

January 4-8, 2016

people.fas.harvard.edu/~manrai/

http://bit.ly/patient-driven http://bit.ly/psb16-reproducibility

WE ARE HIRING!http://j.mp/refinery-developer-jr

GREAT CONFERENCES!