BioSharing at International Data Week - NIH BD2K session, Denver 2016

International Data Week, SciDataCon, Denver, 12 September 2016. Community standards for interoperability: BioSharing, an informative and educational resource. Susanna-Assunta Sansone, PhD, Associate Director, Oxford e-Research Centre, University of Oxford

Upload: susanna-assunta-sansone

Post on 13-Apr-2017


TRANSCRIPT

Page 1: BioSharing at International Data Week - NIH BD2K session, Denver 2016

International Data Week, SciDataCon, Denver, 12 September 2016

Community standards for interoperability:

BioSharing, an informative and educational resource

Susanna-Assunta Sansone, PhD, Associate Director, Oxford e-Research Centre,

University of Oxford

Page 2

Interoperability standards - Definition

•  Enable the operational processes underlying exchange and sharing of information between different systems to

§  ensure all digital research outputs are Findable, Accessible, Interoperable and Reusable (FAIR)

Page 3

Interoperability standards - Definition

•  Enable the operational processes underlying exchange and sharing of information between different systems to

§  ensure all digital research outputs are Findable, Accessible, Interoperable and Reusable (FAIR)

•  Among the interoperability standards, one category focuses on the descriptions (or metadata) of digital objects

•  within this category there are content standards

Page 4

Content standards – What for?

Opens datasets to transparent interpretation, verification and exchange and (re)use

Page 5

Content standards – Three types

•  Formats: conceptual model, conceptual schema, exchange formats etc.

o  Allow data to flow from one system to another

o  e.g. FASTA

•  Terminologies: controlled vocabularies, taxonomies, thesauri, ontologies etc.

o  Use the same word and refer to the same ‘thing’

o  e.g. Gene Ontology

•  Guidelines: minimum information reporting requirements, checklists

o  Report the same core, essential information

o  e.g. MIAME guidelines
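The three types can be illustrated with a small sketch. It is not taken from the slides: the function names, the two-term vocabulary and the checklist fields are hypothetical, chosen only to show what each kind of standard constrains (the syntax of a file, the words used in annotations, and the minimum set of reported fields).

```python
from typing import Dict, Iterator, List, Set, Tuple


def parse_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Format: yield (header, sequence) pairs from FASTA-formatted text."""
    header = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Terminology: a tiny illustrative subset of Gene Ontology identifiers
# (biological_process and molecular_function), not a full vocabulary.
ALLOWED_TERMS: Set[str] = {"GO:0008150", "GO:0003674"}


def unknown_terms(annotations: List[str], vocabulary: Set[str]) -> List[str]:
    """Return annotation terms that are not in the controlled vocabulary."""
    return sorted(set(annotations) - vocabulary)


# Guideline: a hypothetical minimum-information checklist
# (these field names are illustrative, not the MIAME checklist).
REQUIRED_FIELDS: List[str] = ["organism", "sample_source", "protocol"]


def missing_fields(record: Dict[str, str], required: List[str]) -> List[str]:
    """Return required fields that are absent or empty in the record."""
    return [field for field in required if not record.get(field)]
```

A submission could pass one check and fail another: a well-formed FASTA file says nothing about whether its annotations use shared terms or whether the core information was reported.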

Page 6

Over 700 content standards in biomedical sciences

de jure standard organizations; de facto grass-roots groups; Nanotechnology Working Group

Formats: MAGE-Tab, GCDML, SRAxml, SOFT, FASTA, DICOM, MzML, SBRML, SEDML, GELML, ISA-Tab, CML, MITAB, ...

Terminologies: AAO, CHEBI, OBI, PATO, ENVO, MOD, BTO, IDO, TEDDY, PRO, XAO, DO, VO, ...

Guidelines: MIAME, MIAPA, MIRIAM, MIQAS, MIX, MIGEN, ARRIVE, MIAPE, MIASE, MIQE, MISFISHIE, REMARK, CONSORT, ...

Page 7

Complex and evolving landscape

Data policies by funders, journals and other organizations (100s+?)

Databases, tools and services (1000s?)

Content standards (700+): Formats, Terminologies, Guidelines

Page 8

Challenges emerged already 10 years ago

From the standards developers’ view, incl.:

•  Complex life cycle and diverse stakeholder communities

•  No central authority recognized by all the parties involved

•  Mainly volunteer activity with little/no funding (except the current NIH BD2K RFA!)

•  Standalone, fragmented standards: unnecessary duplications and gaps

•  Social and technical challenges, extensive community dynamics

•  Lack of rewards and incentives for all contributors

•  Ownership of open standards and the legal framework are very embryonic

From the standards consumers’ view, incl.:

•  Little/no guidance and training material to navigate, select, re-use, extend or recommend most appropriate standards

•  Domain-specific fragmented standards that cannot be used in combination

•  Standards seen as burdensome and/or over-prescriptive

•  Limited number of tools/databases implementing standards for an ‘invisible use’

•  Little/no appropriate funding mechanisms to support use of standards

Reference points: CDISC since 1997; MIAME published in 2001

Page 9

BioSharing: inform and educate, working with and for the community

•  Is there a database, implementing standards, where I can deposit my metagenomics dataset?

•  My funder’s data sharing policy recommends the use of established standards, but which ones are widely endorsed and applicable to my toxicological and clinical data?

•  Am I using the most up-to-date version of this terminology to annotate cell-based assays?

•  I understand this format has been deprecated; what has it been replaced by and who is leading the work?

•  Are there databases implementing this exchange format, whose development we have funded?

•  What are the mature standards and standards-compliant databases we should recommend to our authors?

Page 10
Page 11

What is BioSharing?

A web-based, curated and searchable portal that monitors the development and evolution of standards, their use in databases and the adoption of both in data policies, to inform and educate the user community.

Page 12

What is BioSharing?

Standards are digital objects too and we make them FAIR

Page 13
Page 14
Page 15
Page 16

Using indicators to describe the ‘status’ of a resource

Ready for use, implementation, or recommendation

In development

Status uncertain

Deprecated as subsumed or superseded

Manually curated, approved by the community
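The status indicators above could be modelled roughly as follows. This is a minimal sketch, not BioSharing's actual data model: the `Status` enum, the `Resource` record, the `curated` flag and the example names are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class Status(Enum):
    """The four status labels listed on the slide."""
    READY = "Ready for use, implementation, or recommendation"
    IN_DEVELOPMENT = "In development"
    UNCERTAIN = "Status uncertain"
    DEPRECATED = "Deprecated as subsumed or superseded"


@dataclass
class Resource:
    name: str
    status: Status
    # Assumption: "manually curated, approved by the community" is treated
    # here as a separate flag rather than as a fifth status value.
    curated: bool = False


def recommendable(resources: List[Resource]) -> List[str]:
    """Names of resources whose status is 'ready', in input order."""
    return [r.name for r in resources if r.status is Status.READY]


# Hypothetical records, used only to exercise the functions.
EXAMPLES = [
    Resource("Format A", Status.READY, curated=True),
    Resource("Terminology B", Status.IN_DEVELOPMENT),
    Resource("Guideline C", Status.DEPRECATED, curated=True),
]
```

With a closed set of labels like this, a journal or funder could filter a registry down to the resources it is prepared to recommend.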

Page 17
Page 18
Page 19

Helping you discover standards, databases, data policies and the relationships between them
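One way to picture these relationships is as a small set of typed links that can be queried in reverse. The sketch below is illustrative only: the record names, the relation labels `implements` and `recommends`, and the helper functions are hypothetical and do not describe BioSharing's interface.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple

# (subject, relation, object) links between hypothetical records.
LINKS: List[Tuple[str, str, str]] = [
    ("Database X", "implements", "Format A"),
    ("Database Y", "implements", "Format A"),
    ("Database Y", "implements", "Terminology B"),
    ("Policy P", "recommends", "Format A"),
    ("Policy P", "recommends", "Database X"),
]


def build_index(links: List[Tuple[str, str, str]]) -> Dict[Tuple[str, str], Set[str]]:
    """Index links by (relation, object) so reverse lookups are direct."""
    index: Dict[Tuple[str, str], Set[str]] = defaultdict(set)
    for subject, relation, obj in links:
        index[(relation, obj)].add(subject)
    return index


def who(index: Dict[Tuple[str, str], Set[str]], relation: str, target: str) -> List[str]:
    """Subjects that stand in `relation` to `target`, sorted by name."""
    return sorted(index.get((relation, target), set()))


INDEX = build_index(LINKS)
```

A question such as "are there databases implementing this exchange format?" then becomes `who(INDEX, "implements", "Format A")`.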

Page 20

Pre-package the resources to help you find what is relevant to you

Page 21

Is BioSharing used?

Success stories?

YES!

Page 22
Page 23
Page 24
Page 25

“BioSharing and its interactive browser will allow us to discover which databases and standards are not currently included in our author guidelines, enabling us to regularly monitor and refine our policies as appropriate, in support of our mission to help our authors enhance the reproducibility of their work.” – Holly Murray, F1000Research

Page 26

…to export standards-derived metadata for the creation of annotation templates... next talk!

Example of MIAME elements:

study                    MUST
study title              SHOULD
study description        MAY

series                   MUST
series title             MUST
series summary           MUST

experiment               MUST
experiment title         MUST
experiment description   MUST
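The MUST / SHOULD / MAY levels attached to each element lend themselves to a simple template check. The sketch below is an assumption about how such a template might be applied, not the export format the slide refers to: `STUDY_TEMPLATE` copies the three "study" elements from the slide, while the function name and return shape are hypothetical.

```python
from typing import Dict, List, Tuple

# Element name -> requirement level, as listed for the "study" group.
STUDY_TEMPLATE: Dict[str, str] = {
    "study": "MUST",
    "study title": "SHOULD",
    "study description": "MAY",
}


def check_annotation(
    record: Dict[str, str], template: Dict[str, str]
) -> Tuple[List[str], List[str]]:
    """Return (errors, warnings) for a record checked against a template.

    A missing or empty MUST element is an error, a missing or empty
    SHOULD element is a warning, and MAY elements are never reported.
    """
    errors: List[str] = []
    warnings: List[str] = []
    for element, level in template.items():
        if record.get(element):
            continue
        if level == "MUST":
            errors.append(element)
        elif level == "SHOULD":
            warnings.append(element)
    return errors, warnings
```

For example, `check_annotation({"study": "S1"}, STUDY_TEMPLATE)` reports no errors and one warning for the missing study title.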

Page 27

Advisory Board, Operational Team