7-23-12 duraspace & vivo sponsor webinar slides

48
DuraSpace and VIVO DuraSpace Sponsors Webinar July 23, 2012 Michele Kimpton - DuraSpace Jonathan Markow - DuraSpace Dean Krafft – Cornell/VIVO Jon Corson-Rikert – Cornell/VIVO

Upload: duraspace

Post on 20-Aug-2015

945 views

Category:

Technology


0 download

TRANSCRIPT

DuraSpace and VIVO

DuraSpace Sponsors WebinarJuly 23, 2012

Michele Kimpton - DuraSpaceJonathan Markow - DuraSpace

Dean Krafft – Cornell/VIVOJon Corson-Rikert – Cornell/VIVO

Using the Webinar Platform

• 2-way audio for all participants is muted

• We will utilize the Chat Window for the Q&A portion or you may use it if you are having technical difficulties

• You may type your question here & hit ‘enter’

DuraSpace and VIVO

DuraSpace Sponsors WebinarJuly 23, 2012

Michele Kimpton - DuraSpaceJonathan Markow - DuraSpace

Dean Krafft – Cornell/VIVOJon Corson-Rikert – Cornell/VIVO

Agenda

• The DuraSpace Incubator• The VIVO Project• DuraSpace and VIVO – Current Status

and Next Steps• Questions

The DuraSpace Incubator

• Uses the Apache and Jasig models as a starting point

• Projects are mentored by DuraSpace• Projects work towards best practices,

governance, sustainability• End of process is status as officially

recognized project

DuraSpace Umbrella Services

• Legal• Licensing• Governance• Developer resources• Mentoring• Technology expertise• International exposure• Contributors• Recognition• Marketing and

Communications

• Webinars• Education• Peer Support• Conferences• Community

development• Service providers• Admin support• Grants• Advocacy• Strategic planning

Advantages to DuraSpace and our Community

• Broader community support• New technologies of interest to

DuraSpace community (e.g., VIVO: semantic web, linked open data)

• Technology synergies (e.g., Fedora)• Attractiveness to corporate sponsors• Wider base of developers• Management and overhead cost

efficiencies

DuraSpace Expectations

• Projects must meet incubation criteria vis-à-vis best practices, licensing, broad committer support, etc.

• Project community must fund DuraSpace resources

• Project works with DuraSpace to develop sustainability plan

Agenda

• The DuraSpace Incubator• The VIVO Project• DuraSpace and VIVO – Current Status

and Next Steps• Questions

VIVO: An Open Source Tool for Describing and Linking Researchers

and Research

DuraSpace Sponsors WebinarJuly 23, 2012

Dean B. Krafft (presenter) andJon Corson-Rikert

Cornell University Library

VIVO CollaborationCornell UniversityDean Krafft (Cornell PI)

Manolo BeviaJim Blake

Nick CappadonaBrian Caruso

Jon Corson-RikertElly Cramer

Medha DevareElizabeth Hines

Huda KhanDepak Konidena

Brian LoweJoseph McEnerneyHolly Mistlebauer

Stella MitchellAnup Sawant

Christopher WestlingTim Worrall

Rebecca Younes

University of FloridaMike Conlon (VIVO and UF PI)

Beth AutenMichael Barbieri

Chris BarnesKaitlin Blackburn

Cecilia BoteroKerry Britt

Erin BrooksAmy Buhler

Ellie BushhousenLinda Butson

Chris CaseChristine Cogar

Valrie DavisMary Edwards

Nita FerreeRolando Garcia-Milan

George HackChris HainesSara HenningRae Jesano

Margeaux JohnsonMeghan Latorre

Yang LiJennifer LyonPaula Markes

Hannah NortonJames Pence

Narayan RaumNicholas Rejack

Alexander RockwellSara Russell Gonzalez

Nancy SchaeferDale SchepplerNicholas SkaggsMatthew Tedder

Michele R. TennantAlicia Turner

Stephen Williams

Indiana UniversityKaty Borner (IU PI)

Kavitha ChandrasekarBin Chen

Shanshan ChenRyan CobineJeni Coffey

Suresh DeivasigamaniYing Ding

Russell DuhonJon Dunn

Poornima GopinathJulie Hardesty

Brian KeeseNamrata Lele

Micah LinnemeierNianli Ma

Robert H. McDonaldAsik Pradhan Gongaju

Mark PriceMichael Stamper

Yuyin SunChintan TankAlan Walsh

Brian WheelerFeng Wu

Angela Zoss

Ponce School of MedicineRichard J. Noel, Jr. (Ponce PI)

Ricardo Espada ColonDamaris Torres Cruz

Michael Vega Negrón

This project is funded by the National Institutes of Health, U24 RR029822"VIVO: Enabling National Networking of Scientists”

The Scripps Research Institute

Gerald Joyce (Scripps PI)Catherine Dunn

Sam KatkovBrant KelleyPaula King

Angela MurrellBarbara NobleCary Thomas

Michaeleen Trimarchi

Washington University School of Medicine in St. Louis

Rakesh Nagarajan (WUSTL PI)Kristi L. HolmesCaerie HouchinsGeorge JosephSunita B. Koul

Leslie D. McIntosh

Weill Cornell Medical CollegeCurtis Cole (Weill PI)

Paul AlbertVictor Brodsky

Mark BronnimannAdam Cheriff

Oscar CruzDan Dickinson

Richard HuChris Huang

Itay KlazKenneth Lee

Peter MicheliniGrace Migliorisi

John RuffingJason Specland

Tru TranVinay Varughese

Virgil Wong

What is VIVO?

• A semantic-web-based researcher and research discovery tool– People plus much more

• Institution-wide, publicly-visible information– For external as well as internal audiences

• An open, shared platform for connecting scholars, communities, and campuses using Linked Open Data

What does VIVO do?• Integrates multiple sources of data– Systems of record– Faculty activity reporting– External sources (e.g., Scopus, PubMed, NIH

RePORTER)• Provides a review and editing interface– Single sign-on for self-editing or by proxy

• Provides integrated, filterable feeds to other websites

A brief VIVO history2003-2005 First development for the life sciences

at Cornell, as a relational database2006-2008 Expansion to all disciplines at Cornell,

and conversion to Semantic Web2009-2012 National Institutes of Health-sponsored

VIVO: Enabling the National Networking of Scientists project transforms VIVO to a multi-institutional open source platform

2012+ Transitioning VIVO to DuraSpace for open community development

What does VIVO model?• People, but also organizations, grants, programs,

projects, publications, events, facilities, and research resources

• Relationships among the above– Meaningful connections among people and activities– Bidirectional– Context and navigation from one point of interest to

another• Links to URIs outside VIVO– Concepts– People, places, organizations, events

Typical data sources• HR and/or directory – people, appointments• Research administration – grants & contracts• Registrar – courses• Graduate-level programs/advising affiliations• Faculty reporting system(s) and/or self or proxy

editing – narratives, service, research areas, awards• Events calendar• News releases and/or press clippings• PubMed, Scopus, repositories – publications

What is a Semantic Web application?• Provides data readable by machines, not just text for

humans• Provides self-describing data via shared ontologies– Defined types– Defined relationships

• Provides search & query augmented by relationships• Does simple reasoning to categorize and find

associations– Teaching faculty = any faculty member teaching a course– All researchers involved with any gene associated with

breast cancer (through research project, publication, etc.)

The VIVO ontology

• Defines types– Having individual instances (e.g., persons)

• Defines relationships– Expressed as statements about individuals

• Statements (“triples”) can describe or connect– Data property statements describe individuals– Object property statements connect individuals

Andrew McDonald

author of

has author

research area

research area for

academic staff in

academic staff

Susan Riha

Mining the record: Historical evidence for…

author ofhas author

teaches research area for

research area

headed byNYS WRI

Earth and Atmospheric Sciences

crop management

CSS 4830

Cornell’s supercomputers crunch weather data to help farmers manage chemicals

head offaculty appointment in

faculty members

taught by

featured in

features person

Example relationships for a researcher

Structured data for visualizations

What is Linked Open Data (LOD)?• Data– Structured information, not just documents with text– A common, simple format

• Open– Available, visible, mine-able– Anyone can post, consume, and reuse

• Linked– Directly by reference– Indirectly through common references and inference

An HTTP request can return HTML or data

Linked data indexing for search

Ponce VIVOPonce VIVO

WashU VIVOWashU VIVO

IU VIVO

IU VIVO

CornellIthaca VIVO

CornellIthaca VIVO

Weill

Cornell VIVO

Weill

Cornell VIVO

eagle-IResearchresources

eagle-IResearchresources Harvard

ProfilesRDF

HarvardProfiles

RDF

OtherVIVOsOtherVIVOs

DigitalVitaRDF

DigitalVitaRDF

IowaLokiRDF

IowaLokiRDF

Linked Open DataLinked Open Data

vivosearch

.org

UF VIVO

UF VIVO

Scripps VIVOScripps VIVO

Solrsearchindex

Solrsearchindex

Alter-nateSolr

index

Alter-nateSolr

index

Value for institutions• Common data substrate– Public, granular and direct– Discovery via external and internal search engines– Available for reuse at many levels

• Distributed curation– E.g., affiliations beyond what HR system tracks– Data coordination across functional silos– Feeding changes back to systems of record– Direct linking across campuses

• Data that is visible gets fixed

Enter once, use many times• Provides normalized public data to a range of

campus applications– Parameterized, filtered queries

• By affiliation, org or person id, research area, geography

– Search results

• Easily consumable data– XML, HTML, JSON– Import module for Drupal– Widgets (Duke, UCSF)

Partnerships – research resources

• CTSAconnect– OHSU, Harvard, Cornell, Florida, Buffalo & Stony

Brook– eagle-i sister NIH project – Harvard, OHSU, 7 others

• Facilities, services, techniques, protocols, skills, and research outputs beyond publications– Extended ways to represent expertise– Improve attribution for data and other contributions

to science

Partnerships – ORCID

• ORCID – Open Researcher and Contributor ID– Create an identifier for all authors. Attribute

works to authors through ORCID identifier• ORCID and VIVO– VIVO provides assertion of ORCIDs for people

through institutional identity management. ORCID is an attribute in a VIVO profile.

– Anticipating batch submission of basic researcher registrations by universities

http://orcid.org

Partnerships – research data

• VIVO/ANDS consortium in Australia– Link research data with researcher profiles and

publications– Harvest to national registry

• DataStaR local data registry tool– Add-on to VIVO– Complement to other library data-related services– Institute for Museum and Library Services (IMLS)

grant

Partnerships – information standards

• euroCRIS and the CERIF research data standard– Linked Open Data task force slated to map VIVO

and CERIF ontologies during 2012• CASRAI (http://casrai.org)

– University, corporate, and government partnership– Building a common data dictionary for research

information– Coordinating with euroCRIS

Partnerships - APA

• The American Psychological Association is using VIVO as one component of a trusted identity framework for online submission and review of scholarly publications

• VIVO’s integration with institutional identity systems through the InCommon Federation provides a platform for representing and exchanging information about authors and their works

• http://vivo.apa.org, http://publishtrust.org

Partnerships - Sakai

• Cambridge University is testing VIVO as a profiling component of the new Sakai Open Academic Environment product

• VIVO’s design permits integration into the Sakai interface without major modification to either platform

• NYU and other sites may expand the testing pending grant approval

Institutions Adopting VIVO• American Psychological Assn.• Brown University• Univ. of Cambridge• Univ. of Colorado, Boulder• Cornell University• Duke University• Eindhoven Univ. of Technology• Univ. of Florida• Griffiths University• Indiana University• Johns Hopkins University• MIT

• Univ. of Melbourne• New York University• Northeastern University• Notre Dame University• Univ. of Pennsylvania• Penn State• Scripps Research Institute• SUNY, Stony Brook• U.S. Dept. of Agriculture• Univ. of Virginia• Univ. of Washington• Washington Univ., St Louis• Weill Cornell Medical College

VIVO Conference Corporate SponsorsSponsor 2010 2011 2012

AAAS (Science) X

CrossRef Silver

Elsevier X Silver Gold

IMO Interface Terminology X Gold

Microsoft Research Silver

Nature Jobs Silver

NETE Silver

PLOS X

ProQuest Gold

Springer Gold

Symplectic Gold Platinum

Refworks | COS X Gold

Thomson Reuters X Gold Silver

Wellspring Worldwide Silver

VIVO Service Providers• Symplectic Elements – research management tools

supporting repositories and VIVO– Preliminary data reviews from the Duke VIVO team are positive

• Wellspring Worldwide’s Flintbox.com– Integrating faculty expertise and licensable technologies– Membership free to universities and research institutes– Announced plans to create VIVO-based researcher profiles for

all member organizations• JK Software in Australia

– Custom forms and other interface work• American Psychological Assocation

– Presenting at VIVO 2012 on licensing the APA research thesaurus as a service integrated with VIVO

Development beyond the VIVO project

• Duke – VIVO widgets• Indiana – Query builder interface for HUBzero• Pittsburgh – Digital Vita Documents• Stony Brook – UMLS terminology web service• Weill Cornell – Google Refine integration• Nebraska – BEPress publication importer• UCSF – Open Social container and RDF gadgets• USC – Karma Information Integration Tool

Agenda

• The DuraSpace Incubator• The VIVO Project• DuraSpace and VIVO – Planning and

Next Steps• Questions

VIVO Planning

• Roles and Responsibilities• Governance• Working groups• Advisory bodies• Work objectives and plan• Costs• Revenue streams

Roles and Responsibilities

• Five part-time VIVO staff positions identified for operational roles

• DuraSpace effort– Strategic planning– Business development– Marketing and communications– Community development– Technology best practices– Admin support

Costs

• Existing DuraSpace staff effort (0.85 FTE aggregated) – funded by VIVO

• Dedicated VIVO staff (0.95 FTE aggregated) - Administered by DuraSpace, sourced and funded by VIVO

• VIVO “in-kind” staff (0.75 FTE) hosted at contributing institutions

• Other costs (travel, marketing, event coordination, sponsorship campaign)

Revenues

• Grants and contracts• Hosted services• Corporate sponsorships• Registered service providers• Conference/event sponsorships• Community sponsorship• Founding contributors

Next Steps

• Joint announcement on intent• VIVO raises funds to support its startup

period• Initial fund-raising success will trigger

the start of incubation for VIVO.

Questions