7-23-12 duraspace & vivo sponsor webinar slides
TRANSCRIPT
DuraSpace and VIVO
DuraSpace Sponsors WebinarJuly 23, 2012
Michele Kimpton - DuraSpaceJonathan Markow - DuraSpace
Dean Krafft – Cornell/VIVOJon Corson-Rikert – Cornell/VIVO
Using the Webinar Platform
• 2-way audio for all participants is muted
• We will utilize the Chat Window for the Q&A portion or you may use it if you are having technical difficulties
• You may type your question here & hit ‘enter’
DuraSpace and VIVO
DuraSpace Sponsors WebinarJuly 23, 2012
Michele Kimpton - DuraSpaceJonathan Markow - DuraSpace
Dean Krafft – Cornell/VIVOJon Corson-Rikert – Cornell/VIVO
Agenda
• The DuraSpace Incubator• The VIVO Project• DuraSpace and VIVO – Current Status
and Next Steps• Questions
The DuraSpace Incubator
• Uses the Apache and Jasig models as a starting point
• Projects are mentored by DuraSpace• Projects work towards best practices,
governance, sustainability• End of process is status as officially
recognized project
DuraSpace Umbrella Services
• Legal• Licensing• Governance• Developer resources• Mentoring• Technology expertise• International exposure• Contributors• Recognition• Marketing and
Communications
• Webinars• Education• Peer Support• Conferences• Community
development• Service providers• Admin support• Grants• Advocacy• Strategic planning
Advantages to DuraSpace and our Community
• Broader community support• New technologies of interest to
DuraSpace community (e.g., VIVO: semantic web, linked open data)
• Technology synergies (e.g., Fedora)• Attractiveness to corporate sponsors• Wider base of developers• Management and overhead cost
efficiencies
DuraSpace Expectations
• Projects must meet incubation criteria vis-à-vis best practices, licensing, broad committer support, etc.
• Project community must fund DuraSpace resources
• Project works with DuraSpace to develop sustainability plan
Agenda
• The DuraSpace Incubator• The VIVO Project• DuraSpace and VIVO – Current Status
and Next Steps• Questions
VIVO: An Open Source Tool for Describing and Linking Researchers
and Research
DuraSpace Sponsors WebinarJuly 23, 2012
Dean B. Krafft (presenter) andJon Corson-Rikert
Cornell University Library
VIVO CollaborationCornell UniversityDean Krafft (Cornell PI)
Manolo BeviaJim Blake
Nick CappadonaBrian Caruso
Jon Corson-RikertElly Cramer
Medha DevareElizabeth Hines
Huda KhanDepak Konidena
Brian LoweJoseph McEnerneyHolly Mistlebauer
Stella MitchellAnup Sawant
Christopher WestlingTim Worrall
Rebecca Younes
University of FloridaMike Conlon (VIVO and UF PI)
Beth AutenMichael Barbieri
Chris BarnesKaitlin Blackburn
Cecilia BoteroKerry Britt
Erin BrooksAmy Buhler
Ellie BushhousenLinda Butson
Chris CaseChristine Cogar
Valrie DavisMary Edwards
Nita FerreeRolando Garcia-Milan
George HackChris HainesSara HenningRae Jesano
Margeaux JohnsonMeghan Latorre
Yang LiJennifer LyonPaula Markes
Hannah NortonJames Pence
Narayan RaumNicholas Rejack
Alexander RockwellSara Russell Gonzalez
Nancy SchaeferDale SchepplerNicholas SkaggsMatthew Tedder
Michele R. TennantAlicia Turner
Stephen Williams
Indiana UniversityKaty Borner (IU PI)
Kavitha ChandrasekarBin Chen
Shanshan ChenRyan CobineJeni Coffey
Suresh DeivasigamaniYing Ding
Russell DuhonJon Dunn
Poornima GopinathJulie Hardesty
Brian KeeseNamrata Lele
Micah LinnemeierNianli Ma
Robert H. McDonaldAsik Pradhan Gongaju
Mark PriceMichael Stamper
Yuyin SunChintan TankAlan Walsh
Brian WheelerFeng Wu
Angela Zoss
Ponce School of MedicineRichard J. Noel, Jr. (Ponce PI)
Ricardo Espada ColonDamaris Torres Cruz
Michael Vega Negrón
This project is funded by the National Institutes of Health, U24 RR029822"VIVO: Enabling National Networking of Scientists”
The Scripps Research Institute
Gerald Joyce (Scripps PI)Catherine Dunn
Sam KatkovBrant KelleyPaula King
Angela MurrellBarbara NobleCary Thomas
Michaeleen Trimarchi
Washington University School of Medicine in St. Louis
Rakesh Nagarajan (WUSTL PI)Kristi L. HolmesCaerie HouchinsGeorge JosephSunita B. Koul
Leslie D. McIntosh
Weill Cornell Medical CollegeCurtis Cole (Weill PI)
Paul AlbertVictor Brodsky
Mark BronnimannAdam Cheriff
Oscar CruzDan Dickinson
Richard HuChris Huang
Itay KlazKenneth Lee
Peter MicheliniGrace Migliorisi
John RuffingJason Specland
Tru TranVinay Varughese
Virgil Wong
What is VIVO?
• A semantic-web-based researcher and research discovery tool– People plus much more
• Institution-wide, publicly-visible information– For external as well as internal audiences
• An open, shared platform for connecting scholars, communities, and campuses using Linked Open Data
What does VIVO do?• Integrates multiple sources of data– Systems of record– Faculty activity reporting– External sources (e.g., Scopus, PubMed, NIH
RePORTER)• Provides a review and editing interface– Single sign-on for self-editing or by proxy
• Provides integrated, filterable feeds to other websites
A brief VIVO history2003-2005 First development for the life sciences
at Cornell, as a relational database2006-2008 Expansion to all disciplines at Cornell,
and conversion to Semantic Web2009-2012 National Institutes of Health-sponsored
VIVO: Enabling the National Networking of Scientists project transforms VIVO to a multi-institutional open source platform
2012+ Transitioning VIVO to DuraSpace for open community development
What does VIVO model?• People, but also organizations, grants, programs,
projects, publications, events, facilities, and research resources
• Relationships among the above– Meaningful connections among people and activities– Bidirectional– Context and navigation from one point of interest to
another• Links to URIs outside VIVO– Concepts– People, places, organizations, events
Typical data sources• HR and/or directory – people, appointments• Research administration – grants & contracts• Registrar – courses• Graduate-level programs/advising affiliations• Faculty reporting system(s) and/or self or proxy
editing – narratives, service, research areas, awards• Events calendar• News releases and/or press clippings• PubMed, Scopus, repositories – publications
What is a Semantic Web application?• Provides data readable by machines, not just text for
humans• Provides self-describing data via shared ontologies– Defined types– Defined relationships
• Provides search & query augmented by relationships• Does simple reasoning to categorize and find
associations– Teaching faculty = any faculty member teaching a course– All researchers involved with any gene associated with
breast cancer (through research project, publication, etc.)
The VIVO ontology
• Defines types– Having individual instances (e.g., persons)
• Defines relationships– Expressed as statements about individuals
• Statements (“triples”) can describe or connect– Data property statements describe individuals– Object property statements connect individuals
Andrew McDonald
author of
has author
research area
research area for
academic staff in
academic staff
Susan Riha
Mining the record: Historical evidence for…
author ofhas author
teaches research area for
research area
headed byNYS WRI
Earth and Atmospheric Sciences
crop management
CSS 4830
Cornell’s supercomputers crunch weather data to help farmers manage chemicals
head offaculty appointment in
faculty members
taught by
featured in
features person
Example relationships for a researcher
What is Linked Open Data (LOD)?• Data– Structured information, not just documents with text– A common, simple format
• Open– Available, visible, mine-able– Anyone can post, consume, and reuse
• Linked– Directly by reference– Indirectly through common references and inference
Linked data indexing for search
Ponce VIVOPonce VIVO
WashU VIVOWashU VIVO
IU VIVO
IU VIVO
CornellIthaca VIVO
CornellIthaca VIVO
Weill
Cornell VIVO
Weill
Cornell VIVO
eagle-IResearchresources
eagle-IResearchresources Harvard
ProfilesRDF
HarvardProfiles
RDF
OtherVIVOsOtherVIVOs
DigitalVitaRDF
DigitalVitaRDF
IowaLokiRDF
IowaLokiRDF
Linked Open DataLinked Open Data
vivosearch
.org
UF VIVO
UF VIVO
Scripps VIVOScripps VIVO
Solrsearchindex
Solrsearchindex
Alter-nateSolr
index
Alter-nateSolr
index
Value for institutions• Common data substrate– Public, granular and direct– Discovery via external and internal search engines– Available for reuse at many levels
• Distributed curation– E.g., affiliations beyond what HR system tracks– Data coordination across functional silos– Feeding changes back to systems of record– Direct linking across campuses
• Data that is visible gets fixed
Enter once, use many times• Provides normalized public data to a range of
campus applications– Parameterized, filtered queries
• By affiliation, org or person id, research area, geography
– Search results
• Easily consumable data– XML, HTML, JSON– Import module for Drupal– Widgets (Duke, UCSF)
Partnerships – research resources
• CTSAconnect– OHSU, Harvard, Cornell, Florida, Buffalo & Stony
Brook– eagle-i sister NIH project – Harvard, OHSU, 7 others
• Facilities, services, techniques, protocols, skills, and research outputs beyond publications– Extended ways to represent expertise– Improve attribution for data and other contributions
to science
Partnerships – ORCID
• ORCID – Open Researcher and Contributor ID– Create an identifier for all authors. Attribute
works to authors through ORCID identifier• ORCID and VIVO– VIVO provides assertion of ORCIDs for people
through institutional identity management. ORCID is an attribute in a VIVO profile.
– Anticipating batch submission of basic researcher registrations by universities
http://orcid.org
Partnerships – research data
• VIVO/ANDS consortium in Australia– Link research data with researcher profiles and
publications– Harvest to national registry
• DataStaR local data registry tool– Add-on to VIVO– Complement to other library data-related services– Institute for Museum and Library Services (IMLS)
grant
Partnerships – information standards
• euroCRIS and the CERIF research data standard– Linked Open Data task force slated to map VIVO
and CERIF ontologies during 2012• CASRAI (http://casrai.org)
– University, corporate, and government partnership– Building a common data dictionary for research
information– Coordinating with euroCRIS
Partnerships - APA
• The American Psychological Association is using VIVO as one component of a trusted identity framework for online submission and review of scholarly publications
• VIVO’s integration with institutional identity systems through the InCommon Federation provides a platform for representing and exchanging information about authors and their works
• http://vivo.apa.org, http://publishtrust.org
Partnerships - Sakai
• Cambridge University is testing VIVO as a profiling component of the new Sakai Open Academic Environment product
• VIVO’s design permits integration into the Sakai interface without major modification to either platform
• NYU and other sites may expand the testing pending grant approval
Institutions Adopting VIVO• American Psychological Assn.• Brown University• Univ. of Cambridge• Univ. of Colorado, Boulder• Cornell University• Duke University• Eindhoven Univ. of Technology• Univ. of Florida• Griffiths University• Indiana University• Johns Hopkins University• MIT
• Univ. of Melbourne• New York University• Northeastern University• Notre Dame University• Univ. of Pennsylvania• Penn State• Scripps Research Institute• SUNY, Stony Brook• U.S. Dept. of Agriculture• Univ. of Virginia• Univ. of Washington• Washington Univ., St Louis• Weill Cornell Medical College
VIVO Conference Corporate SponsorsSponsor 2010 2011 2012
AAAS (Science) X
CrossRef Silver
Elsevier X Silver Gold
IMO Interface Terminology X Gold
Microsoft Research Silver
Nature Jobs Silver
NETE Silver
PLOS X
ProQuest Gold
Springer Gold
Symplectic Gold Platinum
Refworks | COS X Gold
Thomson Reuters X Gold Silver
Wellspring Worldwide Silver
VIVO Service Providers• Symplectic Elements – research management tools
supporting repositories and VIVO– Preliminary data reviews from the Duke VIVO team are positive
• Wellspring Worldwide’s Flintbox.com– Integrating faculty expertise and licensable technologies– Membership free to universities and research institutes– Announced plans to create VIVO-based researcher profiles for
all member organizations• JK Software in Australia
– Custom forms and other interface work• American Psychological Assocation
– Presenting at VIVO 2012 on licensing the APA research thesaurus as a service integrated with VIVO
Development beyond the VIVO project
• Duke – VIVO widgets• Indiana – Query builder interface for HUBzero• Pittsburgh – Digital Vita Documents• Stony Brook – UMLS terminology web service• Weill Cornell – Google Refine integration• Nebraska – BEPress publication importer• UCSF – Open Social container and RDF gadgets• USC – Karma Information Integration Tool
Agenda
• The DuraSpace Incubator• The VIVO Project• DuraSpace and VIVO – Planning and
Next Steps• Questions
VIVO Planning
• Roles and Responsibilities• Governance• Working groups• Advisory bodies• Work objectives and plan• Costs• Revenue streams
Roles and Responsibilities
• Five part-time VIVO staff positions identified for operational roles
• DuraSpace effort– Strategic planning– Business development– Marketing and communications– Community development– Technology best practices– Admin support
Costs
• Existing DuraSpace staff effort (0.85 FTE aggregated) – funded by VIVO
• Dedicated VIVO staff (0.95 FTE aggregated) - Administered by DuraSpace, sourced and funded by VIVO
• VIVO “in-kind” staff (0.75 FTE) hosted at contributing institutions
• Other costs (travel, marketing, event coordination, sponsorship campaign)
Revenues
• Grants and contracts• Hosted services• Corporate sponsorships• Registered service providers• Conference/event sponsorships• Community sponsorship• Founding contributors
Next Steps
• Joint announcement on intent• VIVO raises funds to support its startup
period• Initial fund-raising success will trigger
the start of incubation for VIVO.