Building software to support the research community’s data curation and preservation needs

G. Sayeed Choudhury, Hanh Vu, Elliot Metsger, Aaron Birkland

dataconservancy.org

The Data Conservancy (DC) was launched through a grant from the National Science Foundation’s DataNet program, which built upon prior experience with managing data from the Sloan Digital Sky Survey. The grant gave the DC team an opportunity to broaden its data infrastructure development and gain a better understanding of the challenges in collecting, preserving and curating different types of research data. Since the DataNet funding, the Data Conservancy has redesigned and refactored its core infrastructure to leverage existing software and technologies and to build deeper connections with both research and technology communities. Most notably, we have embraced the Linked Data Platform (LDP) approach to data representation by building our data archive on the Fedora 4 repository platform and by leading the development of the RMap Services with funding from the Sloan Foundation.
July 2016

Current DC Components

•  Packaging Specification
•  Packaging tools
•  Package Ingest Service
•  Fedora data archive
•  Fedora API-X framework
•  RMap Services

Packaging Specification

•  Based on the popular BagIt specification
•  Domain-model agnostic
•  Adds semantic information about content
•  May be used with any RDF-based domain model

Packaging Tool

•  A JavaFX point-and-click interface
•  Produces DC-specification-compliant packages
•  Supports multiple domain models
•  Allows semantic enrichment

Package Ingest Service

•  Deposits package content into an archive
•  Exposes content as linked data
•  Fedora 4 is the current reference implementation of a DC archive

Fedora API-X

•  Extends the core functionality of a Fedora 4 repository
•  Facilitates:
    •  Mapping between domain-specific data models and the Fedora data model
    •  Support for commonly used web service standards
    •  Domain-specific federated discovery and access
    •  Support for advanced data curation capabilities

RMap Services

•  Protocol for Linked Data representations
•  Developed through the RMap project
•  Captures relationships between a publication and its underlying data
•  Distributed Scholarly Compound Object (DiSCO) protocol for resource aggregation
•  OAI-ORE based
•  REST APIs are available
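The Packaging Specification listed above builds on BagIt, so a short sketch of the underlying BagIt layout may help make the idea concrete. The Python below writes a minimal bag (a `bagit.txt` declaration, a `data/` payload directory, and a `manifest-sha256.txt` checksum manifest) and then verifies it. This is only plain BagIt under stated assumptions: it does not implement the DC Packaging Specification or its semantic (RDF) additions, and the function names `make_bag` and `verify_bag`, the example file names, and the choice of BagIt version 1.0 are illustrative, not taken from the DC tools.

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Mapping


def make_bag(bag_dir: Path, payload: Mapping[str, bytes]) -> Path:
    """Write a minimal BagIt-style bag (hypothetical helper, not a DC tool)."""
    data_dir = bag_dir / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    # Bag declaration file required by BagIt.
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )

    # Payload files go under data/; the manifest lists one checksum per file.
    manifest_lines = []
    for name, content in sorted(payload.items()):
        target = data_dir / name
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(content)
        digest = hashlib.sha256(content).hexdigest()
        manifest_lines.append(f"{digest}  data/{name}")

    (bag_dir / "manifest-sha256.txt").write_text(
        "\n".join(manifest_lines) + "\n", encoding="utf-8"
    )
    return bag_dir


def verify_bag(bag_dir: Path) -> bool:
    """Recompute each payload checksum and compare it with the manifest."""
    manifest = (bag_dir / "manifest-sha256.txt").read_text(encoding="utf-8")
    for line in manifest.splitlines():
        if not line.strip():
            continue  # tolerate blank lines
        digest, rel_path = line.split(maxsplit=1)
        actual = hashlib.sha256((bag_dir / rel_path).read_bytes()).hexdigest()
        if actual != digest:
            return False
    return True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        bag = make_bag(Path(tmp) / "example-bag", {"observations.csv": b"id,value\n1,42\n"})
        print("bag is valid:", verify_bag(bag))
```

A package that conforms to the DC specification would carry additional descriptive files alongside this structure; the sketch stops at the BagIt core because that is the only part whose format is fixed by the public BagIt specification.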

