data management planning at the dcc
DESCRIPTION
Presentation given at the Statsbiblioteket, Aarhus, Denmark, 31 October 2012TRANSCRIPT
STATSBIBLIOTEKET, AARHUS31 October 2012
Data management planning at the DCC
Martin DonnellyDigital Curation CentreUniversity of Edinburgh
- Digital Curation Centre, est. 2004- Three partners: Edinburgh, Glasgow and Bath- Primary funder is JISC
Helping to build capacity, capability and skills in data management and curation across the UK’s higher education research community
- DCC Phase 3 Business Plan
www.dcc.ac.uk
Running order
1. Policies, Principles, Expectations2. The DCC and DMP3. DMP Online
1. Policies, Principles, Expectations
• Public good• Preservation• Discovery• Confidentiality• First use• Recognition• Public funding
7 principles agreed by all of the UK research councils in May 2011
http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
UK research funder expectations
• timely release of data– once patents are filed, or on (acceptance for) publication
• open data sharing – minimal or no restrictions– deposit in data centres, structured databases, data enclave
• preservation of data – most funders expect 5-10 years (or more)
• submission of data management and sharing plans…
What is a DMP? (1)
UK research funders typically ask for:
• A short statement/plan to be submitted alongside grant applications (NERC ask for two versions: one at application stage, another when project is underway)
• An outline of what you will create/collect, methods, standards, data management and long-term plans
• How and why – justify your decisions and any limitations
- Just the principal investigator?- What about the research assistants?- And the partners based in other institutions?- And commercial partners?- And the institution’s funding office?- And the Library/IT?
Who’s involved in this process?
Researcher
Research Support Office Data Library / Repository
Computing Support
Faculty Ethics Committee Etc...
DATAMANAGEMENT…PLAN?
UNRULYDATA
Key things to remember
All research projects are different, so there’s no one-size-fits-all DMP approach
The DMP will depend upon the nature of the research AND its context (funder, domain, institution(s) etc)
DMPs are useful communication tools between multiple stakeholders
2. The DCC and DMP
Links to all DMP resources via http://www.dcc.ac.uk/resources/data-management-plans
We’ve responded to requirements by offering support…
Analysed requirements
Developed a Checklist
Provided tools & guidance
Policy analysis
http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
What is a DMP? (2)
In general, funders tend to ask:
- What kinds of data will be created and how?- How will the data be documented and described?- Are there ethical and Intellectual Property issues?- What are the plans for data sharing and third-
party access? - What is the strategy for longer-term preservation?
However, different funders ask these questions in different ways…
§1: Introduction and Context§2: Data Types, Formats, Standards and Capture
Methods§3: Ethics and Intellectual Property§4: Access, Data Sharing and Re-use§5: Short-Term Storage and Data Management§6: Deposit and Long-Term Preservation§7: Resourcing§8: Adherence and Review§9: Agreement/Ratification by Stakeholders§10: Annexes
A Generic and Comprehensive Checklist
Checklist for a Data Management Plan v3.0 (Donnelly and Jones,
March 2011)
http://www.dcc.ac.uk/resources/data-management-plans
Printed DMP resources– “Dealing with Data” (Lyon, 2008)– Analysis of Funder Policies (Jones, 2009)– Checklist for a Data Management Plan
(Donnelly and Jones, 2009)– “How to Develop a Data Management and
Sharing Plan” (Jones, 2011)– “Data Management Plans and Planning”
(Donnelly, 2012) in Pryor (ed.) Managing Research Data, London: Facet
– DMP Online briefing paper (Donnelly and Richardson, forthcoming 2012)
Links to all DCC resources via http://www.dcc.ac.uk/resources/data-management-plans
3. DMP Online
What does do?
A free and Open web-based tool enabling users to...
i. Create, store and update multiple versions of Data Management Plans across the research lifecycle
ii. Meet a variety of specific data-related requirements (from funders, institutions, publishers, etc.) in a single place
iii. Get tailored guidance on best practice and helpful contacts, at the point of need
iv. Customise, export and share DMPs in a variety of formats in order to facilitate communication within and beyond research projects
New features in v3.0 (May 2012)- Improved user interface, inc. customisable
institutional versions- New features
- Overlaying multiple templates for ‘hybrid’ DMPs- Multiple template phases (e.g. pre- / during / post-
project)- Granular read / write / share permissions- Multilingual support / boilerplate text- API for systems interoperability
- Endorsement from funders
Technologies involved (v3.0)
– Ruby on Rails (v3.1.3)– JavaScript (jQuery v1.7.1)– MySQL database (v5+)– Hosting: University of Edinburgh Information Services
Virtual Hosting (13 managed servers across 2 sites)– Authentication: registered users with passwords encrypted
in DB (we have also used Shibboleth for integration with UK Access Management Federation for Education and Research)
– Various export formats (PDF, DOCX, XLSX, CSV, XML etc)
http://dmponline.dcc.ac.uk
HEFCE Institutional Engagements: from planning to practice
- We are currently working with c. 20 institutions over an 18 month period to improve their data management capabilities
- Broad variety of institutional types and sizes, from research intensive ancient universities, to new universities and specialist institutions (e.g. art schools) from all parts of the UK
- Institutions select from a ‘menu’ of tools and services, e.g. (next slide)
Components of a Data Management Strategy (Research and Admin)
DCC Tools DCC Services
Policy Data Asset Framework (DAF)
Policy development
Planning DMP Online Strategy development
Advocacy CARDIO Training
Tools DRAMBORA Workflow assessment
Training Costing
Institutional data catalogues (discovery)
The Menu
Institutional workflowsDMP Online can also be used in conjunction with other tools that support the data management/curation lifecycle, e.g.…
- DAF (Data Asset Framework)- DRAMBORA (Digital Repository Audit Method
Based On Risk Assessment)- CARDIO (Collaborative Assessment of
Research Data Infrastructure and Objectives)
Also non-DCC tools:
- LIFE- Planets tools- CRIS systems- and more
Systems– CRIS / admin systems– RCUK Je-S system– Institutional Repositories– DDI repository– DMP Tool (US) (TBC)– Other instances of DMP
Online via federated model (? -TBC)
– Metadata catalogues (?)
External connectionsStandards / protocols– CERIF*
– SWORD2– DDI* – RDF (DMP-Oxford)
* via the RESTful API
For machine readership…
- Facilitates quick public sharing
- Compatible with API for linking with other systems
- Minimal formatting
For human readership…
- Pleasant formatting
- Editable. Can be used in conjunction with (e.g.) MS Sharepoint
- Removes all formatting
How to connect: six export formats
- Guidance- Generic data management guidance (in conjunction with
UK Data Archive)- Tailored guidance developed in collaboration with funders
themselves (ESRC, MRC, Wellcome Trust)- Institution-specific guidance developed with key contacts in
universities- Disciplinary guidance developed and deployed through JISC
MRD projects (e.g. DMT Psych at York, DATUM for Health at Northumbria)
- Templates developed with funders and institutions- Joint training events organised and delivered by DCC
and
Collaborations
- DCC is a founder member of the US DMPTool consortium, and we continue to work together. Joint workshops at IDCC in Amsterdam (Jan 2013) and iSchools Conference in Texas (Feb 2013)
- We’re working with ANDS in Australia to deploy DMP Online on the NECTAR academic cloud
- European Commission has encouraged DCC to propose a pilot DMP tool for Horizon 2020. Expecting a DMP requirement in the next funding programme
DMP International
Mange Tak!
Image credits: Slide 1 - http://upload.wikimedia.org/wikipedia/commons/8/88/LernaeanHydraRephael.jpgSlide 4 - http://www.flickr.com/photos/axis/ Slide 9 - http://en.wikipedia.org/wiki/File:Hercules_slaying_the_Hydra.jpg
This work is licensed under the Creative Commons Attribution 2.5 UK: Scotland License.
Martin DonnellyDigital Curation CentreUniversity of Edinburgh
[email protected]: @mkdDCC
www.dcc.ac.uk/resources/data-management-plans
For other DCC services see www.dcc.ac.uk or follow us on twitter @digitalcuration and #ukdcc