research data management services at uwa (november 2015)
TRANSCRIPT
Research Data Management Services Katina Toufexis Research Data CoordinatoreResearch Support UnitUniversity [email protected] 5319
Research Data Management
1.Manage your data
2.Store your data
3.Share your data
Supporting Data @ UWA
www.library.uwa.edu.au/research/services
What are we covering today?
• What is Research Data?
• Why should you care about Research Data Management?
• What services does the Library offer?
• Data Management Planning help
• Storage
• UWA’s IRDS
• Pawsey Supercomputing (Luke Edwards)
• Sharing your data
What is Research Data?
Journal article
Data?What data?
Book
http://cdn.list25.com/wp-content/uploads/2013/03/Slide173.jpg
What do we mean by research data?
What is Research Data?
It is not possible to apply a uniform definition of research data across all disciplines. Research data may be numerical, textual, audio-visual, digital or physical, depending on the discipline and the nature of the research.
What do we mean by digital research data?
What is Digital Research Data?
Observational
Experimental
Simulation
Derived or Compiled
Reference or Canonical
Images: http://www.designweek.co.uk/news/kram/weisshaar-visualises-big-data/3037975.article; http://www.auntminnie.com/user/images/content_images/sup_mol/2010_07_30_15_55_33_600_PET_CT_MRI_02_450.jpg; http://2.bp.blogspot.com/-k35MuQYvTZA/TbTov-2kdRI/AAAAAAAAADE/oBF-FZPM6Wo/s400/multivariate.jpg; http://labs.openviewpartners.com/files/2012/05/graph_with_stacks_of_coins-e1349903025956.jpg; http://www.dpgp.org/syntenic_assembly/images/CG15386.png; http://www.matrixscience.com/images/dbmanager_2.gif
Humanities Research Data?
What is Research Data?
Artist’s statement Exhibition catalogues Critical reviews Visual diaries Journals Drawings Photographs Manuscripts Musical annotations 3D models Audio-visual recordings of a creative
work
http://ands.org.au/presentations/index.html
The Data Deluge
“Data is more like soup – it’s messy and you don’t
know what’s in it.” – Liz Lyon (UK DCC)
“I worry there won't be enough people around to do
the analysis.”–Chris Ponting (University of Oxford UK, Computational biologist)
“A single DNA sequencer can now generate in a day what it took 10 years to collect for the Human Genome Project. Computers are central to archiving and
analysing this information, but their processing power isn’t increasing fast
enough, and their costs are decreasing too slowly, to keep up with the deluge.”
- Elizabeth Pennisi (Science Author)
When should I think about Research Data?Research Lifecycle – incorporating research data management
Benefits of Research Data Management
• Meets Compliance
• Promotes Efficiency
• Ensures Security
• Allows Access
• Improves Quality
\Why should YOU care about research data?
Research Data Management 101
Benefits of Research Data Management
Research Data Management 101
Security– Risk. Where is your data?
– Safeguards against data loss.
– Ensures confidentiality and ethical compliance.
– Guarantees legal compliance to intellectual property rights such as copyright.
Why protect your data?
Research Data Management 101
Benefits of Research Data Management
Allows for data replication or reproducibility.Increases the accuracy or reliability of the data.Improves research data integrity.
Improves research process.Encourages systematic documentation and descriptions of the research data.Provides guidelines and procedures ensuring consistency.
Allows data validation and verification.Enables collaborative research opportunities.Prevents duplication.Allows data sharing and future reuse.Increases researcher citations.
Research Data Management 101
Benefits of Research Data Management
OECD cares about research data
Access to research data increases the returns from public investment in this area; reinforces open scientific inquiry; encourages diversity of studies and opinion; promotes new areas of work and enables the exploration of topics not envisioned by the initial investigators.
(OECD principles and guidelines for access to research data from public funding 2007)
http://www.oecd.org/sti/sci-tech/oecdprinciplesandguidelinesforaccesstoresearchdatafrompublicfunding.htm
Governments care about research data
The Australian Government now declares that, in order to promote greater participation in Australia’s democracy, it is committed to open government based on a culture of engagement, built on better access to and use of government held information, and sustained by the innovative use of technology.
Declaration of Open Government
Governments care about research data
Governments care about research data
“Open access to information held by the State Government will help stimulate the development of new, innovative applications by all sorts of private individuals, companies and non-government organisations”
– Campbell Newman 9 Oct 2012 http://statements.qld.gov.au/Statement/2012/10/9/queensland-governments-open-data-revolution-begins
Governments care about research data
“This declaration commits the government to proactively release data. Government data will be open by default; proactively released by government in accordance with international best practice.”
– HonJay Weatherill MP, Sept 2013 http://dpc.sa.gov.au/sites/default/files/pubimages/documents/Signed-Declaration-of-Open-Data.pdf
FUNDERS care about research data
Australian Code for the Responsible Conduct of Research (NHMRC, ARC and Universities Australia 2007)Section 2: Management of Research Data & Primary Materials
“The responsible conduct of research includes the proper management and retention of the research data. …The central aim is that sufficient materials and data are retained to justify the outcomes of the research and to defend them if they are challenged. The potential value of the material for further research should also be considered, particularly where the research would be difficult or impossible to repeat.”
https://www.nhmrc.gov.au/guidelines/publications/r39
INSTITUTIONS care about research data
UWA Code of Conduct for the Responsible Practice of Research
http://www.research.uwa.edu.au/staff/research-policy/guidelines
Section 2 refers to the management of research data and primary materials and states:
2.1 Data (including electronic data) must be recorded in a durable and appropriately referenced form.
2.2 Data must be held for sufficient time to allow access and reference. Recommended a minimum 5 years from date publication, but up to 15 years for specific types (eg clinical studies)
2.3 Wherever possible, original data must be retained in the school or research centre in which it was generated... In all cases, prior to the publication of research findings a Location of Data Form must be completed.
These guidelines should be seen as a framework for sound research practice and for the protection of individual research workers, including both staff and postgraduate research students, from possible misunderstandings.
ARC Discovery Grant requirements, February 2014:
MANAGEMENT OF DATA
Outline plans for the management of data produced as a result of the proposed research, including but not limited to storage, access and re-use arrangements.
FUNDERS care about research data
NHMRC Statement on Data Sharing, July 2015:
Data Sharing
NHMRC encourages data sharing and providing access to data and other research outputs (metadata, analysis code, study protocols, study materials and other collected data) arising from NHMRC supported research.
FUNDERS care about research data
• NHMRC encourages researchers to disseminate and share their research data through publicly accessible databases or repositories
• Limitations? • ethics (particularly consent), legal, IP• data format and standards, variable ontologies used to describe data
FUNDERS care about research data
https://www.nhmrc.gov.au/grants-funding/policy/nhmrc-statement-data-sharing
The Public Library of Science PLOS Data Sharing policy:
PLOS journals require authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception.
From March 2014, you will need to include a “data availability statement” when you submit your article, explaining where and how your data can be accessed.
PUBLISHERS care about research data
http://www.plosone.org/static/policies#sharing
Nature
“An inherent principle of publication is that others should be able to replicate and build upon the authors' published claims. Therefore, a condition of publication in a Nature journal is that authors are required to make materials, data and associated protocols promptly available to readers without undue qualifications”.
PUBLISHERS care about research data
http://www.nature.com/authors/policies/availability.html
BMJIn Jan 2013 The BMJ was one of the first medical journals to require sharing of individual patient data for trials of drugs or devices.
Our initial data sharing policy focused on trials of drug and devices because many high profile, serious allegations of selective or non-reporting of trial results related to such products.
From 1 July 2015 The BMJ will extend its requirements for data sharing to apply to all submitted clinical trials, not just those that test drugs or devices.
PUBLISHERS care about research data
http://www.bmj.com/content/350/bmj.h2373
British Journal of Educational Technology
“BJET strongly encourages authors of articles
describing empirical research to make their data available to others, for example through an institutional or other repository. If this is not possible then it will not necessarily preclude publication in the Journal, but you should note that it is one of the criteria by which your work will be assessed.”.
PUBLISHERS care about research data
Nature Journals: Scientific Data JournalDataset-ONLY journals
PUBLISHERS care about research data
http://www.nature.com/sdata/
What about the Humanities??
PUBLISHERS care about research data
Archives of Scientific Psychology
DATA REUSE and Open Data Citation advantages
The citation benefit intensified over time... ...with publications from 2004 and 2005 cited 30 per cent more often
if their data was freely available. Every 100 papers with open data prompted 150 "data reuse papers"
within five years Original authors tended to use their data for only two years, but
others re-used it for up to six years.
Piwowar HA, Vision TJ. (2013) Data reuse and the open data citation advantage. PeerJ 1:e175 http://dx.doi.org/10.7717/peerj.175
Supporting Data @ UWA
www.library.uwa.edu.au/research/services
Creating a Research Data Management Plan
Creating a Research Data Management Plan
The Data Management Plan is:
• an online form
• uses the Qualtrics survey platform
• Asks relevant questions pertaining to your research data
• Sends you an electronic plan directly to your email after completion
Research Data Management Toolkit
http://www.library.uwa.edu.au/research/research-data-management-toolkit
IRDS – Data Storage at UWA
https://www.youtube.com/watch?v=-DSl1pCMkDc
IRDS
IRDS – New developments
“Any Stores with less than 50% usage for longer than 3 months will be automatically reduced in size.”
Access to External Collaborators
IRDS – New developments
Pheme authentication provided to external collaborators via HR Form:
http://www.hr.uwa.edu.au/__data/assets/pdf_file/0006/2172606/Commencement_of_Non-university_Staff.pdf
• All data and associated information created in the course of conducting research is subject to the UWA Policy on Records Management.
• All research records must be retained in accordance with the WA University Sector Disposal Authority.
• Retention periods range from a minimum of 7 years to permanent depending on the significance and type of research.
• Information can be kept longer if required but must not be destroyed before the required minimum retention period.
Information Governance Services
Retention and Disposal of Research Data
How long must research data be kept?
IRDS vs Dropbox
• UWA’s Information Governance Services (IGS) provide guidance on using public cloud storage.
– http://www.igs.uwa.edu.au/policies/guides/auth/cloud-storage
IRDS vs Dropbox
• University Policy on: Institutional Data Centre
• University Policy on: Records Management
• Computer and Software Use Regulations
• University Policy on: Records Management
• UWA Code of Conduct for the Responsible Practice of Research
• UWA Recordkeeping Plan
• Western Australian University Sector Disposal Authority
• Australian Code for the Responsible Conduct of Research
• University’s Policy on Privacy of Electronic Material
AARNet+ CloudStor
Supporting Data @ UWA
www.library.uwa.edu.au/research/services
Research Data Online (RDO)
Research Data Online (RDO)
• Publicly accessible• Discoverable• Harvested into RDA• DOIs OR Locally
generated handles
Research Data Online (RDO)
Research Data Online (RDO)
RDO (UWA) RDA (ANDS)
What about in Australia?
What about in Australia?
What about in Australia?
DISCOVERABLE
RDO vs IRDS ??
Institutional Research Data Store (IRDS)
Research Data Online (RDO)
Hooper, CM; Castleden IR; Aryamanesh N; Jacoby RP; Millar AH, (2015): The compendium of crop Proteins with Annotated Locations (cropPAL) version 1. The University of Western Australia. http://dx.doi.org/10.4225/23/556e4e260e093
Research Data Online (RDO)
Research Data Online (RDO)
PublicationsData
Data connections can be counted
Images: http://www.bigdatarevolutionbook.com/ , http://auditfutures.org/publications/
Research Data Online (RDO)
PublicationsData
Data connections can be counted
Images: http://www.bigdatarevolutionbook.com/ , http://auditfutures.org/publications/
Research Data Online (RDO)
Senior Librarians will help you establish your ORCID account
Toolkit RDO Submitter’s Quick Start Guide
Research Data Online (RDO)
http://www.ersa.edu.au/sites/default/files/research-data-sharing-lr.jpg
AUSGOAL Licences
Licensing your data
http://www.ausgoal.gov.au/creative-commons-v4.0
Licence Chooser Tool
Licensing your data
http://www.ausgoal.gov.au/licence-chooser
Currently Offline
Licence Chooser Tool
http://creativecommons.org/choose/
Contacts
Questions?
Katina ToufexisResearch Data [email protected] 5319
http://ands.org.au/presentations/index.html
Senior Librarian for your Facultyhttp://www.library.uwa.edu.au/contact/faculty-liaison-coordinators
Data Archiving and Networked Services (DANS)
Sharing data: good for science, good for you
DANS promotes sustained access to digital research data. For this, DANS encourages scientific researchers to archive and reuse data in a sustained form, for instance via the Australian National Data Service (ANDS)