ischools workshop - 4 - data discovery
TRANSCRIPT
![Page 1: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/1.jpg)
Natasha Simons
Managing Research Data WorkshopData discovery and metadata
iSchools Data Science Winter InstituteHong Kong7 December 2017
![Page 2: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/2.jpg)
Why do people search for data?
![Page 3: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/3.jpg)
Why do people search for data*?•Exploratory/Scoping
•Reuse/Secondary data analysis
•Can be starting point or ad hoc
•Peer review
•Reproduce/extend results
•Repurpose (e.g. for mashups, visualisations, simulations)
•Verify claims (e.g. report findings)
*Not in any order; not exhaustive!
![Page 4: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/4.jpg)
How do people find data?
![Page 5: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/5.jpg)
How do people find data*?•Google
•Ask a colleague
•Find link to data in a journal article
•Data journals
•Data registries e.g. re3data
•Open data portals e.g. data.gov
•Institutional repositories
•Data / Discipline repositories e.g. Dryad
•Project website
•Data discovery aggregators like Research Data Australia
•Library catalogues, databases
*Not in any order; not exhaustive!
![Page 6: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/6.jpg)
Characteristics of finding data
When creating metadata records, keep in mind that finding data is:
● Movable feast / changing beast
● No standard practice, universal standard or vocab
● Databases are non-exhaustive
● Methods for searching and terms driven by why people are
looking and how the data is stored
![Page 7: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/7.jpg)
FAIR DataTo aid discovery and reuse, data needs to be:
● Findable
● Accessible
● Interoperable
● Reusable
More on FAIR Data:● FAIR Data Principles (FORCE11): https://www.force11.org/group/fairgroup/fairprinciples
● ANDS and FAIR Data: https://www.ands.org.au/working-with-data/fairdata
● FAIR Data ANDS Webinar series: https://www.youtube.com/user/andsdata (FAIR Data Playlist)
ANDS/Nectar/RDS
“FAIRground” booth
at eResearch
Australasia 2017
![Page 8: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/8.jpg)
Hands-on exercise: data descriptionYour task:
1. Divide into pairs
2. Each pair take one of the CSV data files
3. Describe the data by creating a metadata record. Think about:
title, creators, date, short description and so on.
You have 15 minutes - go!!
If you are unfamiliar with metadata, take few minutes
to view the introductory video at:
https://www.youtube.com/watch?v=ABF2FvSPVYE
![Page 9: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/9.jpg)
Class discussionHow did you go?
What did you learn?
Here are the original metadata descriptions:
CSV dataset #1 - https://data.qld.gov.au/dataset/marine-oil-spills-
data
CSV dataset #2 –
https://data.qld.gov.au/dataset/koala-hospital-data
![Page 10: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/10.jpg)
Australian data discovery portals
![Page 11: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/11.jpg)
Open data case studyUniversity of Tasmania - IMAS Marine Data
https://www.youtube.com/watch?v=_Bs56PnYK9g
More Open Data project stories: https://www.youtube.com/user/andsdata
(Open Data Playlist)
![Page 13: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/13.jpg)
TERN - Terrestrial/ecology data
http://portal.tern.org.au/#/00629597
![Page 17: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/17.jpg)
re3data includes Aus data repositories
![Page 18: Ischools workshop - 4 - data discovery](https://reader033.vdocument.in/reader033/viewer/2022051710/5a64b0667f8b9ac21c8b45d3/html5/thumbnails/18.jpg)
With the exception of third party images or where otherwise indicated, this work is licensed under the Creative
Commons 4.0 International Attribution Licence.
ANDS, Nectar and RDS are supported by the Australian Government through the National Collaborative Research
Infrastructure Strategy Program (NCRIS).
[email protected]@n_simonsorcid.org/0000-0003-0635-1998
Natasha Simons