michener workshop montpellier

93
DataONE Data Life Cycle: Tools and Tips

Upload: alison-specht

Post on 22-Jan-2018

57 views

Category:

Science


1 download

TRANSCRIPT

Page 1: Michener workshop montpellier

DataONEData Life Cycle:

Tools and Tips

Page 2: Michener workshop montpellier

The DataONE Data Life Cycle

2

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 3: Michener workshop montpellier

Field Research

3

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 4: Michener workshop montpellier

Monitoring Project

4

Publish

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 5: Michener workshop montpellier

Synthesis Project

5

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Publish

Page 6: Michener workshop montpellier

Develop Solutions for Research

6

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 7: Michener workshop montpellier

The DataONE Data Life Cycle

7

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 8: Michener workshop montpellier

1. Plan:Create and Follow a Data Management Plan

8

Michener WK (2015) Ten Simple Rules

for Creating a Good Data Management Plan.

PLoS Comput Biol 11(10): e1004525.

doi:10.1371/journal.pcbi.1004525

Page 9: Michener workshop montpellier

9

Page 10: Michener workshop montpellier

10

Page 11: Michener workshop montpellier

11

Page 12: Michener workshop montpellier

12

Page 13: Michener workshop montpellier

13

Page 14: Michener workshop montpellier

14

Page 15: Michener workshop montpellier

15

Page 16: Michener workshop montpellier

16

Page 17: Michener workshop montpellier

17

Page 18: Michener workshop montpellier

18

Page 19: Michener workshop montpellier

19

Page 20: Michener workshop montpellier

20

Page 21: Michener workshop montpellier

21

Page 22: Michener workshop montpellier

22

Page 23: Michener workshop montpellier

23

Page 24: Michener workshop montpellier

24

Page 25: Michener workshop montpellier

25

Page 26: Michener workshop montpellier

The DataONE Data Life Cycle

26

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 27: Michener workshop montpellier

2. Collect and Organize:Logically Structure the Data to Support Use

27

CC

im

ag

e b

y J

ustin

Se

e o

n F

lickr

Jones et al. 2007

Page 28: Michener workshop montpellier

2. Collect and Organize

28

• Columns of data are consistent:

only numbers, dates, or text

• Consistent Names, Codes, Formats (date) used in each column

• Data are all in one table, which is much easier for a statistical program to work with than multiple small tables which each require human intervention

Page 29: Michener workshop montpellier

2. Collect and Organize

29

• Columns of data are consistent:

only numbers, dates, or text

• Consistent Names, Codes, Formats (date) used in each column

• Data are all in one table, which is much easier for a statistical program to work with than multiple small tables which each require human intervention

Page 30: Michener workshop montpellier

Googledocs Forms

Page 31: Michener workshop montpellier

Googledocs Forms

Page 32: Michener workshop montpellier

Data Entry Tools: Excel

Page 33: Michener workshop montpellier

Data Entry Tools: Excel

Page 34: Michener workshop montpellier

Excel: Data Validation

20

Page 35: Michener workshop montpellier

Excel: Data Validation

20

Page 36: Michener workshop montpellier

Excel: Data Validation

20

Page 37: Michener workshop montpellier

The DataONE Data Life Cycle

37

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 38: Michener workshop montpellier

3. Assure:Incorporate Quality Assurance & Quality

Control

38

0

10

20

30

40

50

60

0 10 20 30 40

Quality Engine

MetaDIG DIBBs

Page 39: Michener workshop montpellier

3. Assure

39

Page 40: Michener workshop montpellier

3. Assure

40

Page 41: Michener workshop montpellier

3. Assure

41

Page 42: Michener workshop montpellier

3. Assure

42

Page 43: Michener workshop montpellier

3. Assure

43

Page 44: Michener workshop montpellier

3. Assure

44

Page 45: Michener workshop montpellier

3. Assure

45

Page 46: Michener workshop montpellier

3. Assure

46

Page 47: Michener workshop montpellier

3. Assure

47

Page 48: Michener workshop montpellier

3. Assure

• JMP

• R

• MATLAB

• many others

48

Page 49: Michener workshop montpellier

The DataONE Data Life Cycle

49

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 50: Michener workshop montpellier

4. Describe:Develop Comprehensive, Standardized

Metadata

50

Darwin Core – species and biodiversity

collections

EML – Ecological Metadata Language

ISO 19115 – geospatial data

http://rs.tdwg.org/dwc/

Page 51: Michener workshop montpellier

4. Describe

51

Tools Specify

Morpho

https://knb.ecoinformatics.org/#tools/morpho

http://specifyx.specifysoftware.org

Page 52: Michener workshop montpellier

The DataONE Data Life Cycle

52

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 53: Michener workshop montpellier

5. Preserve:Protect and Preserve Data for Long-term

Use

53

Catalog of 1,500+ Data Repositories

Page 54: Michener workshop montpellier

Exercise• Search for repositories that host particular

types of data (e.g., biodoversity, trait)

• Visit one of the repositories and identify the

services that they offer

54

Page 55: Michener workshop montpellier

The DataONE Data Life Cycle

55

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 56: Michener workshop montpellier

6. Discover Search a Domain Portal

56

Page 57: Michener workshop montpellier

57

Page 58: Michener workshop montpellier

58

Page 59: Michener workshop montpellier

59

Page 60: Michener workshop montpellier

60

Dryad links to journals

Page 61: Michener workshop montpellier

61

Provides citation instructions

Page 62: Michener workshop montpellier

6. Discover Search a Data Aggregator

62

Page 63: Michener workshop montpellier

63

Page 64: Michener workshop montpellier

64

Page 65: Michener workshop montpellier

65

Page 66: Michener workshop montpellier

Data Federations (DataONE,

GBIF)

66

Page 67: Michener workshop montpellier

Data Federations (DataONE,

GBIF)carbon cycling

67

Page 68: Michener workshop montpellier

Data Federations (DataONE,

GBIF)carbon cycling

68

Page 69: Michener workshop montpellier

Data Federations (DataONE,

GBIF)carbon cycling plant biomass

69

Page 70: Michener workshop montpellier

Data Federations (DataONE,

GBIF)carbon cycling plant biomass

70

Page 71: Michener workshop montpellier

Data Federations (DataONE,

GBIF)carbon cycling plant biomass

ocean nitrogen avian distribution

71

Page 72: Michener workshop montpellier

Exercise• Search datadryad.org for plant trait

• Search DataONE.org for plant trait

72

Page 73: Michener workshop montpellier

73

Page 74: Michener workshop montpellier

74

Page 75: Michener workshop montpellier

75

Page 76: Michener workshop montpellier

76

Page 77: Michener workshop montpellier

77

Page 78: Michener workshop montpellier

78

Page 79: Michener workshop montpellier

6. Discover:Support Discovery of Relevant Data

79

Dryad DataONE google

plant trait 2,137 26,300,000

plant trait datadryad 803 1,908 17,400

• Differential content searched

• Automated annotation via ontologies and other

approaches

• Differential filtering

• Different definitions of data sets (e.g., entire

package vs individual data sets)

Page 80: Michener workshop montpellier

The DataONE Data Life Cycle

80

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 81: Michener workshop montpellier

7. Integrate:Enable Data Integration from Different

Sources

81 Jones et al. 2007

Page 82: Michener workshop montpellier

7. Integrate:DataONE Provenance Tracking System

82

Page 83: Michener workshop montpellier

The DataONE Data Life Cycle

83

Plan

Collect

Assure

Describe

Preserve

Discover

Integrate

Analyze

Page 84: Michener workshop montpellier

8. Analyze:https://www.vistrails.org

84

Page 85: Michener workshop montpellier

85

8. Analyze:http://kepler-project.org

Page 86: Michener workshop montpellier

86

8. Analyze:http://kepler-project.org

Page 87: Michener workshop montpellier

87

8. Analyze:https://taverna.incubator.apache.org

Page 88: Michener workshop montpellier

8. Analyze:https://www.myexperiment.org/

88

Page 89: Michener workshop montpellier

Best PracticesWebinar series Lessons and

Exercises

DataONE.orgEducation Resources

89

Page 90: Michener workshop montpellier

90

DataONE Vision and Mission

Page 91: Michener workshop montpellier

91

Page 92: Michener workshop montpellier

92

Page 93: Michener workshop montpellier

dataone.org