the vegbank taxonomic datamodel robert k. peet

27
The VegBank taxonomic datamodel Robert K. Peet Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National Center for Ecological Analysis and Synthesis

Upload: bonner

Post on 28-Jan-2016

33 views

Category:

Documents


0 download

DESCRIPTION

The VegBank taxonomic datamodel Robert K. Peet. Sponsored by: The Ecological Society of America US National Science Foundation Produced at: The National Center for Ecological Analysis and Synthesis. 1. The Intended User Groups. Specifically - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The VegBank  taxonomic datamodel  Robert K. Peet

The VegBank taxonomic datamodel

Robert K. PeetSponsored by:

The Ecological Society of America US National Science Foundation

Produced at:The National Center for Ecological Analysis and Synthesis

Page 2: The VegBank  taxonomic datamodel  Robert K. Peet

1. The Intended User Groups

Specifically• “Ecologists” who record species

composition of vegetation.• “Ecologists” who use records of

vegetation composition.

In general • Creators and distributors of data

wherein organisms need to be labeled.• Users and consolidators of data wherein

organisms are labeled.

Page 3: The VegBank  taxonomic datamodel  Robert K. Peet

VegBank

• The ESA Vegetation Panel is currently developingVegBank (www.vegbank.org) as a public vegetation plot archive

• VegBank is expected to function for vegetation plot data in a manner analogous to GenBank.

• Primary data will be deposited for reference, novel synthesis, and reanalysis, particularly for classification.

Page 4: The VegBank  taxonomic datamodel  Robert K. Peet

2. Intended Functionality

• Ecologists, like most users of organism names, care about how to label organisms, and how to interpret labels others have placed on organisms.

• We wish for ease in combining datasets.

• We abhor name changes and ambiguity

Page 5: The VegBank  taxonomic datamodel  Robert K. Peet

Plot

PlotObservation

Taxon Observation

Taxon Interpretation

Plot Interpretation

Core elements of VegBank

Taxon Assignment

Plot Assignment

Page 6: The VegBank  taxonomic datamodel  Robert K. Peet

Taxonomic database challenge:Standardizing organisms and communities

The problem: Integration of data potentially representing different times, places, investigators and

taxonomic standards.

The traditional solution: A standard checklists of organisms.

Page 7: The VegBank  taxonomic datamodel  Robert K. Peet

Standard lists are available for Taxa

Representative examples for higher plants in North America / US

USDA Plants http://plants.usda.govITIS http://www.itis.usda.gov NatureServe http://www.natureserve.orgBONAP Flora North America

These are intended to be checklists wherein the taxa recognized perfectly partition all plants. The lists can be dynamic.

Page 8: The VegBank  taxonomic datamodel  Robert K. Peet

Most taxon checklists fail to allow effective dataset integration

The reasons include:

• The user cannot reconstruct the database as viewed at an arbitrary time in the past,

• Taxonomic concepts are not defined (just lists),

• Multiple party perspectives on taxonomic concepts and names cannot be supported or reconciled.

Page 9: The VegBank  taxonomic datamodel  Robert K. Peet

Intended functionality

• Organisms are labeled by reference to concept (name-reference combination),

• Party perspectives on concepts and names can be dynamic,

• User can select which party perspective to follow,

• Party perspectives are perfectly archived,

• Different names systems are supported,

• Enhanced stability in recognized concepts by separating name assignment and rank from concept,

Page 10: The VegBank  taxonomic datamodel  Robert K. Peet

Name ReferenceConcept

3. Taxonomic theory

A taxon concept represents a unique combination of a name and a reference

“Taxon concept” roughly equivalent to “Potential taxon” & “assertion”

Page 11: The VegBank  taxonomic datamodel  Robert K. Peet

Name ConceptUsage

A usage represents a unique association of a concept with a

name.

• Usage does not appear in the IOPI model, but instead is a special case of concept

• Usage can be used to apply multiple name systems to a concept

• Desirable for stability in recognized concepts

Page 12: The VegBank  taxonomic datamodel  Robert K. Peet

Name ConceptStart, StopConceptStatusLevel, Parent

UsageStart, StopNameStatusName system

Reference

Data relationshipsVegBank taxonomic data model

Single party, dynamic perspective

Page 13: The VegBank  taxonomic datamodel  Robert K. Peet

Party Perspective

The Party Perspective on a concept includes:

•Status – Standard, Nonstandard, Undetermined

• Correlation with other concepts – Equal, Greater, Lesser, Overlap,

Undetermined.

•Lineage – Predecessor and Successor concepts.

•Start & Stop dates.

Page 14: The VegBank  taxonomic datamodel  Robert K. Peet

Name Concept

Party

UsageStart, StopNameStatusName system

StatusStart, StopConceptStatusLevel, Parent

Reference

Data relationshipsVegBank taxonomic data model

Multiple parties, dynamic perspectives

Page 15: The VegBank  taxonomic datamodel  Robert K. Peet

Name Concept

Party

UsageStart, StopNameStatusName system

StatusStart, StopConceptStatusLevel, Parent

Correlation

Reference

Lineage

Data relationshipsVegBank taxonomic data model

With party correlations and lineages

Page 16: The VegBank  taxonomic datamodel  Robert K. Peet

Primary differences between the VegBank and IOPI models

The IOPI model is optimized for

• describing taxonomic decisions represented in literature.

The VB model is optimized for• stability in accepted concepts (super concepts),

• support of multiple dynamic party perspectives,

• support of multiple name systems.

Page 17: The VegBank  taxonomic datamodel  Robert K. Peet

4. State of Development

1. VegBank2. Collaborators• NatureServe Biotics4

• USDA PLANTS & ITIS

Page 18: The VegBank  taxonomic datamodel  Robert K. Peet

Status of VegBank• Working Prototype; open for deposit Nov 1• Production version July 2004• Access version on VegBranch today• Efficient data exchange by July 2004• Functionality mandated in draft FGDC

standards• IAVS working group established for data

exchange standards established 2003

Page 19: The VegBank  taxonomic datamodel  Robert K. Peet

VegBank data contentPrototype populated with USDA PLANTS lists

and synonyms = weak concepts.Contract with NatureServe and John Kartesz • Develop reference-based concepts for

14000 by July 2004 of the ~32000 vascular plant taxa at species level and below

• List of unambiguous taxa (~6000?)• Treatment of most ambiguous taxa• Demonstration mapping to FNA• A few demosntration groups in depth

Page 20: The VegBank  taxonomic datamodel  Robert K. Peet

Concept workbench

• Concept workbench for both plant concepts and community concepts is planned.

Page 21: The VegBank  taxonomic datamodel  Robert K. Peet

NatureServe Biotics 4

• In production• Loading the same initial concepts• Already using concepts where strong

differences between state programs• Routine data exchange with

Vegbank under development.

Page 22: The VegBank  taxonomic datamodel  Robert K. Peet

PLANTS & ITIS

• Frequent communication during redesign.

• Design documents for conversion to concept-based system complete

• Collaboration to assure that the concepts we develop will be employed in the new system

• Uncertainties about funding

Page 23: The VegBank  taxonomic datamodel  Robert K. Peet
Page 24: The VegBank  taxonomic datamodel  Robert K. Peet

NamesCarya ovata Carya carolinae-septentrionalisCarya ovata v. ovataCarya ovata v. australis

Concepts(One shagbark)C. ovata sec Gleason ’52C. ovata sec FNA ‘97

(Southern shagbark)C. carolinae-s. sec Radford ‘68C. ovata v. australis sec FNA ‘97

(Northern shagbark)C. ovata sec Radford ‘68C. ovata (v. ovata) sec FNA ‘97

ReferencesGleason 1952. Britton & BrownRadford et al. 1968. Flora CarolinasStone 1997. Flora North America

Six shagbark hickory conceptsPossible synonyms are listed together

Page 25: The VegBank  taxonomic datamodel  Robert K. Peet

ITISFNA CommitteeNatureServe

Carya ovata sec Gleason 1952Carya ovata sec FNA 1997 Carya ovata sec Radford 1968Carya carolinae sec Radford 1968Carya ovata (ovata) sec FNA 1997Carya ovata australis sec FNA 1997

Party Concept

Party Concept Status Start Usage:SciName

ITIS ovata –G52 NS 1996ITIS ovata –R68 St 1996 C. ovataITIS carolinae-s –R68 St 1996 C. carolinae-sept.ITIS carolinae-s –R68 NS 2000ITIS ovata aust –FNA St 2000 C. carolinae-sept.ITIS ovata – R68 NS 2000ITIS ovata ovata –FNA St 2000 C. ovata

Status

Application of Party Perspective

Page 26: The VegBank  taxonomic datamodel  Robert K. Peet

Name Concept

Party

UsageStart, StopNameStatusName system

StatusStart, StopConceptStatusLevel, Parent

Correlation

Reference

Lineage

Data relationshipsVegBank taxonomic data model

With party correlations and lineages

Page 27: The VegBank  taxonomic datamodel  Robert K. Peet

Name Concept

SourceType: Lit, Org Correlation

Status

Reference

Core elements of the IOPI model

Is author

Assigns status