gem metadata development xiaoping wang, macrosearch allen macklin, pmel and bernard megrey, afsc

21
GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

Upload: lilian-preston

Post on 19-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

GEM METADATA DEVELOPMENT

Xiaoping Wang, Macrosearch

Allen Macklin, PMEL and Bernard Megrey, AFSC

Page 2: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

TOPICS

• Introduction about EML metadata standard• GEM metadatabase development• Advantages of Oracle Database

Page 3: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

GOAL

Generate EML-compliant metadata documents for the datasets that are in the interests of GEM (Gulf of Alaska Ecosystem Monitoring, a program of the Exxon Valdez Oil spill Trustee Council).

Page 4: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

WHAT IS EML

• Stands for Ecological Metadata Language.• Exists as a set of XML Schema documents.• Allows for the structural expression of

metadata elements.

Page 5: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

ADVANTAGES OF EML

• Includes almost all metadata elements covered by other metadata standards.

• Can be used in a modular and extensible manner.

• Can be used to describe:- Dataset- Literature- Software- Protocol

Page 6: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

USE OF DATASET MODULE

• Data table• Spatial raster• Spatial vector• Stored procedure• Other entity

Page 7: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATA ELEMENTS(1) General Information

• Dataset title, abstract and purpose• Data creator(s), metadata provider(s) and

contact information• Keywords• Data maintenance• Data distribution• Geographic/time coverage

Page 8: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATA ELEMENTS(2) Research Project Information

• Project title and description• Participants and their roles• Funding sources• Study area description• Design description

Page 9: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATA ELEMENTS(3) Method Information

• Method description• Sampling• Instruments• Software

Page 10: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATA ELEMENTS(4) Data Information

• Table name and description• Attribute name and definition• Attribute domain code and definition• Data unit• Data precision• Missing value code• Accuracy

Page 11: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATABASE DEVELOPMENT(1) Database Table Design

• Main table – One row for each dataset• Other tables – One or multiple rows for each

dataset.- Keywords- Personnel- Data tables - Table attributes- Attribute domain codes- Instruments.

Page 12: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATABASE DEVELOPMENT(2) Integrity Constraints

• Primary key – dataset record ID in main table• Foreign key – dataset record ID in other tables• Check constraints – allowed values of EML

elements• NOT Null constraints – mandatory EML

elements

Page 13: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATABASE DEVELOPMENT(3) Stored Procedures

• Handle repeated database operations• Input large text files

Page 14: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

METADATA FILE GENERATION

• Java Program development– Read data from metadatabase– Generate EML-compliant metadata files

• Validate metadata files against EML– no XML errors – no EML errors

Page 15: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

ORACLE DATABASE

• Advantages– Can be used on multiple platforms

(Windows, Unix, and Linux…)– Has the best security features– Has the highest availability and reliability– Has a powerful language (PL/SQL) for data

query and manipulation• Disadvantage

– More expensive

Page 16: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

FUNDAMENTAL DATA SECURITY REQUIREMENTS

• Confidentiality - users can see only the data that they are supposed to see.

• Integrity - data is protected from deletion and corruption.

• Availability - data is available to authorized users without delay.

Page 17: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

DATA AVAILABILITY(1) Real Application Clusters

Page 18: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

DATA AVAILABILITY(2) Replication

Page 19: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

DATA AVAILABILITY(3) Data Guard

Page 20: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

DATA AVAILABILITY(4) Stream

Page 21: GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC

DATA MANAGEMENT

• Database management - Data storage • Metadata management - Data documentation • Data availability - Online data share