ecogrid seek all hands meeting february 2003 albuquerque, nm

10
EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

Upload: tracen

Post on 05-Jan-2016

25 views

Category:

Documents


1 download

DESCRIPTION

EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM. EcoGrid in SEEK. ANAYSIS & MODELING LAYER. SEMANTIC MEDIATION LAYER. DATA/COMPUTE LAYER. Aims of EcoGrid. Which, Where, How, Who ???? Share Data and Information Relate Data from multiple projects/groups - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

EcoGrid SEEK All Hands Meeting

February 2003Albuquerque, NM

Page 2: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

EcoGrid in SEEK

ANAYSIS & MODELING LAYER

SEMANTIC MEDIATION LAYER

DATA/COMPUTE LAYER

Page 3: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

Aims of EcoGrid• Which, Where, How, Who ????• Share Data and Information• Relate Data from multiple projects/groups• Crosswalks across data structures• Develop Eco-related Finding Aids for Data• Global User: Authenticate and Authorize• Provide an infrastructure for “Archivable

Collection-building” for SEEK scientists• Facilitate the A&M layer and the SMS layer

Page 4: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

Challenges of EcoGrid• Data & User Diversity

– 6000+ datasets & 1500+ scientists– themes, methods, units,structures– Small data sizes but high complexity - metadata

• Multiple Data Organizations– Biodiversity Surveys– Population data– GIS, Satellite Images, Weather Data, …

• Ontologies & Taxonomies• Data Discovery: No single place to find• Data Entropy – rapid decline of information on data

• Autonomy with Centralized access• Leverage Computational Grid work

Page 5: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

Our Charge• Develop a framework for “global access to ecologically-

related data”• Look at current approaches, existing systems & grids• List features/functionalities we want to see in EcoGrid• Study how to leverage, integrate, extend existing work• Come up with architectural framework & user interfaces• Identify Datasets that should be in the EcoGrid• Identify Networks that will be part of the EcoGrid• Identify Methods that can be used through the EcoGrid• Identify people, members, partners• Identify timeline, goals, milestones

Page 6: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

Existing services• Metacat – syntactic and semantic metadata

querying/inserting/updating/deleting, user registration/authentication, data replication, data/metadata versioning, - supports any XML-based metadata

• Xanthoria – common-schema mediator (currently 8 sites) metadata query/insert/update/delete for any XML schema to underlying metadatabase (SQL, native XML)

Page 7: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

Existing Systems• Prometheus – querying classification taxonomy,

query/describe graph structures

• DiGIR – querying arbitrary XML-describable resources (underlying data sources can be any type: RDB, XMLDB).

• ClimDB – integrating (using wrapping at the data source) diverse format climate data. Access through web, common schema identified beforehand – tabular description

• HyperLTER – summary ontology as metadata for images put in as metadata, image extraction /geographicsubsetting/band-level subsetting/ - integration with MODIS images and Hyperspectral images, TM images, airphotos, …

Page 8: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

Existing Systems

• Veg Bank – 3 databases co-occurrence records, species taxonomic database that is concept-driven, community classification. Distributed vegbank, querying by plots. Querying/insert/update/annotate across three diverse databases that are described using XML

• SRB – access distributed data, syntactic, semantics,user-defined (arbitrary relational) metadata based querying. Annotations for data. Opertions on data. Extraction of metadata. ingest,bulk ingest, delete,upate of data/metadata

Page 9: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

What is needed for Data/EcoGrid• Define definitions for structures for data that will

be processed in SEEK data grid– Vectors, graphs, tables, trees, ….

• Extend EML to take account of DDI and other metadata standards

• List a set of services that might be supported by data

• Identify common languages and mappings for a structural vocabulary– Ex. Occurrence, co-occurrence, as key words

• Identify the services of EcoGrid Components• Identify Registry Language –

users/data/methods/resources/srorage/compute

Page 10: EcoGrid SEEK All Hands Meeting February 2003 Albuquerque, NM

What do we here

• Specifications document • Calendering, Meetings • Milestones (Priorities, Duration

for pubs, software, and other products) • Deliverables for Annual report (June 1) • Planned activies for dev mtg, wg, all-hands mtg • Staff coordination and task allocation