hydroshare: advancing collaboration through hydrologic data and model sharing

38
HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry Band, Venkatesh Merwade, Alva Couch, Jennifer Arrigo, Rick Hooper, David Valentine http://www.hydroshare.org OCI-1148453 OCI-1148090

Upload: bendek

Post on 26-Feb-2016

48 views

Category:

Documents


3 download

DESCRIPTION

HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing. David Tarboton , Ray Idaszak , Jeffery Horsburgh , Dan Ames, Jon Goodall , Larry Band, Venkatesh Merwade , Alva Couch, Jennifer Arrigo , Rick Hooper, David Valentine. http://www.hydroshare.org. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

HydroShare: Advancing Collaboration through

Hydrologic Data and Model Sharing

David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry Band, Venkatesh Merwade, Alva Couch,

Jennifer Arrigo, Rick Hooper, David Valentine

http://www.hydroshare.org

OCI-1148453 OCI-1148090

Page 2: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

CUAHSI HIS Challenges

• Publishing data requires access to or setting up a HydroServer

• Accessing data requires HydroDesktop

• Generally limited to time series at a point Server

Desktop

Catalog

Page 3: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

A digital divideBig Data and HPCResearchers

• Experimentalists• Modelers

awkgrep

vi

#PBS -l nodes=4:ppn=8mpiexec

chmod#!/bin/bash

How can we best structure data and computer models to enable the use of high-performance and data-intensive computing by discipline scientists coming to this problem without extensive computational knowledge and algorithmic experience?

Gateways, Web Interfaces, CyberGIS

Page 4: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Can sharing data and models be as easy as sharing photos on Facebook or videos on

YouTube?

Page 5: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Can finding data and models be as easy as shopping on Amazon?

Items

Possible Filters Available Formats

Recommendations Prices (perhaps usage)

Page 6: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Cloud Computing

Wikipedia: Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over a network (typically the Internet)

Storage

Applications

Computation Services

Models

Google, Amazon, Microsoft, Apple, DropBox XSEDE, Condor, BOINC

Page 7: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

HydroShare is a web based collaborative system to support analysis,

modeling and data publication

Observers and

instrumentsData

Analysis

Models

Collaboration

Publication, Archival, Curation

Page 8: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Currently in beta testing http://beta.hydroshare.org

Page 9: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

HydroShare Functionality to be Developed

1. A new, web-based system for advancing model and data sharing

2. Sharing features to HydroDesktop 3. Access more types of hydrologic data using standards

compliant data formats and interfaces 4. Enhance catalog functionality that broadens discovery

functionality to different data types5. New model sharing and discovery functionality6. Facilitate and ease access to use of high performance

computing7. New social media and collaboration functionality8. Links to other data and modeling systems

Page 10: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing
Page 11: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing
Page 12: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing
Page 13: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Upload

Page 14: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Support additional types of dataResource Types• Time Series• Geographic feature set• Other• Referenced HIS time series• Geographic Raster• Multidimensional Space Time dataset• River geometry• Sample based observations (ODM2 and

CZO)• Documents• Tabular objects• HydroDesktop Project package• Scripts• Models• Model Components• Referenced data sets from other (non HIS

sources).

Tools• Uploaders to facilitate

loading of resources• Viewers to visualize the

resource• Exporters to download the

resource• Best practice tools for

hydrologic data preprocessing and analysis

Requires a Resource Data ModelDocumented resource content specification that dictates how the resource is stored in HydroShare

Page 15: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Imagine the Possibilities…

Observers and

instrumentsData

Analysis

Models

Collaboration

HydroShare to support integrated collaborative analysis, modeling and data publication

HydroServer (ODM)

12

1. Observe2. Publish and Catalog

3

3. Discover and Analyze/Model (in Desktop or Cloud)

Publication, Archival, Curation

Page 16: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Observers and

instrumentsData

Analysis

Models

Collaboration

HydroShare to support integrated collaborative analysis, modeling and data publication

4. Share the results (Data and Models)

HydroShare resourcestore

4

Publication, Archival, Curation

Imagine the Possibilities…

Page 17: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Observers and

instrumentsData

Analysis

Models

Collaboration

HydroShare to support integrated collaborative analysis, modeling and data publication

5. Group Collaboration using HydroShare

6. Preparation of a paper

5

6

Publication, Archival, Curation

Imagine the Possibilities…

Page 18: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Observers and

instrumentsData

Analysis

Models

Collaboration

HydroShare to support integrated collaborative analysis, modeling and data publication

7. Submittal of paper, review, archival of electronic paper with data, methods and workflow

7

Publication, Archival, Curation

DataOne, EarthCube, …

Imagine the Possibilities…

Page 19: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

HydroShare Modeling

• Data: Links to national and global data sets of essential terrestrial variables (e.g. NASA NEX, HydroTerre)

• Tools to preprocess and configure inputs (TauDEM + CyberGIS)

• Preconfigured models and modeling systems as services (CI-WATER)

• Standards for information exchange for interoperability (OpenMI, CSDMS BMI)

• Tools for visualization and analysis• Automated reasoning to couple models based on purpose, context, data and resources (Aaron Byrd)

xyt

Flow

Time

Page 20: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

A specific example• Big snow year• Will my city flood?• Click to delineate watershed

(model domain)• Generate model package from

Essential Terrestrial Variables• Generate suite of input

scenarios• Execute model and view results Time

Flow

Time

P

Page 21: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

But there is more…What if I could express my decision needs to the system and have it reason and deduce which models need to run, then configure and run them based on the inputs available, precision needs and resources and time available.

Page 22: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Resource Repository Centric Paradigm for Modeling and Analysis

Enable multiple models to use common “best practice” tools

Analysis Tools

Visualization Tools

Data LoadersData

Discovery Tools

Models

Resource Repository

Page 23: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

E.g. SWATShare• A web based tool for publishing, sharing, and

accessing Soil Water Assessment Tool (SWAT)

www.water-hub.org/swat-tool

Page 24: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Model pre and post processing workflow

• Each model interacts with information in the common data store• The modeler does not need to be concerned with and can take advantage of

standardized analysis, visualization loading and discovery tools

Resource Repository

Analysis Tools

Visualization Tools

Data Loaders

Data Discovery

Tools

Models

Resource Repository

Pre-Processing

Post -Processing

Input Files Output Files

Model

Page 25: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Architecture and Development

Page 26: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Drupal – Content Management System

• Extensible Open Source Content Management Framework for Publication written in PHP– Over 14,000 user contributed modules

• Themed and Styled Presentation of HydroShare Resources with in page visualization

• Off the shelf modules provide a Social Experience surrounding Hydrologic Data: Comments, Ratings, Group Behavior

• Custom module development supports HydroShare Data Model, GeoAnalytics and iRODS Integration

Page 27: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Enterprise iRODS

E-iRODS in HydroShare• Storage of HydroShare Resources

Replicated across multiple institutions

• Access to Computation • Access to Indexing for Discovery

Rule Engine MSVC

R. Server R. Server…

Client

Users

iCAT

Distributed Data Grid Middleware:• Metadata Catalog holding virtual

file system information and associated metadata

• Extensible number of ‘Resource Servers’ which may provide connectivity to storage resources

• Integrated Rule Engine for Policy Driven Data Management triggered by Data Management Activities

• Extensibility via Microservices (MSVC) – Plugins providing functionality to the Rule Engine

Page 28: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

http://www.cuahsi.org

A community project

• 109 US University members• 7 affiliate members• 20 international affiliate members• 3 corporate members

(as of January 2013)

Users CommitteeInformatics Standing Committee

Page 29: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Community GovernanceCUAHSI Board

Standing Committee on Informatics

HydroShareExecutive Committee

CUAHSIUser

Community

HydroShareDevelopment

Team

Implementation (Agile)

– Hydrologic Information System (HIS)

– Integrated Rule-Oriented Data System (iRODS)

– Drupal

HydroShareEvaluation

Metrics– End-user

involvement– Quantitative

and qualitative measurement

– Sustainability

• Prioritization• Decision Making

Oversight

Released Software

Community / User Requirements

– Surveys– Conferences– Workshops– Embed UI with

“Help us make our software better”

Specification Requests Prototype

Page 30: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

– USU– RENCI/UNC– CUAHSI– BYU– Tufts– UVA– Texas – Purdue– SDSC

HydroShare project team

OCI-1148453OCI-11480902012-2017

Page 31: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

User driven use casesAnnotate uploaded hydrology models using an ontologyRegister a Package with HydroShareAdd data resource for a model Notify Me When Related Resources Are RegisteredRegister a Resource with HydroShareEvaluate Load Reduction ScenariosSuggest a Resource Related to the Current ResourceBuilding an Intelligent Digital Watershed (IDW)Contribute to a Community DatasetDefine Relationships between ResourcesDiscover a Community Dataset to which I Can ContributeExecute a Model in HydroShareRegister a Workflow with HydroShareRegister a Community DatasetDownload a Model, Execute It, and Share the Model and ResultsDefine a Composite ResourceCrowd sourcing modeling tasksAutomated Visualization (thumbnails)User displays HydroShare GalleryExisting User Logs into HydroShareNew User Creates a HydroShare User AccountUser Sets Personal PreferencesUser is provided a personal DashboardUser Chooses to “Follow” Another UserUser Chooses to “Follow” a Group

User Views His/Her Personal ContentUser Uploads a ResourceUser Deletes a ResourceUser Shares a Resource in HydroShareUser Publishes a Resource to DataONEUser Publishes a Resource to the CUAHSI Water Data CenterUser Exports a Resource to their Local MachineUser Searches / Filters / Sorts their Personal ResourcesUser Views Details Page for a ResourceUser Groups Resources into a “Folder” or “Collection”User “Opens” a ResourceUser Edits Metadata Description for a ResourceUser Adds a Comment to a ResourceUser Rates / Reviews a ResourceUser Derives a New Resource from an Existing ResourceUser Executes a ResourceUser Explores / Searches Available HydroShare ResourcesUser “Pins” a Discovered Resource to a “Resource Collection”User Filters Discovered ResourcesUser Imports Data from Externally Hosted ResourcesUser Searches For Collaboration GroupsUser Views Group DetailsUser Creates a Collaboration GroupUser Requests Group MembershipUser Creates a Comment on a Collaboration GroupUser Creates a Discussion in a Collaboration Group Discussion Forum

User Edits a Collaboration Group’s DescriptionUser Searches / Filters / Sorts a Group’s ResourcesUser Views Documentation and Gets SupportUser Views / Subscribes to the HydroShare BlogUser Exports a HydroShare Resource Citation into Mendeley or Zotero User Transfers Ownership of a Resource to Another UserUser Receives HydroShare Social Media Notifications via Mobile DeviceUser Views Access / Download Statistics for a ResourceUser Views HydroShare Resources via Mobile DevicesSearching and/or browsing HydroShare Translate data automatically for HydroShare operations.Translate data automatically for export.Publish translated data.Translate replicated data.Registration of a new HydroShare ToolEditing a Published (with DOI) resourceUser Creates New “Model Package” ResourceUser Transfers Ownership of a Group to Another UserUser Develops a Client for HydroShareSummarize hydrologic model input parameters for a user defined regionDiscover specialist/ Promote specialized servicesVisualize Time Series Upload a Model

Page 32: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Metrics

Use Metric Number of active

users

Number of

resources

stored

Number of

resources

downloaded

Size of resourc

es stored (GB)

CPU hours of comput

e resources used

Number of

compute jobs

run

Number of

logons

Average duration

of session

Total use Use by user typeUniversity Faculty Post-Doctoral Fellow …. Use by Geographic LocationState Country Use by resource typeTime Series Geographic Feature Set

….

User Types: University Faculty, University Professional or Research Staff, Post-Doctoral Fellow, University Graduate Student, University Undergraduate Student, Commercial/Professional, Government Official, School Student Kindergarten to 12th Grade, School Teacher Kindergarten to 12th Grade, Other, Unspecified

Resource Types: Time Series, Geographic Feature Set, Geographic Raster, Multidimensional Space Time Array, River Geometry, Model, Workflow, Other, …

Metric NumberNumber of registered users 35Number of host institutions 15Github HydroShare code repository owners and members

15

Page 33: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Collaborative Open Development

http://github.com/organizations/hydroshare

http://hydrodesktop.codeplex.com

Page 34: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Summary• A collaborative website for the sharing of

hydrologic data and models• To expand data sharing capability of CUAHSI

HIS– Additional data classes– Models, scripts, tools and workflows

• Community Participation• Interoperability• Standards• Open Development

To boldly go where no one has gone before

Page 35: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

– USU– RENCI/UNC– CUAHSI– BYU– Tufts– USC– Texas – Purdue– SDSC

Thanks to a lot of people

HydroShare team: Dave Tarboton, Ray Idaszak, Dan Ames, Jeff Horsburgh, Jon Goodall, Larry Band, Venkatesh Merwade, Jeff Heard, Carol Song, Alva Couch, David Valentine, Rick Hooper, Jennifer Arrigo, David Maidment, Tim Whiteaker, Alex Bedig, Laura Christopherson, Pabitra Dash, Tian Gan, Tony Castronova, Karl Gustafson, Stephen Jackson, Cuyler Frisby, Stephanie Mills, Brian Miles, Jon Pollak, Stephanie Reeder, Ash Semien, Yaping Xiao, Lan Zhao

http://www.cuahsi.org/hydroshare.aspx OCI-1148453 OCI-1148090

Page 36: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Next Class

Page 37: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Representing River Geometry in HydroShare

Hydraulic Calculations

LiDAR

Cross Sections

Cross Sections Attached to River Network

Page 38: HydroShare:  Advancing Collaboration through Hydrologic Data and Model Sharing

Modular design, linking river geometry, catchment geometry, network topology, and time series observations

Data is linked by common reference points along the river, which can be represented as point or cross section shapefiles and shown on a map.

Based on OGC HY_Features Model