earthcube data science publications dr. joan aron dr. sophia liu dr. brand niemann may 29, 2015

16
EarthCube Data Science Publications Dr. Joan Aron Dr. Sophia Liu Dr. Brand Niemann May 29, 2015 http:// semanticommunity.info/Data_Science/EarthCube_Data_Science_Publications http:// earthcube.org/forum/earthcube-data-science-publications/data-science-publication-usgs-minerals-big-data http://www.meetup.com/Federal-Big-Data-Working-Group/events/221810524 / 1

Upload: robert-price

Post on 19-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

1

EarthCube Data Science Publications

Dr. Joan AronDr. Sophia Liu

Dr. Brand NiemannMay 29, 2015

http://semanticommunity.info/Data_Science/EarthCube_Data_Science_Publicationshttp://earthcube.org/forum/earthcube-data-science-publications/data-science-publication-usgs-minerals-big-data

http://www.meetup.com/Federal-Big-Data-Working-Group/events/221810524/

2

Agenda

• Context:– Dr. Brand Niemann, Director and Senior Data

Scientist, Semantic Community• Example:– Dr. Sophia Liu, Mendenhall Postdoc Fellow at the

U.S. Geological Survey• Discussion:– Dr. Joan Aron, President and Founder of Science

Communication Studies

3

Timeline• November 2011:

– EarthCube Shows How Collaboration Is Changing Geoscience Research (EarthCube Charrette & AOL Gov Story)

• June 2014:– EarthCube Data Science Publications Special Interest Group Breakout Session

• February 2015:– Data Science for NSF Polar Cyberinfrastructure and MIT Big Data Course

Meetup• May 2015:

– EarthCube All Hands Meeting– Dynamic Earth: GEO Imperatives & Frontiers 2015-2020 (2014)– Data Science Publication for USGS Minerals Big Data Mini Session

• June 2015:– Data Science Publication for USGS Minerals Big Data Meetup

4

EarthCube 2011 Shows How Collaboration Is Changing Geoscience Research

• Question: When EarthCube exists and is widely useful in 2021, what does a day in the life of a scientist in your field look like? Think about your: Research, Teaching, Outreach, Workforce Development, and Interaction with the greater scientific community.

• Answer: I will still have some form of the tools I am using now, they will just be more integrated with one another and even more connected to many sources of information and data so I can create data stories (like I do now for AOL Government) with greater ease and frequency because the time for collection and communication is lessened and the time for analysis is maximized -- actually the analysis tool facilitates the collection and communication parts.

• Note: I also provided an example of the kind of agile analysis of Earth Science data that I had done earlier in the year for the Earth Science Federation Annual Conference.http://gov.aol.com/2011/11/11/earthcube-shows-how-collaboration-is-changing-geoscience-researc/

5

EarthCube 2014 Sprint to "Stretch Goals: Open Research Data Publication and Integration

http://semanticommunity.info/Data_Science/EarthCube_Data_Science_Publications#Summary_History_and_Workplan

6

EarthCube 2015 Data Science for NSF Polar Cyberinfrastructure and MIT Big Data Course Meetup

• 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring) Slides Data Science for IEEE Big Data

• 6:45 p.m. Dr. George Strawn, Director, National Coordination Office/NITRD and Brand Niemann, Slides

• TO BE RESCHEDULED: Dr. Chaitan Baru, Senior Advisor for Data Science in the CISE Directorate at the National Science Foundation (confirmed) Slides

• 7:10 p.m. Brief Member Introductions • 7:30 p.m. Mark Silverman, Treeminer, Inc., Overview of

Hadoop for Data Mining Slides and Demo: https://www.youtube.com/watch?v=5X65WV0n4rU

• TO BE RESCHEDULED: Dr. Marco Tedesco, NSF Polar CyberInfrastructure Program Manager (confirmed) and Dr. Chris Mattmann, Chief Architect, Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion and NSF Polar CyberInfrastructure Hackathon Organizer (confirmed) Data Science for NSF Polar Cyberinfrastructure

• 8:15 p.m Data Science for Tackling the Challenges of Big Data and Members Who Took the Course

• 8:30 p.m. Open Discussion • 8:45 p.m. Networking • 9:00 p.m. Depart

Data Science for NSF Polar CyberinfrastructureData Science for NSF Polar Cyberinfrastructure and MIT Big Data Course

7

EarthCube May 2015 Data Science Publications

• EarthCube All Hands Meeting:– Mission: Community-led cyberinfrastructure that will allow for

unprecedented data sharing across the geosciences.– Keynotes: Two best practices examples.– Scope and Vision: Dynamic Earth: GEO Imperatives & Frontiers 2015-2020

(2014)– Eva Zanzerkia, EarthCube Program Director:

• Question: How do we get the EarthCube domain scientists to work with open data?• My Answer: Get data scientists to mine EarthCube GEO domain data like the NSF

DataViz Hackathon for Polar CyberInfrastructure in early November 2014 and then organize a Meetup in February 2015 to discuss the results to the Federal Big Data Working Group Meetup.

– Brand Niemann, Director and Senior Data Scientist, Semantic Community:• NSF needs a Data.gov for GEO that its Big Data/Data Science grant projects can use.• GEO needs a Data Science Data Publication Commons for its four domains: Earth,

Oceans, Polar, and Atmosphere/Geospace like we have started here.

9

Conclusions and Recommendations

• Semantic Community uses a Semantic Wiki for Data Science Data Publications.

• The recent 2015 NIST Big Data Framework documents and use cases were produced as a Data Science Data Publication to address the “Holdren Memo.”

• Data Science Data Publications address most of the topics in the other All Hands Meeting sessions.

• This is a “win-win” for the EarthCube and Data Science Communities:– Domain Scientists get increased use and visibility of their data.– Data Scientists get access to data and subject matter expertise.

• The Federal Big Data Working Group will continue to data mine EarthCube Use Cases and Data Sets and produce Data Science Data Publications for its Meetups.

• This is Data-to-Knowledge-to-Action for Decision Making.

10

Data Science Publication for USGS Minerals Big Data Mini Session

http://semanticommunity.info/Data_Science/Data_Science_for_USGS_Minerals_Big_Data#Slides_2

11

What I Need From You

12

Positives of Civic Hacking

13

Data Science Publication for USGS Minerals Big Data: Story and Slides

http://semanticommunity.info/Data_Science/Data_Science_for_USGS_Minerals_Big_Data#Story

14

15

16

Welcome to Semantic Community.info Community Infrastructure Sandbox for 2015• The Profit and Data Enterprises:

– Marcus Lemonis, star of The Profit, a CNBC reality show about saving small businesses through People, Process, and Products.

– The Federal Big Data Working Group Meetup is also about helping government agencies develop:• People – Data Scientists/Chief Data Officers• Process – Data Infrastructure• Products – Data Publications

– By provide MOOCs/Meetups for training and networking.• Five MOOCs for Big Data Applications and Analytics:

– Data Science for Mining of Massive Datasets by Niemann Based on Stanford MOOC (IN PROCESS)– USDA Data Science MOOC by Niemann (This Meetup)– Data Science for EPA Big Data Analytics (IN PROCESS)

• Federal Big Data Working Group and Virginia Big Data Meetups:– June 1: Data Science for Homeless Data: QlikView. Tableau, & Spotfire Bakeoff– June 15: Data Science for USGS Minerals Big Data– June 29: Data Science for Cyber Physical Systems-Internet of Things– July 20: Data Science for Affordable Care Act Data– Late Summer/Early Fall: DJ Patil, Chief Data Scientist, Whitehouse, Linda Powell, Chief Data Officer,

CFPB, etc.