epa big data analytics: data science for epa fracturing data dr. brand niemann director and senior...

16
EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community http://semanticommunity.info/ http://www.meetup.com/Virginia-Big-Data-Meetup/ http://www.meetup.com/Federal-Big-Data-Working-Group/ http://www.meetup.com/Northern-Virginia-Semantic-Web-Meetup/ http://semanticommunity.info/Data_Science/Federal_Big_Data_Working _Group_Meetup April 17, 2015 1

Upload: rosamund-bruce

Post on 24-Dec-2015

214 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

1

EPA Big Data Analytics:Data Science for EPA Fracturing Data

Dr. Brand NiemannDirector and Senior Data Scientist/Data Journalist

Semantic Communityhttp://semanticommunity.info/

http://www.meetup.com/Virginia-Big-Data-Meetup/ http://www.meetup.com/Federal-Big-Data-Working-Group/

http://www.meetup.com/Northern-Virginia-Semantic-Web-Meetup/ http://semanticommunity.info/Data_Science/Federal_Big_Data_Working_Group_Meetup

April 17, 2015

Page 2: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

2

Announcement

• The EPA released its peer-reviewed analysis of over two years of data from the FracFocus Chemical Disclosure Registry 1.0. FracFocus is a publicly accessible website, managed by the Ground Water Protection Council (GWPC) and Interstate Oil and Gas Compact Commission, where oil and gas production well operators can disclose information about ingredients used in hydraulic fracturing fluids at individual wells.

• In March 2015: U.S. Department of the Interior Releases Final Rule to Support Safe, Responsible Hydraulic Fracturing Activities on Public and Tribal Lands. See Press release and Final rule.

Page 3: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

3

EPA's Study of Hydraulic Fracturing for Oil and Gas and Its Potential Impact on Drinking Water Resources

• At the request of Congress, EPA is conducting a study to better understand any potential impacts of hydraulic fracturing for oil and gas on drinking water resources. The scope of the research includes the full lifespan of water in hydraulic fracturing.

• EPA's study will look at potential impacts of hydraulic fracturing at each stage of the Hydraulic Fracturing Water Cycle.

http://www2.epa.gov/hfstudy

Page 4: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

4

The Hydraulic Fracturing Water Cycle

http://www2.epa.gov/hfstudy/hydraulic-fracturing-water-cycle

Page 5: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

5

Data Mining for Data Science• Analysis of Hydraulic Fracturing Fluid Data from the FracFocus Chemical Disclosure

Registry 1.0 - See Attachments below– Four xlsx files totaling 23 MB

• Data Management and Quality Assessment Report - See Attachments below– 2 MB PDF and 96 MB ZIP file with Access Database and PDF documentation

• Project database developed from FracFocus 1.0 disclosures– Same as second item above

• Selected data tables from the analysis of FracFocus 1.0– Same as first item above

• State-level Summaries of FracFocus 1.0 Hydraulic Fracturing Data - See Attachments below– 7 MB PDF Guide and 22 PDF files

• EPA Analysis of FracFocus Data Fact Sheet - See Attachments below– 0.375 MB PDF

• EPA Webinar: Analysis of FracFocus Data - See Slides below– 2 MB PDF

http://www2.epa.gov/hfstudy/epa-analysis-fracfocus-1-data

Page 7: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

7

Data Mining Files and Attachments

My Note: See Attachments at Bottom of MindTouch PagePDF to MindTouch to Excel and Excel (Access)to Spotfire

Page 8: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

8

Data Science for EPA Fracturing Data: MindTouch Knowledge Base Find

My Note: Use Google Chrome FindFor Map Boundary Files

Data Science for EPA Big Data Analytics and EPA Fracturing Data

Page 9: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

9

Data Science for EPA Fracturing Data: Spreadsheet Knowledge Base

EPABigDataAnalytics.xlsx

Page 16: EPA Big Data Analytics: Data Science for EPA Fracturing Data Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community

16

Conclusions and Recommendations

• At the request of Congress, EPA is Studying Hydraulic Fracturing for Oil and Gas and Its Potential Impact on Drinking Water Resources at Each Stage of the Hydraulic Fracturing Water Cycle.

• In March 2015: U.S. Department of the Interior Released the Final Rule to Support Safe, Responsible Hydraulic Fracturing Activities on Public and Tribal Lands.

• This Rule was based on EPA peer-reviewed analysis of over two years of data from the FracFocus Chemical Disclosure Registry 1.0.

• EPA is Planning to Stand Up a Big Data Analytics Service and this Data Science for EPA Fracturing Data is an Example of EPA Big Data Analytics.

• This Data Science Data Product and Publication Includes a Knowledge Base in MindTouch, Excel, and Spotfire and Analytics and Visualizations in Spotfire.

• Additional Data Science Products and Publications Can be Done on More FracFocus PDF Files and the Access Database and Other EPA Projects.