

1

Data Science for NSF Polar Cyberinfrastructure & MIT Big Data Course

Dr. Brand Niemann
Director and Senior Data Scientist/Data Journalist

Semantic Community
http://semanticommunity.info/

http://www.meetup.com/Virginia-Big-Data-Meetup/
http://www.meetup.com/Federal-Big-Data-Working-Group/

http://semanticommunity.info/Data_Science/Federal_Big_Data_Working_Group_Meetup

January 12, 2015

2

Federal Big Data Working Group Meetup

• Federal: Supports the Federal Big Data Initiative, but not endorsed by the Federal Government or its Agencies;

• Big Data: Supports the Federal Digital Government Strategy which is "treating all content as data", so big data = all your content;

• Working Group: Data Science Teams composed of Federal Government and Non-Federal Government experts producing big data products (see Possible Team Presentations below); and

• Meetup: The world's largest network of local groups, which aims to revitalize local community and help people around the world self-organize, like the MOOCs (Massive Open Online Courses) being considered by the White House.

3

The Profit and Data Enterprises

• Marcus Lemonis (born November 16, 1973) is a Lebanese-born American businessman, investor, television personality and philanthropist. He is currently the chairman and CEO of Camping World and Good Sam Enterprises, and the star of The Profit, a CNBC reality show about saving small businesses through People, Process, and Products.
– http://en.wikipedia.org/wiki/Marcus_Lemonis

• The Federal Big Data Working Group Meetup is also about helping government agencies develop:
– People: Data Scientists
– Process: Data Infrastructure
– Products: Data Publications

• Some examples:
– EPA
– FDA
– NOAA
– HHS
– Eastern Foundry

• And provide MOOCs (Massive Open Online Courses) for training and networking.

4

Five MOOCs for Big Data Applications and Analytics

• Practical Data Science for Data Scientists by Niemann Based on Schutt and O’Neil Book

• Data Science for Data Mining by Niemann Based on North Book and Borne Class

• Federal Big Data Working Group Meetups by Niemann and Goodier

• Tackling the Challenges of Big Data, MIT ProfessionalX Online Course by Niemann Based on Rus and Madden MOOC

• Data Science for Big Data Application and Analytics MOOC by Niemann Based on Geoffrey Fox MOOC

• Data Science for Mining of Massive Datasets by Niemann Based on Stanford MOOC (IN PROCESS)

See: Top 5 MOOCs for Data Science

5

Tackling the Challenges of Big Data

7

Agenda

• 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring): Data Science for IEEE Big Data

• 6:45 p.m. Dr. George Strawn, Director, National Coordination Office/NITRD, and Brand Niemann. Slides
– TO BE RESCHEDULED: Dr. Chaitan Baru, Senior Advisor for Data Science in the CISE Directorate at the National Science Foundation (confirmed). Slides

• 7:10 p.m. Brief Member Introductions

• 7:30 p.m. Mark Silverman, Treeminer, Inc., Overview of Hadoop for Data Mining. Slides and Demo
– TO BE RESCHEDULED: Dr. Marco Tedesco, NSF Polar CyberInfrastructure Program Manager (confirmed), and Dr. Chris Mattmann, Chief Architect, Instrument Software and Science Data Systems Section (398), NASA Jet Propulsion Laboratory, and NSF Polar CyberInfrastructure Hackathon Organizer (confirmed): Data Science for NSF Polar Cyberinfrastructure

• 8:15 p.m. Data Science for Tackling the Challenges of Big Data and Members Who Took the Course

• 8:30 p.m. Open Discussion

• 8:45 p.m. Networking

• 9:00 p.m. Depart

http://www.meetup.com/Federal-Big-Data-Working-Group/events/217631412/

8

Calendar

• First Virginia Big Data Meetup: Data Science for The Data Act at Treasury, December 15, 2014, CGI Federal.
– Summary Report to OMB and Data Collation (IN PROCESS).

• Government Technology & Innovation Incubator for Big Data Analytics, January 27, 2015.
– Meetup of Meetups for Eastern Foundry Challenge Cup

• Data Science for the National Big Data R and D Initiative, February 2, 2015

• Data Science for Big Data Application and Analytics MOOC, March 2, 2015

• Data Science for HealthData.gov Developers & Family Caregivers, April 6, 2015

• Data Science for Natural Medicines and Genetic Data (in planning), May 4, 2015

9

Government Technology & Innovation Incubator for Big Data Analytics

• Purpose: Time Critical Because of Eastern Foundry Small Business/Start-Up Benefits and Challenge Cup Opportunity

• Opportunity: Challenge Cup

• Companies with government technology products (physical and software) can compete for free space at Eastern Foundry, educational programming on government contracting, product development, and general corporate skills, and access to VCs. Companies will be selected based on the maturity of their product and the urgency of the need the product would address.

• Eastern Foundry: Veteran owned incubator for startups and small businesses interested in the government contracting industry.
– Sen. Mark Warner cuts ribbon at Eastern Foundry opening ceremony
– The Crystal City incubator opened its doors December 1st and already has 33 companies in their space.
– http://technical.ly/dc/2014/12/16/sen-mark-warner-cuts-ribbon-eastern-foundry-opening-ceremony/

• Web: Eastern Foundry (http://www.eastern-foundry.com and https://twitter.com/easternfoundry)

• Logistics: Free Parking After 5 p.m. in Underground Garage
– Metro Blue and Yellow Lines: Crystal City Station
– 202-725-7483 and [email protected]

10

Agenda

• 6:30 p.m. Welcome and Introduction (New Tutorial and Mentoring): Data Science for the DTIC Data Ecosystem RFI, Brand Niemann

• 7:00 p.m. Brief Member Introductions

• 7:15 p.m. Big Data Technology, Dmitri Adler

• 7:45 p.m. Government Technology & Innovation Incubator (Eastern Foundry), Geoff Orazem

• 8:30 p.m. Open Discussion

• 8:45 p.m. Networking

• 9:00 p.m. Depart

http://www.meetup.com/Federal-Big-Data-Working-Group/events/219547654/