Semantic Data Discovery: Proof of Concept for DHS

Dr. Brand Niemann
Director and Senior Data Scientist/Data Journalist
Semantic Community
http://semanticommunity.info/
http://www.meetup.com/Virginia-Big-Data-Meetup/
http://www.meetup.com/Federal-Big-Data-Working-Group/
http://www.meetup.com/Northern-Virginia-Semantic-Web-Meetup/
http://semanticommunity.info/Data_Science/Federal_Big_Data_Working_Group_Meetup

March 25, 2015

2

Information Sharing at DHS

• NIEM (Michael Daconta):
– XML Messages
• SOA (Wolf Tombe):
– XML Messages and XML Data in an ESB
• Semantic Ontology (Barry Smith):
– RDF/OWL UCore
• Semantic Knowledge Bases (Brand Niemann):
– NIEM and Thetus Savanna
• Semantic Search Data Browser (Brand Niemann):
– Global Terrorism Database Experience
• Semantic Quint Dynamic Case Management (Brand Niemann):
– XML/RDF/OWL/RML in Be Informed

4

NIEM 3.0 Alpha 2 Release and Thetus Savanna Review

Spotfire Dashboard: Web Player
MindTouch Knowledge Base: NIEM and Thetus

Key Questions:
• How can one review NIEM 3.0 Alpha 2 without some data science analytics?
• Does Thetus Savanna do what NIEM is ultimately trying to accomplish without NIEM 3.0?

6

Global Terrorism Database Experience

Spotfire Dashboard: Web Player
MindTouch Knowledge Base: Global Terrorism Database

Pilot for Stephen Dennis, Director, Innovation, Homeland Security Advanced Research Projects Agency (HSARPA), Department of Homeland Security (DHS), “Big Data Analytics for Homeland Security”

7

A Quint for Cross Information Sharing and Integration in the Intelligence Community

Spotfire Dashboard: Web Player
MindTouch Knowledge Base: A Quint-Cross Information Sharing and Integration

8

Dynamic Case Management Pilot for Healthcare.gov

Video Demo: Vimeo
MindTouch Knowledge Base: Healthcare.gov Data Science

9

Proof of Concept Steps

• Introduction:
– Best practice system example with data dictionary (DD) and Application Programming Interface (API)
– Challenge all systems to do that and submit for internal web page
• Phase I:
– All systems gain experience using internal web page for improved information sharing
– Initial work on semantic harmonization for multiple DDs
• Phase II:
– Develop various semantic harmonization methods and tools (e.g. ontology)
– Pilot those methods and tools (Dynamic Case Management – Be Informed?)
• Phase III:
– Develop requirements for improved information sharing system based on Phase I and II experience
– Release RFI for RFQ
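As a rough illustration of what "semantic harmonization for multiple DDs" in Phase I could look like in its simplest form, the sketch below groups field names from two data dictionaries under shared concepts. It is only a sketch under stated assumptions: the system names, field names, and the SYNONYMS table are hypothetical, not taken from any DHS system, and a real effort would more likely rely on an ontology or a NIEM mapping than on a hand-written lookup table.

```python
import re
from collections import defaultdict

# Hypothetical synonym table: normalized field name -> shared concept.
# In practice this role would be played by an ontology or a curated mapping.
SYNONYMS = {
    "dob": "birth_date",
    "date_of_birth": "birth_date",
    "surname": "last_name",
    "family_name": "last_name",
}


def normalize(name: str) -> str:
    """Convert a field name (camelCase, spaces, punctuation) to snake_case."""
    # Insert an underscore at each lowercase/digit-to-uppercase boundary.
    name = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", name)
    # Collapse every run of non-alphanumeric characters into one underscore.
    name = re.sub(r"[^A-Za-z0-9]+", "_", name)
    return name.strip("_").lower()


def harmonize(dictionaries):
    """Group the fields of several data dictionaries under shared concepts."""
    concepts = defaultdict(list)
    for system, fields in dictionaries.items():
        for field in fields:
            key = normalize(field)
            # Fall back to the normalized name when no synonym is known.
            concept = SYNONYMS.get(key, key)
            concepts[concept].append((system, field))
    return dict(concepts)


# Hypothetical data dictionaries submitted by two systems.
example = {
    "system_a": ["DateOfBirth", "Surname", "CaseID"],
    "system_b": ["dob", "family name", "case_id"],
}

harmonized = harmonize(example)
for concept, members in harmonized.items():
    print(concept, members)
```

Fields that end up alone under a concept are the ones that would need manual review or a richer method, which is where the Phase II tools would come in.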

10

Semantic Community

• Former Senior Enterprise Architect & Data Scientist with the US EPA.
• Led Federal CIO Council Web Services, SOA (with Mitre), Semantic Interoperability, and Semantic Community Work.
• Founded and Co-organizes the Federal Big Data Working Group Meetup to Continue the Above as a Private Citizen.
• Helping Government Agencies (US, European, and Japanese) Develop Data Scientists/Chief Data Officers, Data Infrastructure, and Data Publications.
– http://semanticommunity.info/
• Providing MOOCs/Meetups for Training and Networking.
– http://www.meetup.com/Federal-Big-Data-Working-Group