real time semantic analysis of streaming sensor data

46

Upload: knoesis-center-wright-state-university

Post on 03-Dec-2014

2.714 views

Category:

Education


5 download

DESCRIPTION

Harshal Patni, "Real Time Semantic Analysis of Streaming Sensor Data," MS Thesis Defense, Kno.e.sis Center, Wright State University, Dayton OH, March 21, 2001.More at: http://wiki.knoesis.org/index.php/SSWDissertation Advisor: Prof. Amit Sheth

TRANSCRIPT

Page 1: Real Time Semantic Analysis of Streaming Sensor Data
Page 2: Real Time Semantic Analysis of Streaming Sensor Data

xWEB DATA evolved over time

2

Static Document and files

Real-Time Sensor, Social, Multi-media

data

Dynamic User Generated Content

1990’s

2000’s

2010’s

Page 3: Real Time Semantic Analysis of Streaming Sensor Data

xProperties of Streaming Data

3

Continuous

Rapid

Huge Volume

Heterogeneous

Information Overload!!

Page 4: Real Time Semantic Analysis of Streaming Sensor Data

xSome Statistics

4

“Sensors Networks will produce 10-20 times the amount of generated by social media in the next few years” - GigaOmni Media

Solution - “Meaningfully summarize this data”“More data has been created in the last three years than in all the past 40,000 years”- Teradata

“A cross-country flight from New York to Los Angeles on a Boeing 737 plane generates a massive 240 terabytes of data”- GigaOmni Media

Page 5: Real Time Semantic Analysis of Streaming Sensor Data

From Sensor Streams to Feature Streams in Real Time

Harshal PatniOhio Center of Excellence in Knowledge enabled Computing (Kno.e.sis)

Wright State University, Dayton, OH

Part of Semantic Sensor Web @ Kno.e.sis

48th ACM Southeast Conference. ACMSE 2010. Oxford, Mississippi. April 15-17, 2010.

Page 6: Real Time Semantic Analysis of Streaming Sensor Data

xOutline

6

1. Introduction 2. Architecture3. Linked Sensor Data4. Feature Streams5. Demonstration

Page 7: Real Time Semantic Analysis of Streaming Sensor Data

xDomain

7

Weather Domain Features

Blizzard Flurry

RainShower RainStorm

Page 8: Real Time Semantic Analysis of Streaming Sensor Data

xExplaining the title

8

Huge amount of Raw Sensor Data

Background Knowledge

Features representing Real-World events

ABSTRACTION

BlizzardRain Storm

Page 9: Real Time Semantic Analysis of Streaming Sensor Data

xTypes of Abstractions

9

Sum

mar

izat

ion

over

the

Tem

pora

l Dim

ensi

on

Summarization across Thematic Dimension

Page 10: Real Time Semantic Analysis of Streaming Sensor Data

xTypes of Abstractions

10

Summarization across Thematic Dimension

Select

Join

Analyze

Background Knowledge

Features representing Real-World Events

Page 11: Real Time Semantic Analysis of Streaming Sensor Data

xAn example problem?

11

“Find the sequence of weather events observed near Dayton James Cox Airport between Jan 13th and Jan 18th?”

Thematic Spatial Temporal

Technologies required - 1. Linked Sensor Data2. Feature Streams

Page 12: Real Time Semantic Analysis of Streaming Sensor Data

xOutline

12

1. Introduction 2.Architecture3.Linked Sensor Data4.Feature Streams5. Demonstration

Page 13: Real Time Semantic Analysis of Streaming Sensor Data

xSystem Architecture

13

Page 14: Real Time Semantic Analysis of Streaming Sensor Data

xOutline

14

1. Introduction 2.Architecture3.Linked Sensor Data4.Feature Streams5. Demonstration

Page 15: Real Time Semantic Analysis of Streaming Sensor Data

Technology1: Linked Sensor Data

48th ACM Southeast Conference. ACMSE 2010. Oxford, Mississippi. April 15-17, 2010.

Harshal Patni, Cory Henson, Amit Sheth, 'Linked Sensor Data,' In: Proceedings of 2010 International Symposium on Collaborative Technologies and Systems (CTS 2010), Chicago, IL, May 17-21, 2010.

1. Find the sensor around Dayton James Cox Airport?2. Extract Data for the sensor near Dayton James Cox Airport?

Page 16: Real Time Semantic Analysis of Streaming Sensor Data

Sensor Discovery Application

16

Weather Station ID

Weather Station Coordinates

Weather Station Phenomena

Current Observations from MesoWest

MesoWest – Project under Department of Meteorology, University of UTAH

GeoNames – Geographic dataset

Page 17: Real Time Semantic Analysis of Streaming Sensor Data

What is Linked Sensor Data

17

Weather Sensors

Camera SensorsSatellite Sensors

GPS SensorsSensor Dataset

Page 18: Real Time Semantic Analysis of Streaming Sensor Data

What is Linked Sensor Data

18

Sensor Dataset

Publicly Accessible

Recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Web using URIs and RDF

RDF – language for representing data on the Web

loca

tedNea

r

GeoNames Dataset

Page 19: Real Time Semantic Analysis of Streaming Sensor Data

Linked Sensor Data on LOD

19

- First Sensor Dataset on LOD - Among the largest dataset on LOD

Page 20: Real Time Semantic Analysis of Streaming Sensor Data

znSensor Datasets

20

LinkedSensorDataset

• RDF Descriptions of ~20,000 weather stations in US• Average 5 sensors/weather station• Spatial attributes of the weather station• Links to locations in Geonames

LinkedObservationDataset

• RDF descriptions of Hurricanes and Blizzard observations in US

• Observations generated by sensors described in LinkedSensorDataset

Page 21: Real Time Semantic Analysis of Streaming Sensor Data

MesoWest Service Data

OGC Observation and Measurement

(O&M)RDF Instance Virtuoso RDF store

Data Generation Workflow

21

O&M2RDFCONVERTER

Page 22: Real Time Semantic Analysis of Streaming Sensor Data

Workflow – Phase 1

22

MesoWest Service Data

OGC Observation and

Measurement (O&M)

RDF Instance Virtuoso RDF store

Page 23: Real Time Semantic Analysis of Streaming Sensor Data

MesoWest Service Data

OGC Observation and

Measurement (O&M)

RDF Instance Sesame RDF store

Workflow – Phase 2

23

OGC (Open Geospatial Consortium) standard for encoding sensor observations

Page 24: Real Time Semantic Analysis of Streaming Sensor Data

Workflow – Phase 3

MesoWest Service Data

OGC Observation and

Measurement (O&M)

RDF Instance Virtuoso RDF store

Ontology – formal representation of knowledge by a set of concepts and relationship between those concepts

W3C SSN ontology

Page 25: Real Time Semantic Analysis of Streaming Sensor Data

Figure 1: System Components and Architecture

Workflow – Phase 3

MesoWest Service Data

OGC Observation and

Measurement (O&M)

RDF Instance Virtuoso RDF store

Page 26: Real Time Semantic Analysis of Streaming Sensor Data

Workflow – Phase 4

MesoWest Service Data

OGC Observation and

Measurement (O&M)

RDF Instance Virtuoso RDF store

Open Source RDF store by OpenLink Software for storing RDF data

PUBBY Linked Data Front End

Page 27: Real Time Semantic Analysis of Streaming Sensor Data

Summarizing Linked Sensor Data

ObservationKB

Sensor KB Location KB(Geonames)

procedure locationlocation

procedure location720F Thermometer Dayton Airport

• ~2 billion triples

• MesoWest

• Static + Dynamic

• 20,000+ systems

• MesoWest

• ~Static

• 230,000+ locations

• Geonames

• ~Static

Find the sensor around Dayton James Cox Airport?

Extract Data for the sensor?

Page 28: Real Time Semantic Analysis of Streaming Sensor Data

xOutline

28

1. Introduction 2.Architecture3.Linked Sensor Data4.Feature Streams5. Demonstration

Page 29: Real Time Semantic Analysis of Streaming Sensor Data

Technology 2: Feature Streams

48th ACM Southeast Conference. ACMSE 2010. Oxford, Mississippi. April 15-17, 2010.

Harshal Patni, Cory Henson, Amit Sheth, Pramod Ananthram, ‘From Real Time Sensor Streams to Real Time Feature Streams,' Kno.e.sis Technical Report, January 2011.

1. What feature is currently being detected by sensor near Dayton Airport?

Page 30: Real Time Semantic Analysis of Streaming Sensor Data

xSystem Architecture

30

Streams Integration based on feature composition

Integrated Stream Analysis to check if the feature is being detected

Page 31: Real Time Semantic Analysis of Streaming Sensor Data

xFeature Composition

31

Page 32: Real Time Semantic Analysis of Streaming Sensor Data

xSystem Capability

32

Page 33: Real Time Semantic Analysis of Streaming Sensor Data

xSystem Feature Integration

33

SELECT

JOIN

Page 34: Real Time Semantic Analysis of Streaming Sensor Data

xSystem Architecture

34

Integrated Stream Analysis to check if the feature is being detected

Page 35: Real Time Semantic Analysis of Streaming Sensor Data

xFeature Definition

35

• Rain Storm NOAA definitionRainStorm = HighWindSpeed(above 35mph) AND

Rain Precipitation AND Temperature(greater than 32F)

SPARQL query for RainStorm

Temperature

Rain Precipitation

WindSpeed

Page 36: Real Time Semantic Analysis of Streaming Sensor Data

xFeature Analysis

36

RDF Feature Stream

Page 37: Real Time Semantic Analysis of Streaming Sensor Data

xRevisiting Abstractions

37

Summarization across Thematic Dimension

Select

Join

Analyze

Background Knowledge

Features representing Real-World Events

Page 38: Real Time Semantic Analysis of Streaming Sensor Data

Summarizing Feature Streams

ObservationKB

Sensor KB Location KB(Geonames)

procedurelocation

procedure location720F Thermometer Dayton Airport

• ~2 billion triples

• MesoWest

• Static + Dynamic

• 20,000+ systems

• MesoWest

• ~Static

• 230,000+ locations

• Geonames

• ~Static

Feature StreamsKB

Find sequence of events near Dayton Airport?

Page 39: Real Time Semantic Analysis of Streaming Sensor Data

xAnswering the query

39

“Find the sequence of weather events observed near Dayton James Cox Airport between Jan 13th and Jan 18th?”

Linked Sensor Data Feature Streams

Page 40: Real Time Semantic Analysis of Streaming Sensor Data

xOutline

40

1. Introduction 2.Architecture3.Linked Sensor Data4.Feature Streams5. Demonstration

Page 41: Real Time Semantic Analysis of Streaming Sensor Data

xDemo

41

• Feature Streams Demo

– http://knoesis1.wright.edu/EventStreams

Page 42: Real Time Semantic Analysis of Streaming Sensor Data

xEvaluation

42

• Data Used: Nevada Blizzard (April 1st – April 6th)

70% Data clear

30% Feature Observed

Page 43: Real Time Semantic Analysis of Streaming Sensor Data

WORKSHOP PAPERS• Harshal Patni, Satya S. Sahoo, Cory Henson, Amit Sheth,

Provenance Aware Linked Sensor Data, 2nd Workshop on Trust and Privacy on Social and Semantic Web,Co-Located with ESWC, Heraklion Greece, May 30th - June 3rd 2010

• Harshal Patni, Cory Henson, Amit Sheth, Linked Sensor Data, In: Proceedings of 2010 International Symposium on Collaborative Technologies and Systems (CTS 2010), Chicago, IL, May 17-21, 2010

TECHNICAL REPORT• Harshal Patni, Cory Henson, Amit Sheth, and Pramod Ananthram.

From Real Time Sensor Streams to Real Time Feature Streams, Kno.e.sis Center Technical Report, December 2009

• Joshua Pschorr, Cory Henson, Harshal Patni, and Amit Sheth. Sensor Discovery on Linked Data, Kno.e.sis Center Technical Report, December 2009

JOURNAL PAPER (In Progress)• Semantic Sensor Web: Design and Application towards weaving a meaningful sensor web

Publications

43

Page 44: Real Time Semantic Analysis of Streaming Sensor Data

Thank You Committee

44

Page 45: Real Time Semantic Analysis of Streaming Sensor Data

Semantic Sensor Web

Thank You

45

Page 46: Real Time Semantic Analysis of Streaming Sensor Data

Demos, Papers and more at: http://wiki.knoesis.org/index.php/SSW

Semantic Sensor Web @ Kno.e.sis

QUESTIONS

46