tuw- 184.742 data as a service – concepts, design & implementation, and ecosystems

34
Data as a Service Concepts, Design & Implementation, and Ecosystems Hong-Linh Truong Distributed Systems Group, Vienna University of Technology [email protected] http://www.infosys.tuwien.ac.at/staff/truong 1 ASE WS 2012 Advanced Services Engineering, WS 2012

Upload: hong-linh-truong

Post on 22-Apr-2015

582 views

Category:

Design


0 download

DESCRIPTION

This presentation is part of the course "184.742 Advanced Services Engineering" at The Vienna University of Technology, in Winter Semester 2012. Check the course at: http://www.infosys.tuwien.ac.at/teaching/courses/ase/

TRANSCRIPT

Page 1: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data as a Service – Concepts, Design &

Implementation, and Ecosystems

Hong-Linh Truong

Distributed Systems Group,

Vienna University of Technology

[email protected] http://www.infosys.tuwien.ac.at/staff/truong

1 ASE WS 2012

Advanced Services Engineering,

WS 2012

Page 2: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Outline

Data provisioning and data service units

Data-as-a-Service concepts

DaaS design and implementation

DaaS ecosystems

ASE WS 2012 2

Page 3: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data versus data assets

ASE WS 2012 3

Data

Data Assets

Data management

and provisioning

Data concerns

Data collection,

assessment and

enrichment

Page 4: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data provisioning activities and

issues

ASE WS 2012 4

Collect

• Data sources

• Ownership

• Quality assessment and enrichment

Store

• Query and backup capabilities

• Local versus cloud, distributed versus centralized storage

Access

• Interface

• Public versus private access

• Access granularity

• Pricing and licensing model

Utilize

• Alone or in combination with other data sources

• Redistribution

Non-exhausive list! Add your own issues!

Page 5: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Stakeholders in data provisioning

ASE WS 2012 5

Data

Data Provider

• People (individual/crowds/organization)

• Software, Things

Service Provider

• Software and people

Data Consumer

• People, Software, Things

Data Aggregator/Integrator

• Software

• People + software

Data Assessment

• Software and people

Page 6: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Recall – Service Unit

ASE WS 2012 6

Service model

Unit Concept

Service unit

„basic

component“/“basic

function“ modeling

and description

Consumption,

ownership,

provisioning, price, etc.

What about service units providing data?

Page 7: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data service unit

ASE WS 2012 7

Service model

Unit Concept

Data service

unit

Data

Can be used for private

or public

Can be elastic or not

Page 8: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data service units in clouds/internet

Provide data capabilities rather than provide

computation or software capabilities

Providing data in clouds/internet is an increasing

trend

In both business and e-science environments

Bio data, weather data, company balance

sheets, etc., via Web services

8 ASE WS 2012

Page 9: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data service unit

9

Data service units in

clouds/internet

data

Internet/Cloud

Data service unit

People

data

Data service unit

Things

ASE WS 2012

data data

Page 10: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

SO DATA SERVICE UNIT IS

BIG OR SMALL? PROVIDING

REALTIME OR STATIC DATA?

Discussion time

ASE WS 2012 10

Page 12: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data as a Service -- characteristics

On-demand self-service

Capabilities to provision data at different granularities

Resource pooling

Multiple types of data, big, static or near-realtime,raw data and

high-level information

Broad network access

Can be access from anywhere

Rapid elasticity

Easy to add/remove data sources

Measured service

Measuring, monitoring and publishing data concerns and usage

ASE WS 2012 12

Built atop NIST‘s definition

Page 13: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Data-as-a-Service – service models

Data as a Service – service models

and deployment models

ASE WS 2012 13

Storage-as-a-Service

(Basic storage functions)

Database-as-a-Service

(Structured/non-structured

querying systems)

Data publish/subcription

middleware as a service

Sensor-as-a-Service

Private/Public/Hybrid/Community Clouds

deploy

Page 14: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Examples of DaaS

ASE WS 2012 14

Page 15: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

WHAT ELSE DO YOU THINK

CAN BE INCLUDED INTO DAAS

MODELS?

Discussion time

ASE WS 2012 15

Page 16: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

DaaS design & implementation –

APIs

Read-only DaaS versus CRUD DaaS APIs

Service APIs versus Data APIs

They are not the same wrt concerns

SOAP versus REST

ASE WS 2012 16

Example: infochimps

Page 17: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

DaaS design & implementation –

service provider vs data provider

The DaaS provider is separated from the data

provider

17

DaaS

Consumer

DaaS

Sensor

DaaS

Consumer DaaS provider Data

provider

ASE WS 2012

Page 18: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Example: DaaS provider =! data

provider

18

Page 19: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

DaaS design & implementation –

structures

DaaS and data providers have the right to

publish the data

ASE WS 2012 19

DaaS

• Service APIs

• Data APIs for the whole resource

Data Resource

• Data APIs for particular resources

• Data APIs for data items

Data Items

• Data APIs for data items

Three levels

Page 20: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

20

DaaS design & implementation –

structures (2)

Data

items

Data

items

Data

items

Data resource

Data

assets

Data resource Data resource

Data resource Data resource

Consumer

Consumer

DaaS

ASE WS 2012

Page 21: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

DaaS design & implementation –

patterns for „turning data to DaaS“ (1)

ASE WS 2012 21

DaaS data Build Data

Service

APIs

Deploy

Data

Service

Examples: using WSO2 data service

Page 22: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Storage/Database

-as-a-Service

DaaS design & implementation –

patterns for „turning data to DaaS“ (2)

ASE WS 2012 22

data

Examples: using

Amazon S3

DaaS

Page 23: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Storage/Databa

se/Middleware

DaaS design & implementation –

patterns for „turning data to DaaS“ (3)

ASE WS 2012 23

data

Examples: using

COSM/Pachube

Things

One thing 10000... things

DaaS

Page 24: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Storage/Database/

Middleware

DaaS design & implementation –

patterns for „turning data to DaaS“ (4)

ASE WS 2012 24

data

Examples: using Twitter

People DaaS

Page 25: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

....

DaaS design & implementation –

not just „functional“ aspects (1)

ASE WS 2012 25

data DaaS .... data assets

Data

concerns

Quality of

data Ownership

Price License ....

Enrichment Cleansing

Profiling

Integration ...

Data Assessment

/Improvement

APIs, Querying, Data Management, etc.

Page 26: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

DaaS design & implementation –

not just „functional“ aspects (2)

ASE WS 2012 26

Understand the DaaS ecosystem

Specifying, Evaluating and Provisioning Data

concerns and Data Contract

In follow-up

lectures

Page 27: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

WHAT ARE OTHER PATTERNS

IN „TURNING DATA TO

DAAS“?

Discussion time

ASE WS 2012 27

Page 28: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

DaaS ecosystems

ASE WS 2012 28

Data Assessment and Enrichment

Marco Comerio, Hong Linh Truong, Carlo Batini, Schahram Dustdar: Service-oriented data quality engineering and

data publishing in the cloud. SOCA 2010: 1-6

Page 29: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Examples of service units in DaaS

ecosystems

ASE WS 2012 29

Platforms/services Capabilities

Strikeiron clean, verify and validate data.

Jigsaw clean, verify and validate

business contact.

PostcodeAnywhere capture, clean, validate

and enrich business data.

Trillium Software Quality clean and standardize data

Uniserv Data Quality Solution X profile and clean data

Adeptia Integration Solution integrate data

Marco Comerio, Hong Linh Truong, Carlo Batini, Schahram Dustdar: Service-oriented data quality engineering and

data publishing in the cloud. SOCA 2010: 1-6

Page 30: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

DaaS ecosystem –

profiling/enriching example

ASE WS 2012 30

http://www.strikeiron.com/

Page 31: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Cloud-based conceptual architecture

for data quality and enrichment

ASE WS 2012 31

Marco Comerio, Hong Linh Truong, Carlo Batini, Schahram Dustdar: Service-oriented data quality engineering and

data publishing in the cloud. SOCA 2010: 1-6

Page 32: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

WHY DO YOU NEED TO STUDY

DAAS CONCEPTS, DESIGN

AND IMPLEMENTATION, AND

ECOSYSTEMS?

Discussion time

ASE WS 2012 32

Page 33: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

Exercises

Read mentioned papers

Check characteristics, service models and

deployment models of mentioned DaaS (and

find out more)

Identify services in the ecosystem of some DaaS

Write small programs to test public DaaS, such

as COSM/Pachube, Microsoft Azure and

Infochimps

Turn some data to DaaS using existing tools

ASE WS 2012 33

Page 34: TUW- 184.742 Data as a Service – Concepts, Design & Implementation, and Ecosystems

34

Thanks for your attention

Hong-Linh Truong

Distributed Systems Group

Vienna University of Technology

[email protected]

http://www.infosys.tuwien.ac.at/staff/truong

ASE WS 2012