distributed data management

27
<Insert Picture Here> Distributed Data Management Graeme Kerr Oracle in R&D Programme

Upload: savanna-angeloz

Post on 02-Jan-2016

40 views

Category:

Documents


0 download

DESCRIPTION

Distributed Data Management. Graeme Kerr Oracle in R&D Programme. Distributed Data Management. Provides access to data when and where you need it Optimizes information access regardless of physical location of data. Enables virtualisation of data in Grids. Types of Data: - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Distributed Data Management

<Insert Picture Here>

Distributed Data ManagementGraeme KerrOracle in R&D Programme

Page 2: Distributed Data Management

Distributed Data Management

• Provides access to data when and where you need it• Optimizes information access regardless of physical

location of data.• Enables virtualisation of data in Grids.

• Types of Data:• Flat Files, Relational.• Structured & unstructured.• Text, Binary, proprietary.• Scalar, user defined typed, XML, RDF• Images, DICOM, LOB, Spatial.• High Volumes.

Page 3: Distributed Data Management

Methods of Data Distribution

• Distribute• Federate• Consolidate

Page 4: Distributed Data Management

Distribute Information

• Share information between users, applications, databases, organisations and partners. • Move or copy / replicate information as needed• Traditionally implemented as replication or message

queuing• Wide range of technology solutions to achieving distribution.• Continuous feed and periodic refresh capabilities are key

requirements.

• Common within Scientific Grid Approach• Works well when replicas are read-only.• Considerable issues when replicas are updateable.

Page 5: Distributed Data Management

Data Distribution - Example

SOURCE SITE

REPLICA SITES

Continuous Feeds

Page 6: Distributed Data Management

Data Distribution - Example

SOURCE SITE

REPLICA SITES

Continuous Feeds

Periodic Refresh

Page 7: Distributed Data Management

Data Distribution - Example

SOURCE DATABASE

REPLICA DATABASES

Oracle STREAMS

Transportable Tablespaces,RMAN,

Data Pump

Page 8: Distributed Data Management

CERN LCG Example

RAL

BNL

CNAFIN2P3

Gridka

ASGC

TRIUMPF

SARA

PIC

NorduGrid

Online DB @ Detector

Offline DB @ CERN IT

Page 9: Distributed Data Management

Methods of Data Distribution

• Distribute• Federate• Consolidate

Page 10: Distributed Data Management

Federate Information

• Federate data in multiple data stores into a single virtual database• Access both Database and non-Database data• Access both structured and unstructured data• Hide physical location of data from applications• Vendor and open-source solutions.

• Used mostly as point solution within Commercial World rather than as a key design point.

Page 11: Distributed Data Management

Data Federation - Example

DB2Excel

WebServices

Page 12: Distributed Data Management

Data Federation - Example

DB2

XML

ExcelFlatFilesText

WebServices

FlatFiles Sybase

Page 13: Distributed Data Management

Data Federation - Example

DB2

XML

ExcelFlatFilesText

WebServices

FlatFiles Sybase

Page 14: Distributed Data Management

Data Federation - Example

DB2

XML

ExcelFlatFilesText

WebServices

FlatFiles Sybase

Page 15: Distributed Data Management

Methods of Data Distribution

• Distribute• Federate• Consolidate

Page 16: Distributed Data Management

Consolidate Information

• Consolidate heterogeneous data into a central pool.• Single database or pool of databases• Any data (audio, video, XML, email, messages, etc…)• Any platform (Linux, Windows, Solaris, HPUX, AIX, Tru64,

OpenVMS, OS/390)• Proven scalability

• Hundreds of Very Large Database customers• Industry leading benchmarks

• Very common in commercial world.

Page 17: Distributed Data Management

Typical Data Centre View

DB Clusters

Disk

DB Users

Page 18: Distributed Data Management

Consolidated Clusters with RAC & ASM

DB Cluster

Pool

Disk Pool

DB Users

Page 19: Distributed Data Management

DB Server & Disk Virtualisation

DB Cluster

Pool

Disk Pool

DB Users

Page 20: Distributed Data Management

DB Server & Disk Virtualisation

DB Cluster

Pool

Disk Pool

DB Users

Page 21: Distributed Data Management

DB Server & Disk Virtualisation

DB Cluster

Pool

Disk Pool

DB Users

Page 22: Distributed Data Management

Distributed Data Management : Summary

• Oracle supports efficient utilization of all resources:• Consolidate Information—Scalable access to large amounts

of information• Federate Information—Data can be located where needed

but accessed as single database• Share Information—Multiple users and applications can share

data/information to operate more effectively

• Information Integration & fast connectivity critical to Grid Computing

Page 23: Distributed Data Management

Oracle Database Versions.

• Database 10G Express Edition• Entry-Level, small footprint, free to develop, deploy & distribute.• 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu & Mandriva

• Database 10G Enterprise Edition• Flagship, Full-featured version.

• Database 10G Standard Edition• Limited featured version, RAC, ASM, Streams Apply supported

• Database 10G Standard Edition 1• SE on limited hardware. Streams Apply supported.

• Data Grid / Distributed Caching• TimesTen • TangoSol Coherence

Page 24: Distributed Data Management

Oracle Database Versions.

• Database 10G Express Edition• Entry-Level, small footprint, free to develop, deploy & distribute.• 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu &

Mandriva

• Database 10G Enterprise Edition• Flagship, Full-featured version.

• Database 10G Standard Edition• Limited featured version, RAC, ASM, Streams Apply supported

• Database 10G Standard Edition 1• SE on limited hardware. Streams Apply supported.

• Data Grid / Distributed Caching• TimesTen • TangoSol Coherence

Page 25: Distributed Data Management

For More Information

Database: www.oracle.com/database/index.html

Express Edition: www.oracle.com/technology/products/database/xe/index.html

Tangosol: www.oracle.com/tangosol/index.html

Times 10: www.oracle.com/database/timesten.html

or

http://search.oracle.com

Database | Tangosol | etc

Page 26: Distributed Data Management

EMEA Oracle in R&D Programme

[email protected]@oracle.com

Page 27: Distributed Data Management