distributed data management
DESCRIPTION
Distributed Data Management. Graeme Kerr Oracle in R&D Programme. Distributed Data Management. Provides access to data when and where you need it Optimizes information access regardless of physical location of data. Enables virtualisation of data in Grids. Types of Data: - PowerPoint PPT PresentationTRANSCRIPT
<Insert Picture Here>
Distributed Data ManagementGraeme KerrOracle in R&D Programme
Distributed Data Management
• Provides access to data when and where you need it• Optimizes information access regardless of physical
location of data.• Enables virtualisation of data in Grids.
• Types of Data:• Flat Files, Relational.• Structured & unstructured.• Text, Binary, proprietary.• Scalar, user defined typed, XML, RDF• Images, DICOM, LOB, Spatial.• High Volumes.
Methods of Data Distribution
• Distribute• Federate• Consolidate
Distribute Information
• Share information between users, applications, databases, organisations and partners. • Move or copy / replicate information as needed• Traditionally implemented as replication or message
queuing• Wide range of technology solutions to achieving distribution.• Continuous feed and periodic refresh capabilities are key
requirements.
• Common within Scientific Grid Approach• Works well when replicas are read-only.• Considerable issues when replicas are updateable.
Data Distribution - Example
SOURCE SITE
REPLICA SITES
Continuous Feeds
Data Distribution - Example
SOURCE SITE
REPLICA SITES
Continuous Feeds
Periodic Refresh
Data Distribution - Example
SOURCE DATABASE
REPLICA DATABASES
Oracle STREAMS
Transportable Tablespaces,RMAN,
Data Pump
CERN LCG Example
RAL
BNL
CNAFIN2P3
Gridka
ASGC
TRIUMPF
SARA
PIC
NorduGrid
Online DB @ Detector
Offline DB @ CERN IT
Methods of Data Distribution
• Distribute• Federate• Consolidate
Federate Information
• Federate data in multiple data stores into a single virtual database• Access both Database and non-Database data• Access both structured and unstructured data• Hide physical location of data from applications• Vendor and open-source solutions.
• Used mostly as point solution within Commercial World rather than as a key design point.
Data Federation - Example
DB2Excel
WebServices
Data Federation - Example
DB2
XML
ExcelFlatFilesText
WebServices
FlatFiles Sybase
Data Federation - Example
DB2
XML
ExcelFlatFilesText
WebServices
FlatFiles Sybase
Data Federation - Example
DB2
XML
ExcelFlatFilesText
WebServices
FlatFiles Sybase
Methods of Data Distribution
• Distribute• Federate• Consolidate
Consolidate Information
• Consolidate heterogeneous data into a central pool.• Single database or pool of databases• Any data (audio, video, XML, email, messages, etc…)• Any platform (Linux, Windows, Solaris, HPUX, AIX, Tru64,
OpenVMS, OS/390)• Proven scalability
• Hundreds of Very Large Database customers• Industry leading benchmarks
• Very common in commercial world.
Typical Data Centre View
DB Clusters
Disk
DB Users
Consolidated Clusters with RAC & ASM
DB Cluster
Pool
Disk Pool
DB Users
DB Server & Disk Virtualisation
DB Cluster
Pool
Disk Pool
DB Users
DB Server & Disk Virtualisation
DB Cluster
Pool
Disk Pool
DB Users
DB Server & Disk Virtualisation
DB Cluster
Pool
Disk Pool
DB Users
Distributed Data Management : Summary
• Oracle supports efficient utilization of all resources:• Consolidate Information—Scalable access to large amounts
of information• Federate Information—Data can be located where needed
but accessed as single database• Share Information—Multiple users and applications can share
data/information to operate more effectively
• Information Integration & fast connectivity critical to Grid Computing
Oracle Database Versions.
• Database 10G Express Edition• Entry-Level, small footprint, free to develop, deploy & distribute.• 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu & Mandriva
• Database 10G Enterprise Edition• Flagship, Full-featured version.
• Database 10G Standard Edition• Limited featured version, RAC, ASM, Streams Apply supported
• Database 10G Standard Edition 1• SE on limited hardware. Streams Apply supported.
• Data Grid / Distributed Caching• TimesTen • TangoSol Coherence
Oracle Database Versions.
• Database 10G Express Edition• Entry-Level, small footprint, free to develop, deploy & distribute.• 32 Bit Windows, RHEL3 & 4, FC 4, SLES9, Debian 3.1, Ubuntu &
Mandriva
• Database 10G Enterprise Edition• Flagship, Full-featured version.
• Database 10G Standard Edition• Limited featured version, RAC, ASM, Streams Apply supported
• Database 10G Standard Edition 1• SE on limited hardware. Streams Apply supported.
• Data Grid / Distributed Caching• TimesTen • TangoSol Coherence
For More Information
Database: www.oracle.com/database/index.html
Express Edition: www.oracle.com/technology/products/database/xe/index.html
Tangosol: www.oracle.com/tangosol/index.html
Times 10: www.oracle.com/database/timesten.html
or
http://search.oracle.com
Database | Tangosol | etc
EMEA Oracle in R&D Programme
[email protected]@oracle.com