why this panel? - brown university

33

Upload: others

Post on 19-Apr-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Why This Panel? - Brown University
Page 2: Why This Panel? - Brown University

Why This Panel?

n Database Systems & Storage

u Interesting Research & Business Opportunities

u New Engineering & Performance Frontiers

u Applications - Sky (actually Space) is the Limit

n Prevailing Zeitgeist

“I don’t care where I store and query my data“I don’t care where I store and query my data

as long it is on disk” as long it is on disk”n Why VLDB?

u Jim Gray’s 1998 10-year best paper (DiskShadowing) talk future of Disk Technology

u Put Everything in Future (Disk) Controllers(it’s not “if”, it’s “when”)ä http://www.nsic.org/nasd/1998-jun/gray.pdf

Page 3: Why This Panel? - Brown University

PanelistsPanelists

Talk Slides: www.FileTek.com

Page 4: Why This Panel? - Brown University

FileTek - Co-Founder, Chief Technical OfficerCo-Inventor Storage Machine & StorHouse Product

BS in Electrical Engineering, Virginia Tech

Page 5: Why This Panel? - Brown University

Principal Member Technical Staff - AT&T LabsPh.d Computer Science, New York University, 1990

Page 6: Why This Panel? - Brown University

Professor Computer Science, University of Maryland, College ParkDirector of the Division of Informatics, Department of PathologyProfessor Pathology, John Hopkins University (Medical School)

Page 7: Why This Panel? - Brown University

FileTek, Director New Business DevelopmentExecutive MBA USC 1995

M.S. in Computer Science, NYU, 1979

Page 8: Why This Panel? - Brown University

IBM Toronto - Manager Optimizer GroupPh.d Illinois Institute of Technology

Page 9: Why This Panel? - Brown University

Database

Systems

Stor

age

Tren

ds

DataWarehousing

DatabaseTrends

Storage

Systems

AtomicData

Appli

catio

nsYottabytes?

Panel Topics

Page 10: Why This Panel? - Brown University

Socrates in the Marketplacefor Database & Storage Systems

Page 11: Why This Panel? - Brown University

RAID

Optical

Magnetic

Tape

CA

PA

CIT

YTerabytes Milliseconds $$$ per GB

Terabytes Milliseconds $$ per GB

Petabytes Seconds $ per GB

Petabytes Sub-Minutes $$ per TBExabytes

CO

ST

PE

RF

OR

MA

NC

E

Capacity, Cost & Performance Tradeoffs

Page 12: Why This Panel? - Brown University

StorageMedia

RAID

Optical

Tape

43% of

Disk Dollar

7% of

Disk Dollar

Cost

100% of

Disk Dollar

Page 13: Why This Panel? - Brown University

A 100TB ExampleA 100TB Example

ACCESS TIME RELATIVE HRDWR $/MB

8-80 NS DRAM 400-750

.2-.3 MS Solid State Memory 10-50

4-10 MS Performance Disk (RAID) .60-1.00

5-20 MS Capacity Disk (DASD) .20-.60

5-10 Sec Optical Disk .10-.50

5-60 Sec Tape .05-.07

Page 14: Why This Panel? - Brown University

Enterprise-WideData Warehouse

Data Warehouse

ODS ODS ODS ODS

DM DM

IndependentData Mart

Data Warehouse

ODS ODS ODS ODS

DM

DependentData Mart

ODS ODS ODS ODS

DM DM DM DM

Data Warehouse &Data Mart Architectures

Page 15: Why This Panel? - Brown University

A Place for Everything . . .A Place for Everything . . .

DASDDASD TapeTapeOpticalOptical

Time

Num

ber

of A

cces

ses

Page 16: Why This Panel? - Brown University

What is Atomic-Level Data?

operationaloperational atomic/DWatomic/DW departmentdepartment individualindividual

• detailed• day to day• hightransactions

• applicationoriented

• most granular• time variant• integrated• subjectoriented

• some summary

• parochial• somederived;some atomic

• temporary• ad hocaccess

• PC, work-stationbased

• tools focus

Page 17: Why This Panel? - Brown University

DataMart

OperationalData Store

OperationalData Store

OperationalData Store

CentralDataStoreThe

“Hub”

ExtractLogic

& Process

FeedTheMart

DataMart

DataMart

DataMart

LegacySystems

VALUE OF DETAIL ATOMIC DATA(NOT SUMMARIES)

Page 18: Why This Panel? - Brown University

DATABASE TRENDS: SQL-99UDT (Objects) & UDFs (Functions)

SELECT Stock, Price, Video, PlotDailyValues(History)

FROM StockHistory

WHERE MovingAverage (History, 30) /

LastPrice (History) < .8;

Stock Price Video PlotDailyValues

Felipe.com 500 Art.com 75 Pekka.com 100

Page 19: Why This Panel? - Brown University

n TEXT: Includes Many Data Typesu ASCII & EBCDIC Textu Word & Framemaker Documentsu PDF Files, etc.

PRODUCT PICTURE LITERATURE

Yellow Sticky

SELECT * FROM ProductTableWHERE FindText (Literature, “Yellow Sticky”);

TEXT

Page 20: Why This Panel? - Brown University

n AUDIO & VIDEO: Many Usesu Product Informationu Advertisement (for Web Apps)u Customer Interactions

äFinancial TransactionsäInsurance Claims, etc.

AUDIO

VIDEO

SCALAR DATA AUDIO VIDEO

...VLDB Sizes: Cost-Effective System Needs

Page 21: Why This Panel? - Brown University

Stock Time-Series

AUDIO VIDEOTEXTIMAGE SPATIAL

OBJECT/RELATIONAL DATABASES

OpticalJukebox

TapeLibrary

Disks

LOGICAL DATABASE INFORMATION BACKPLANCE

Page 22: Why This Panel? - Brown University

Federated Databases-- The Players

n IBM DB2 Data Joinern Merant Data Directn Cohera Data Federation Systemn Sybase ACAn IBI EDA/ SQLn SolutionsIQ CONNXn B2 SQL Integrator

Page 23: Why This Panel? - Brown University

MicrosoftSQL Server

ORACLE

DB2

Sybase

TeradataWindows

OS/2

AIX

MacIntosh

Solaris

HP-UX

DB2DataJoiner

AIXWindows

HP-UXSolaris

Clients

Gateway

IMS

VSAM

...

Wrappers

Database Trend:Federated Databases

Database Trend:Federated Databases

Page 24: Why This Panel? - Brown University

Migration Rules[1] Aging Strategy or [2] Predicates

IMSRDBMS

MIGRATE

Item Product Price

Item Product Price

Tape orOptical

Logical Horizontal Partitioning (LHP)

?

Page 25: Why This Panel? - Brown University

Key Product Price

P1 Name $$$

Key Text Video

P1

Key Product Price Text Video

RAID Optical Tape

Logical Vertical Partitioning (LVP)

Page 26: Why This Panel? - Brown University

Storage Cost TrendsStorage Cost Trends

1997 1998 1999 2000 2001 2002 2003 2004

Year

0

1

10

100

1000C

ost

per

GB

of

Sto

rage

($/

GB

)

Tape

Disk

Optical

Page 27: Why This Panel? - Brown University

Information ExplosionInformation Explosion

I

1999I

2000I

2001I

2002I

2003

Cost of Storage

Demand for Data Storage(up 100% each year)

(down 35% each year)

Therefore, it doesn’t matter what the price ofRAID/DASD does -- the customer’s data needswill continue to outpace price decreases!

Page 28: Why This Panel? - Brown University

DATABASE SIZES

VERY LARGE DATABASES (VLDB) SCALAR DATA

n Terabyte (10**12)

u National Retail Point-Of-Sale Data (~30TB)

u Telecommunications Call Detail Warehouse (~200TB)

n Growth To {PETA, EXA, ZETTA, YOTTA}

u Disk-Only Solution:

ä Not Cost-Effective (requires $$$)

ä If You Choose Disk-Based Solution

Then Contact Felipe (can use Stock Tip)

Page 29: Why This Panel? - Brown University

Very Large Databases - Definitions

LARGE OBJECTS,

USER-DEFINED TYPES,

“MUCHO” SCALAR DATA

n Petabyte (10**15)

u Text

u Image Data

u Audio

u Video

Page 30: Why This Panel? - Brown University

Very VLDB - Six/Seven Commas

n Exabyte (10**18) - Six Commas

u US National Medical Insurance Records

u Human Genome (US Population)

u Yes: Image, Audio & Video (Again?)

n Zettabyte (10**21) - Seven Commas

u Spatial & Terrestrial Data

u Video & Audio Archives

Page 31: Why This Panel? - Brown University

The Big One - Yottabyte or Bust!

nn Yottabyte (10**24Yottabyte (10**24) - Eight Commas

u By Moore’s Law:

äYear 2050 Database Size

äMath Sizing Source: Pekka Kostamaa

[email protected]

Page 32: Why This Panel? - Brown University

Socratic Exegesis - VLDB StyleAnd Now Here’s ...

n Ted Johnson - Storage Trends

n John Burgess - StorHouse/RM

n Bill O’Connell - Database Systems (@IBM)

n Joel Saltz - Applications & Tools

u Road to Yottabytes (Today?)

Page 33: Why This Panel? - Brown University

Felipe’s Socratic Exegesisn Jim Gray: Correct

u Disk Storage Trendsu Smarter Disksu Service More DB Functionality

ä For SOME Applications: Think of Disks asShared-Nothing Single-CPU Nodes

u Missing/Un-emphasized Point (Annual Growth)ä Disk Improvement 35%ä Data Growth 100%

n Database Growth Enablersu Federated, Object/Relational (UDT/UDFs)

n How Big? Yottabyte by 2050n Finally

u Remember what happened to Socrates Disciple