Technology Trends
ILM
Tiered Storage
Optical Replacement
Information Lifecycle Management
New Buzzword in the Industry “ILM is a Combination of Processes and
Technology that Determines how Data Ages and Flows Through an Environment.”
Storage Networking World
“It Helps End Users Manage Data from the Moment it is Created to the Time it is no Longer Needed.”
Storage Networking World
Information Lifecycle of Content
The Right Platform, at the Right Time, for the Right Cost
Day-to-day business, governance, and compliance
Capture
Create
Collaborate
Version
Manage
Publish
Archive Retire
Re-archive
Query
Changing Information Value of Enterprise Content
SAN Nearline Archive
10TB of Online Storage
(20% Online, 20% Inline, 30% NL, 30% Archive)
(20% Online, 20% NL, 60% Archive)
3TB @ 001/MB2TB @ $.11/MB
Reduce Storage TCO
10TB @ $.25/MB
2TB@$.25/MB 3TB @ 02/MB $783,000
2TB@$.25/MB 2TB @.02/MB 6,000*.001/MB $546,000
$2,500,000
ILM – Components Data Classification
Structured (DBs), SemiStructured (email), Unstructured (documents)
Mission Critical, Business Critical, Operational Age, Access Frequency, Performance Requirements, Availability SRM and ICM tools
Data Policies Time in a location, flow, number of copies, destruction Policy Architects should have intimate familiarity with the business
Data Management Software to implement the policies – HSM, ECM
Tiered Infrastructure Physical structures FC, SATA, CAS, Tape, WORM
ILM Classification Example & Tools
Information Classification Management (ICM) tools Kazeon StoredIQ Scentric
An emerging market
MCSI Strategy
Em
ail
Arc
hiv
e
D2D
Bac
kup
HS
M
SAN
NAS
Sto
rag
eC
on
soli
da
tio
nE
nte
rpri
se-
Wid
e B
ack
up
SAN
NAS
Storage Resource Management
(SRM)
The Evolution of SRMHome-Grown
“SRM”
First-Generation
SRM Modern SRM The Business Process
None Minimum, if any Policy and best-practices implementation
– Application and filesystem policies
– Continuous, automated management
Proactive software solutions– Mission-critical application
support – Policy-driven storage
management – Automated corrective actions – Automated reporting – Multi-level thresholds with
action
API, bi-directional integration
Uniform heterogeneous platform support
The Tools Home-grown Scripts Batch files Spreadsheets
Passive software tools Monitoring Notification Reporting
The Technology
Multiple versions of scripts for each platform
Rudimentary reporting
Command line interface (CLI)
Advanced reporting
Centralized server and data repository
Graphical user interface
Typical SRM EnvironmentApplications File / printOracleSQL ServerExchange
PrimaryServer
Browser
NASDAS
SAN
Storage Neutral SAN / NAS / DAS
DatabaseServer
MailServer
Platforms Windows (NT, 2000, 2003)UNIX (Solaris, HP, Linux, AIX)Novel Netware
LAN
Process Flow - Installing & Configuring
Deploy agentsUse wizards to deploy agentsShow the agent types
Get data from agents Create scanning policiesScan data
Find key problemsIn user interface In reports
Fix problems Intelligent Actions
OptionalThresholdsDashboardSecurity
Availability• Thresholds warn you when you approach dangerous levels• Intelligent Actions automatically triggers when thresholds are breached• Reports show what files are at risk due to failed backups
Automation• Automate housekeeping tasks with Intelligent Actions• Create best practice policies to maintain healthy storage• Become operating system and storage-agnostic with Intelligent Actions
Utilization• Comprehensive reports that provide summary and file-level detail• Snapshots in the user interface provide a quick view of your filesystems, mail servers, and databases
SRM Answers to Your Questions
Hierarchical Storage Management
(HSM)
HSM
What is it? Automated, policy-based data movement with transparent
access for unstructured data in File Systems and structured data in Database and Email systems.
Implements Archiving and ILM What is the primary benefit?
Lower Total Cost of Ownership Other benefits
Regulatory Compliance and Litigation Discovery Improved Performance
databases, email, backup
Motivating Factors Archiving
Source: “Leveraging E-mail Archiving for Records and Storage Management,” Radicati Group, March 17, 2005.
HSM
Current Approaches are ineffective Just add more disk Inefficient use of high-performance, high-cost disk Manual file deletion or movement efforts Selective backup Use of Backup as Archive Key
Point!
• Often 80% of files haven’t been accessed in 30 days
• Primary storage of inactive data is costly, inefficient
Estimate based on analysis of 1995-2004 industry reports and field experience.
0
500
1000
1500
2000
2500
3000
3500
4000
1995
1996
1997
1998
1999
2000
2001
2002
2003
2004
Inactive Data
Active Data
Gig
abyt
es
Typical Enterprise Data Growth
80%
20%Active Data
Inactive Data
* If a large percentage of your data hasn’t been accessed in months, do you still need to back it up every week??
Inactive Data Consumes Storage/Staff Resources
HSM Data Model
Nearline / Removable Media
Secondary / NearlinePrimary Data-Store
End-User View Application Store Archive & Offline Store
Common Archive/Offline DataStore Shared Media Infrastructure
A A
Backup Requirement
Entire File1
BB B Entire File or Stub2Copy
C CC Stub3Copy +
Purge
D None4Move
D
SRM & HSM – Major Players
SRM EMC
VisualSRM STK
GSM Symantec
Enterprise Vault (KVS) StorageExec
IBM Tivoli SRM
CommVault Qinetix
TekTools
HSM EMC
DiskXtender Symantec
Enterprise Vault (KVS) Sun
Utilization Suite (SAM-FS)
IBM Tivoli
HP StorageWorks FMA
SATA & Tiered Storage
Microsoft PowerPoint Presentation
Optical Replacement
Content Addressable Storage (CAS)
What is CAS?A Business Perspective
Technology for effective long-term storage of massive quantities of Fixed Content
Provides: On-line access Retention for decades Object authenticity and integrity Content Protection & Security:
Business ContinuanceDisaster Recovery
Offers Lower Total Cost of Ownership
Acquisition, operation, management
Self-configured & managed Non-disruptive scalability
E-mail Archives
Check Images
Documents
Business Records
Medical Records
CAD/CAM
Fixed Content Market Segments
2004 Total Fixed Content Market
M/F Tape 38%
Misc.Devices
3%
Optical 4%CAS/Other
Disk
25%
Open Sys. Tape 30%
Market Position: An Active Archive
Definition – An archive for Fixed Content - unchanging digital assets that are retained for active reference and long-term business value
An effective archive must provide:
• Meets requirements for business continuity and disaster recovery
• Requires minimal administration Low TCO
• Massively scalable to petabytes
• Technology obsolescence protection
• Assurance content has not been modified
• Where needed, compliance with regulatory requirements
• Fast access – 1-10 seconds in multi-terabyte environments
• Anywhere, any time
ManageabilityManageability
LongevityLongevity
AuthenticityAuthenticity
AccessibilityAccessibility
LowLowUpdate RatesUpdate Rates
ModerateModerateUpdate RatesUpdate Rates
High High Update RatesUpdate Rates
ex. TRANSACTIONAL ex. COLLABORATIVE ex. ARCHIVAL
Application Data Usage
SANSAN NASNAS CASCAS
SERVICE LEVEL
CO
ST
Directo
rs
Switches
SERVICE LEVEL
CO
ST
Gateway
SAN + NAS
Integrated
NAS
SERVICE LEVEL
CO
ST
Complia
nce
Intelligent
Archive
SANSAN NASNAS CASCAS
Typical applications
Purpose
Economic Advantage
Fibre Channel Based
OLTP, data warehousing, ERP, large databases
Leverage management cost of a storage infrastructure
For transferring data between servers and storage devices
IP & SATA Based
Software development, product design and development, small databases, workgroup applications
For file sharing and economical data base connectivity
Lowest cost to connect and share files
Fixed Content
E-mail archiving, ContentManagement, Database Archiving, Medical Imaging
For fast access and assured authenticity of fixed content
Best TCO/ROI
Networked Access
CAS Positioningin the ILM Infrastructure
Centera
Storage Designed for Long-Term Active Archiving of Fixed Content
EMC Centera
The world’s first Content Addressed Storage system
Specifically designed for active archiving
Provides online access to fixed content while preserving the authenticity of data like optical,at a total cost of ownership comparable to tape archives Centera for
midsizeenterprises
Simple
Scalable
Secure
Centera Momentum
Introduced in April 2002 Fastest growing product in EMC history Over
10PB shipped through 2003 50PB shipped through 2004
More than: 1200 customers 400 partners and resellers (at launch in 2002 30+ Partners)
The #1 reason customers buy – they create an “active” enterprise archive. Online access with assured content authenticity at a TCO
better than a tape archive.
Medical Imaging (12)AGFA/Mitra AmicasDigital Healthcare DR Systems Dynamic Imaging Fuji Medical GE Healthcare* Insignia Medical Systems Kodak (VIPArchive) McKesson - PACSMedi-Face Merge
E-Learning (1)DCI
Legal Discovery /Compliance (1) OpenPages*
Call Center (3) Mercom*NICE Systems*Savatar*
E-mail / InstantMessage (10)
Axs One* CaminoSoft Zantaz/Educom* Entelagent* FaceTime* iLumin* KVS* LEGATO* Nexic* Tumbleweed*
ECM (58)Absolute Data Alliance AMCO Arcartus*Artis(ADA System)BCT*Beta Systems* CeSoft Ceyoniq* COI D.Velop daa Systemhaus DocsFlow Documentum*EDS – PLM/UniGraphics
eiStream (Identitech*)eiStream (Kovis)eiStream (ViewStar)* ELO – Digital Office Exigen FabaSoft FileNet* Heilig & Schubert Hummingbird* HylandIBM*
ImageNow (Perceptive Vision) ImageRight IMTF – Informatic InfoCamere Informatica El Corte Ingles* INSCI* IQ Doq
KnowledgeLake* MAAS MarcosInnovation* MDY*
Meridio Mobius* New Vision Systems Novamens*
Media and Entertainment /Surveillance (3)
ArtesiaS4MVirage
Backup/Archiving/Workflow (44)
AAM* Amdocs*
Anecon Anexinet*
Arkivio*Autonomy* AXS Technologies BancTec Boomerang Software CeDemension CIL Centros de Informatica S.A.*
CitecCommvault Connected* CYA Data Strategies*
Dolphin Software*Dorset Electro-Concept GFT Groupe STS
Guidance Software HyperTrust IXOS* LizardTech LuTech*
MHM Computer s.r.o. * Most Prl NetStorage Newell & Budge Nirvana Orchestria* Outerbay
Oxford Arch Digital* PDS Image* Princeton Softech* Proxy
Q-Star* Seven Ten Storage
HSM/FS Gateways (8) AvalonCaminoSoft* FileTek InnoDigital LEGATO*Storage Switch* Storigen Triade
Document / Check Imaging (6)
AFS CSC EPSIIA ImageSoft RDM Corp Unisys PIA
Mainframe (2) BMC*GFS*
Life Sciences (3) Aurora InteractiveCellomics*Scientific SW
Mainframe Gateway (2)
Bustech*Intercom**
* Compliant Edition Integration** Where supported by Comparex
200+ Centera ISV SolutionsECM (continued)OCEOptika*
PDS Image Resolutions*
RSDSaperionSER (DOXIS-ITA) Siemens Business Services System Ware* Vignette* (Tower Tech)T-Systems (ImageMaster) Union Information Unisys – Info Image* UPTIME
WindFire Technology Windream
Oil and Gas (1) Enigma Data Systems*
Backup/Archiving/Workflow (Cont’d)Synstar
Tarmin Solutions Ltd.*Total Information Mgmt U-Know
Veritas
IXOS
EMC Centera 4-Node Configuration
Lower entry price point for customers and partners
2.2 TB usable capacity Rackable configuration
Can be purchased with or without an EMC rack Same rich functionality as in all Centera
configurations
NEW
Universal Access
Centera
Anywhere, any time, any application from virtually any platform
Centera API
NFS
CIFS FTP
HTTP
Emulation
Application Database stores Content Address
for future reference
Centera performs Content Address calculation and sends address
back to application
Application server sends object to Centera over IP network
Object is created and sent to
application server
How Centera Works
LAN CACACACA
Content Addressing
10001010
Digital fingerprint
Globally unique
Location- independent
ContentAddress
algorithm
Content Address
algorithm10111011
Centera ArchitectureCentera Node
Storage nodes Min 2 with the role access node 2.8 GHz P4 processor 1024 MB DDR RAM Four x 320 GB serial ATA-II Two 1 Gbit network-interfaces One 1 Gbit to outside LAN
(copper/optical)
Centera Network Dual 24-port cube switches Gigabit Ethernet connections to
facilitate additional cubes Redundant connection to each node
Extreme Scalability Massive parallel processing Add storage: processing power,
memory, bandwidth
16-nodecube
Multiple cubes form a single cluster
Four node
Redundant Array of Independent Nodes (RAIN)
2 cubes/cabinet
Centera CentraStar O/S: Self-Managing and Self-Configuring
No complex storage-area networking management
No filesystem management
Eight-nodesGen2 CAB
+
Eight-nodeGen3 CAB
1. Nondisruptive upgrade2. One addressable pool3. Capacity automatically grows4. Complete location independence maintained5. Constant validation of content objects and structures 6. Multi-generation hardware support 7. Users shielded from hardware changes
=
No LUN / RAID Group carving or allocation
Retention and Disposition Functionality
FeaturesGovernance
EditionCompliance Edition
Plus
Enforced retention period Yes Yes
Retention classes Yes Yes
Audited delete Yes No
Configurable default retention period
Can be set by SYSOP; factory-set
to zero
Automatic default to infinity
Shredding Yes Yes
Remote management Yes No
36
Case Study: Financial-Services Firm
Decision Criteria
Improve:
Availability
Useful life of the physical device
Storage capacity
Supportability
Total cost of ownership
Centera’s Value Proposition Online availability cut access
time in half
Reduced downtime—saving $5 million per hour
Saved 120 hours per month of system-management time
Meet compliance issues, including requirement to fully "shred,“ and retention periods of seven years
Business Challenge:
Optical as the repository for regulated contentassured content integrity, but it was slow, unreliable,and time-consuming to manage and costly
LAN
Application clients
Traditional disk
Content and documents are stored on CAS
Index of contentand documents
Optical
CAS
Content anddocument applicationserver
Enterprise Content / Document Management
Case Study: Sixt AG (European Auto-Rental Firm)
Decision Criteria Improve access
time
Increase reliability
Reduce people resources
Lower total cost of ownership
Centera’s Value Proposition Significantly improved
response time
Reduced end-user and IT time focused on the solution
Using Centera saves Sixt AG $3 million per year
Business Challenge:Optical—used as the repository for five million auto-rental agreements—viewed as too slow, costly, and inefficient LAN
Desktop clients
Rental agreements stored on CAS
Optical
Centera
Rental agreement applicationserver
Enterprise Document Management
Summary“The Best of Many Worlds”
+ Online, like disk
+ WORM (Write Once, Read Many), like optical
+ Optimized total cost of ownership, like tape
+ CAS benefits:+ Guaranteed and verifiable authenticity + Duplicate elimination+ Protection from technology changes+ Self-managing, self-configuring, and self-healing
= EMC Centera Content Addressed Storage
Not previously available
ILM Summary
Definition: Processes and Tools Data Classification
SRM & ICM Data Policies
Newspaper in the Library example Data Movement
HSM & ECM Tiered Infrastructure
FC SAN, IP/NAS, CAS, Offline