sa info archive-s_aksenenko
TRANSCRIPT
1 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved. EMC InfoArchive – Enterprise Archiving for Maximum Value
EMC InfoArchive От хаоса к порядку
День технологий ЕМС в ГУАП
Аксёненко Сергей
2 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
3 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
Legacy & Live Applications
Archive Services
Structured Data Unstructured Content
EIA GUI
Data Access Ingestion Management
Connectors
Storage Platform EMC Atmos, Isilon, Data Domain, Centera + others
InfoArchive
Architectural Overview
4 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
InfoArchive & Archive Process
Connector Transfer Ingestion Access Archive Services
GUI
Archive Retrieval
Pre InfoArchive InfoArchive
Source Application
5 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
InfoArchive Vocabulary SIP, AIP & Archive Holdings
Archive Storage Ingestion
Data
Access
Archive Services
EMC InfoArchive
Submission Information Packages
SIP’s
Archive Information Packages AIP’s
Data package containing information to be archived
Archive Holdings for information
classification & segregation
6 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
What do we archive?
SIP – Submission Information Package
SIP
7 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
What is in a SIP SIP Descriptor – Describes the data in the SIP and includes
information used by InfoArchive for searching, classification, retention management and ingestion prioritisation.
SIP Data – The data to be archived.
SIP
SIP Descriptor
SIP Data
8 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
Standards Based
WHY XML
• Open, self describing: Ideal for long term retention
• Versatility: Support any data model or metadata structure
• Enables addition of contextual information to records
• Wide support: “render as XML” in applications and data extraction tools
9 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
Active Archiving
10 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
The Active Archiving Process
Periodic archiving of subsets of data
Frequency of archiving is based on
BUSINESS RULES
AC
TIV
E O
PERATIO
NAL S
YSTEM
Structured
Inactive
Active
Unstructured
Inactive
Active
11 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
ACTIVE APPLICATIONS READ & WRITE
Active Archiving
EMC INFOARCHIVE
PROCESS Indentify information to archive • Required for compliance • High growth records Configure connector • Extract information • Archive policies i.e. when to archive • Purge Configure InfoArchive • Archive holding • Retention policies • Search
ARCHIVE OPTIONS
12 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
Application Decommissioning
13 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
The Application Decommissioning Process
One off batch archiving of
ALL SYSTEM DATA
LEG
ACY S
YSTEM
Structured
Inactive
Unstructured
Inactive
14 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
LEGACY APPLICATIONS READ ONLY
EMC INFOARCHIVE
SHUT DOWN PROCESS
Indentify applications to decommission • Information must be retained Define Access & Report Requirements • Keep to minimum Configure extraction tool • Existing ETL, InfoArchive Connector Archive Information • Validate Configure InfoArchive • Retention • Search & reports Shut down application
Application Decommissioning
ARCHIVE OPTIONS
15 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
InfoArchive Technologies
16 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.
•HTML5
•AngularJS
•BootsrapCSS UI
• Java
•Spring Framework
•Web services, REST Application
•xDB: XML, XQuery, XPath
•EMC Documentum
•Apache Hadoop Persistence