hitachi solution for databases - optimized enterprise data warehouse · databases – optimized...

5
SOLUTION PROFILE Reduce Costs, Speed Access and Gain More Value Hitachi Solution for Databases – Optimized Enterprise Data Warehouse Data is at the core of modern digital business, but immense data growth presents challenges for IT organizations. As data volume increases, inefficiencies in your enterprise data warehouse (EDW) can prevent you from realizing the full value of your data. Extract-transform-load (ETL) processes consume more compute and storage resources, leading to higher licensing and management costs. Scheduled downtime to manually manage databases interrupts availability. Backup and archive cycles increasingly take longer, significantly slowing access times for users. Query operations must sort through massive amounts of data, much of it cold, infrequently accessed or irrelevant. As a result, you experience degraded performance. Optimizing your EDW by offloading cold and unused data to a data lake can help you overcome these challenges. With this approach, you can reduce costs, deliver faster access to data, and provide better information for decision-making.

Upload: others

Post on 24-May-2020

26 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Hitachi Solution for Databases - Optimized Enterprise Data Warehouse · Databases – Optimized Enterprise Data Warehouse Data is at the core of modern digital business, but immense

SOLUTION PROFILE

Reduce Costs, Speed Access and Gain More Value

Hitachi Solution for Databases – Optimized Enterprise Data Warehouse

Data is at the core of modern digital business, but immense data growth presents challenges for IT

organizations. As data volume increases, inefficiencies in your enterprise data warehouse (EDW) can

prevent you from realizing the full value of your data. Extract-transform-load (ETL) processes consume more

compute and storage resources, leading to higher licensing and management costs. Scheduled downtime

to manually manage databases interrupts availability. Backup and archive cycles increasingly take longer,

significantly slowing access times for users. Query operations must sort through massive amounts of data,

much of it cold, infrequently accessed or irrelevant. As a result, you experience degraded performance.

Optimizing your EDW by offloading cold and unused data to a data lake can help you overcome these

challenges. With this approach, you can reduce costs, deliver faster access to data,

and provide better information for decision-making.

Page 2: Hitachi Solution for Databases - Optimized Enterprise Data Warehouse · Databases – Optimized Enterprise Data Warehouse Data is at the core of modern digital business, but immense

2

The typical EDW architecture is inefficient. Up to 70% of the data stored is cold or unused, resulting in increased query and backup times as well as higher costs.

EDW Inefficiencies Impede Innovation and Digital TransformationOptimizing your enterprise data warehouse is critical. In a typical EDW, 50-70% of data is cold or unused, while only 2.8% of data is hot1. You are challenged to offload unused and cold data to a cost-effective, certified NoSQL MongoDB or Apache Hadoop (Cloudera or MapR) environment to reduce the amount of data queried and backed up for critical operations. At the same time, you must provide timely access to all data and lay a foundation for data blending and analysis.

Offload Cold Data to a Cost-Effective NoSQL MongoDB or Hadoop (Cloudera or MapR) Database Using Pentaho Data Integration

Hitachi Vantara solutions and services can help you optimize your EDW environment. Cold data is offloaded to a MongoDB or Hadoop (Cloudera or MapR) database running on general-purpose servers and storage. Hot and warm data remains in your existing EDW, supported by advanced, specialized hardware. Hitachi Vantara Global Services uses a software toolkit to automatically map data between your Oracle database and MongoDB or Hadoop (Cloudera or MapR). This approach speeds the offload operation and lowers costs. It also diminishes the risk of human errors by reducing the number of manual processes by up to 90%.

By optimizing the placement of data according to cost and availability priority, this solution helps you reduce database management costs, improve data availability, and increase your overall EDW performance. Hitachi Vantara and our partners can fully manage this solution implementation, ensuring seamless deployment.

1 Source: Hortonworks Innovation and Strategy Team and Appfluent Analysis

© Hitachi Vantara Corporation 2017. All Rights Reserved

TYPICAL ORACLE ENTERPRISE DATA WAREHOUSE

(EDW) ARCHITECTURE

Enterprise Data Warehouse With Hitachi VSP,

Exadata, EMC and so on

ColdData

HotData

Systems of Record

RMDB ERP

CRM Other

DATA INTEGRATION

COLD DATA OFFLOAD

60% - 70%

Cold Data

<30%

Hot Data

HITACHI SOLUTION FOR DATABASES WITH ORACLE EDW

OPTIMIZATION

Certified MongoDB or Hadoop Appliance Cluster

with UCP and VSP

PentahoHadoop

RMDB = relational database management system, ERP = enterprise resource processing, CRM = customer relationship management, VSP = Hitachi Virtual Storage Platform,

UCP = Hitachi Unified Compute Platform

Page 3: Hitachi Solution for Databases - Optimized Enterprise Data Warehouse · Databases – Optimized Enterprise Data Warehouse Data is at the core of modern digital business, but immense

3

Integrated ManagementPentaho Data Integration lets you access both your existing EDW and a second big data environment from a single tool. Intuitive drag-and-drop integration with a graphical ETL designer simplifies data pipeline creation.

Automate schema mapping between your Oracle and MongoDB or Hadoop (Cloudera or MapR) databases, lower costs and eliminate human errors.

Pentaho Data Integration for EDW Offload: Features and Benefits

Accelerated Backup and ArchivalOptimized storage tiers let you perform backup operations on smaller subsets of data at the appropriate frequency. For example, hot data can be backed up daily, while offloaded cold data is replicated for availability using MongoDB or Hadoop (Cloudera or MapR)

Increase data availability and reduce backup and archival times.

■■ Offload cold and unused data to a cost-effective big data environment.

■■ Store all data in an orchestrated data lake.

■■ Submit queries to the data lake.

■■ Return unified data sets to users.

Extreme ScalabilityHorizontal scalability of MongoDB and Cloudera lets you offload many terabytes of cold data from existing specialized EDW servers onto cost-effective general-purpose infrastructure. Offloaded data remains readily accessible.

Adapt easily to increasing data growth while maximizing your budget.

Optimized Data PlacementPentaho Data Integration helps you divide your data into optimized storage tiers. Store hot and warm data in your existing EDW system to maximize availability and performance. Offload cold and unused data onto general-purpose servers to minimize cost.

Maximize EDW performance by limiting the amount of data to be examined.

Cost-Effective Storage TieringA second, low-cost, MongoDB or Hadoop (Cloudera or MapR) based storage tier reduces the number of EDW licenses you need to process your data. It also lets you store cold data on general-purpose infrastructure, decreasing the amount of specialized EDW infrastructure needed.

Reduce licensing and infrastructure costs for your data environment.

© Hitachi Vantara Corporation 2017. All Rights Reserved

Hitachi Solution for Databases With Optimized Oracle EDW and Hitachi Unified Compute Platform

Hadoop

Page 4: Hitachi Solution for Databases - Optimized Enterprise Data Warehouse · Databases – Optimized Enterprise Data Warehouse Data is at the core of modern digital business, but immense

Take Advantage of Proven Database Solution ExpertiseHitachi Vantara is a trusted and experienced provider of database solutions. We help you find innovative ways to achieve your business goals by focusing on the value of your data. With a global partner ecosystem, we deliver proven, high-performance, enterprise-class solutions and services, worldwide.

Using converged infrastructure, advanced software and proven database and industry expertise, we can help your organization develop and execute a data management strategy. Optimize your database environment, realize the full value of your data, and pave the way for business innovation. At Hitachi Vantara, we can help you get there faster.

4

Page 5: Hitachi Solution for Databases - Optimized Enterprise Data Warehouse · Databases – Optimized Enterprise Data Warehouse Data is at the core of modern digital business, but immense

HITACHI is a trademark or registered trademark of Hitachi, Ltd. VSP is a trademark or registered trademark of Hitachi Vantara Corporation. All other trademarks, service marks, and company names are properties of their respective owners.

SP-270-B BTD March 2019

Hitachi Vantara

Corporate Headquarters 2535 Augustine Drive Santa Clara, CA 95054 USA HitachiVantara.com | community.HitachiVantara.com

Contact InformationUSA: 1-800-446-0744Global: 1-858-547-4526HitachiVantara.com/contact

Next Steps

Find out more about Hitachi Solution for Databases here.

Hitachi Vantara at a Glance Your data is the key to new revenue, better customer experiences and lower costs. With technology and expertise, Hitachi Vantara drives data to meaningful outcomes.

Learn more about the Hitachi Solution for Databases and related Hitachi Vantara solutions and services in the following guides.

Hitachi Solution for Databases in an Enterprise Data Warehouse Offload Package for Oracle Database Reference Architecture.

Hitachi Solution for Enterprise Data Intelligence with MongoDB Reference Architecture Guide.