
Pentaho Data Integration Advanced DI1500

COURSE DESCRIPTION

This course is designed to build upon fundamental knowledge of Pentaho Data Integration (PDI). Moving beyond the basics of creating transformations and jobs, you will learn how to use PDI in real-world project scenarios. You will add PDI as a data source for a variety of visualization options, utilize PDI’s streaming data processing capabilities, build transformations with metadata injection, and scale and tune the performance of the PDI solution.

This course focuses heavily on labs to allow practical hands-on application of the topics covered in each section.

COURSE OBJECTIVES

When you complete this course, you should be able to:

● Reduce manual tasks by harnessing the power of metadata injection

● Use PDI as a data source for CDA, Data Services, Snowflake, Google BigQuery and machine learning applications

● Utilize PDI’s streaming data processing capabilities with MQTT, Kafka and Amazon Kinesis data streams

● Scale PDI by using Carte clustering, monitoring, and partitioning

● Tune PDI with checkpoints and logging

COURSE OUTLINE

Content Modules

● Metadata Injection

● PDI as an Enterprise Data Hub

● Data Streaming

● Scaling Your Enterprise Solution

Learning Activities – Labs

● Static Metadata Injection

● Standard Metadata Injection

● Metadata Injection (Push-Pull Modes)

● 2-Phase Metadata Injection

● Using Filters in Metadata Injection

Delivery Type

● Instructor-led Training (ILT)

● Virtual Instructor-led (vILT)

Duration

● 2 days

Course Availability

● Employees

● Customers

● Partners

Target Audience

● Solution Architects

● Data Analysts

Required Knowledge and Skills

● Good working knowledge of Pentaho Data Integration

Prerequisite Courses

● DI1000 - Pentaho Data Integration Fundamentals

Supplemental Courses

● DI2000 - Pentaho Data Integration with Hadoop

Hitachi Vantara
Corporate Headquarters
2535 Augustine Drive
Santa Clara, CA 95054 USA
hitachivantara.com | community.hitachivantara.com

Contact Information
USA: 1-800-446-0744
Global: 1-858-547-4526
hitachivantara.com/contact

HITACHI is a registered trademark of Hitachi, Ltd. VSP is a trademark or registered trademark of Hitachi Vantara LLC. Microsoft, Azure and Windows are trademarks or registered trademarks of Microsoft Corporation. All other trademarks, service marks and company names are properties of their respective owners. CD-DI1500 ANT May 2020

To register or for more information:

Hitachi Vantara Learning Center (customers/partners)

Hitachi University (employees)

Learning Activities – Labs (continued)

● CDA Datasource

● Data Services

● Connect to a Snowflake Database

● Google BigQuery

● Pentaho Data Integration and Google BigQuery

● Hello World – R Script Using R Studio

● Credit Card Fraud

● MQTT – Mosquitto Service

● MQTT – Sensor Data (IoT)

● Services – Zookeeper and Kafka

● Kafka – SensorData

● Amazon Kinesis Data Streams

● Master and Slave Servers

● Clustering and Group By

● Stream Partitioning

● Checkpoints

Become a Part of the Community

Join the Conversation

Ask questions and connect with other Hitachi Vantara customers, partners and employees within the Hitachi Vantara Community.