Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Oracle NoSQL Database

Integrating Apache Hadoop with Oracle NoSQL Database

Prathima Trivedi

Road Map

• Introduction to Hadoop

• Benefits of Integration

• NoSQL Database APIs

• Case Study: Tunes.fm

Background

NoSQL Database

Hadoop Framework

• Is a Java framework for processing large sets of data

• Is used for a variety of analytics

• Incorporates MapReduce functionality to divide the data for parallel processing

MapReduce Technique

• Divides the application into many small blocks of work

• Processes the data that is located in NoSQL Database

• Has two phases:

– Map phase

– Reduce phase
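
The two phases above can be illustrated with a minimal in-memory sketch. It is plain Python, not the Hadoop API, and the play events, function names, and track IDs are hypothetical stand-ins for records that a real job would read from NoSQL Database.

```python
from collections import defaultdict

# Hypothetical play events (track ID, listener); stand-ins for stored records.
PLAY_EVENTS = [
    ("track-a", "user-1"),
    ("track-b", "user-2"),
    ("track-a", "user-3"),
    ("track-c", "user-1"),
    ("track-a", "user-2"),
]


def map_phase(event):
    """Map: emit one (key, value) pair per input record."""
    track_id, _listener = event
    yield track_id, 1


def shuffle(pairs):
    """Group values by key, as a framework does between the two phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)


def run_job(events):
    """Run map, shuffle, and reduce over the events in a single process."""
    mapped = [pair for event in events for pair in map_phase(event)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


play_counts = run_job(PLAY_EVENTS)
print(play_counts)
```

In a real cluster the map calls run in parallel on separate blocks of input and the framework performs the grouping step; the sketch keeps everything in one process only to show the data flow.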

NoSQL Database Support to Connect with Hadoop

NoSQL Database acts as the input or output for Hadoop.

[Diagram: NoSQL Database supplies input data to the Hadoop Cluster, and the Hadoop Cluster writes output data to NoSQL Database]

Example: Scenario

[Diagram labels: NoSQL Database; Commodity; Customer profile; Preferences; Views; Transactions; Online Display Advertising; MapReduce Job; Search preferences; Prediction; Additional services]

Example: Business Conclusions

[Diagram labels: NoSQL Database; Commodity; Customer profile; Preferences; Views; Transactions; Online Display Advertising; MapReduce Job; Search preferences; Prediction; Additional services]

oracle.kv.hadoop Package

oracle.kv.hadoop consists of two classes:

• KVInputFormat

• KVAvroInputFormat

Case Study: Tunes.fm

Data collected:

• Customer Schema

– Age

– Gender

– Location

– Date

• Song Schema

– Track title

– Artist

– Number of times a track is played

Generating Charts

Track ID 733143757 is played 154 times and is the most frequently streamed track.

[Chart: Track ID versus number of times that the track is played]
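
Picking the most frequently streamed track from per-track play counts can be sketched as follows. Only the first entry (733143757 with 154 plays) comes from the slide; the other track IDs and counts, and the helper name `most_played`, are hypothetical and exist only to make the example runnable.

```python
# Per-track play counts. Only the first entry comes from the slide;
# the other track IDs and counts are hypothetical filler.
play_counts = {
    "733143757": 154,
    "hypothetical-track-1": 97,
    "hypothetical-track-2": 42,
}


def most_played(counts):
    """Return the (track_id, plays) pair with the highest play count."""
    if not counts:
        raise ValueError("no play counts supplied")
    return max(counts.items(), key=lambda item: item[1])


top_track, top_plays = most_played(play_counts)
print(f"Track ID {top_track} is played {top_plays} times")
```

A chart like the one on this slide would plot every entry of `play_counts`; the helper only identifies the tallest bar.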

Hardware and Software Requirements (Interactive Slide)

• Hardware Requirements:

– Disk space

– RAM above 2 GB

• Software Requirements:

– Linux-based operating system

– NoSQL Database

– Oracle Java Development Kit 1.6 or later

– Hadoop 2.2

Running Hadoop in NoSQL Database (Interactive Slide)

Steps to run a Hadoop operation in NoSQL Database:

1. Start KVLite.

2. Load data into NoSQL Database.

3. Start the Hadoop interface.

4. Run the MapReduce job.

5. Display the output.

Demonstration (Interactive Slide)

Course Summary

• Introduction to Hadoop

• Benefits of Integration

• NoSQL Database APIs

• Case Study: Tunes.fm

Quiz

Oracle NoSQL Database provides critical linear scaling and extremely low latency for those operations closest to the user’s browser.

a. True

b. False

Quiz

The KVInputFormat class enables you to modify data in NoSQL Database.

a. True

b. False

Quiz

NoSQL Database provides high availability of data storage.

a. True

b. False

Credits

Author and Narrator: Prathima Trivedi

Technical Contributor and Reviewer: Anuj Sahni

Editor: Susan Moxley

Producer: Swarnapriya Shridhar

Thanks for watching!
