oracle nosql database...running hadoop in nosql database (interactive slide) steps to run a hadoop...

Post on 25-May-2020

18 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Oracle NoSQL Database

Integrating Apache Hadoop with Oracle NoSQL Database

Prathima Trivedi

PROPERTIES

Allow user to leave interaction: Anytime

Show ‘Next Slide’ Button: Don't show

Completion Button Label: Next Slide

PROPERTIES

Allow user to leave interaction: Anytime

Show ‘Next Slide’ Button: Don't show

Completion Button Label: Next Slide

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Road Map

Introduction

to Hadoop

Benefits of

Integration

NoSQL

Database

APIs

Case Study:

Tunes.fm

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Road Map

Introduction

to Hadoop

Benefits of

Integration

NoSQL

Database

APIs

Case Study:

Tunes.fm

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Background

NoSQL

Database

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Hadoop Framework

• Is a Java framework for processing large sets of data

• Is used for a variety of analytics

• Incorporates MapReduce functionality to segregate the

data

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

MapReduce Technique

• Divides the application into many small blocks of work

• Processes the data that is located in NoSQL Database

• Has two phases:

– Map phase

– Reduce phase

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Road Map

Introduction

to Hadoop

Benefits of

Integration

NoSQL

Database

APIs

Case Study:

Tunes.fm

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

NoSQL Database Support to Connect with Hadoop

NoSQL Database acts as the input or output for Hadoop.

NoSQL Database NoSQL Database

Input Data Output Data

Hadoop Cluster

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Example: Scenario

NoSQL Database

Commodity

Customer profile

Preferences

Views

Transactions

Online Display Advertising

MapReduce Job

Search preferences

Prediction

Additional services

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Example: Business Conclusions

NoSQL Database

Commodity

Customer profile

Preferences

Views

Transactions

Online Display Advertising

MapReduce Job

Search preferences

Prediction

Additional services

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Road Map

Introduction

to Hadoop

Benefits of

Integration

NoSQL

Database

APIs

Case Study:

Tunes.fm

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

oracle.kv.hadoop Package

oracle.kv.hadoop consists of two classes:

• KVInputFormat

• KVAvroInputFormat

Producer Note: Convert to interactive slide

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Road Map

Introduction

to Hadoop

Benefits of

Integration

NoSQL

Database

APIs

Case Study:

Tunes.fm

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Case Study: Tunes.fm

Data collected:

• Customer Schema

– Age

– Gender

– Location

– Date

• Song Schema

– Track title

– Artist

– Number of times a

track is played

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Generating Charts

Track ID 733143757 is played 154 times and is the most

frequently streamed track.

Track ID Number of times that

the track is played

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Hardware and Software Requirements (Interactive Slide)

• Hardware Requirements:

– Disk space

– RAM above 2 GB

• Software Requirements:

– Linux-based operating system

– NoSQL Database

– Oracle Java Development Kit 1.6 or later

– Hadoop 2.2

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Running Hadoop in NoSQL Database (Interactive Slide)

Steps to run a Hadoop operation in NoSQL Database:

1. Start Kvlite.

2. Load data into NoSQL Database.

3. Start the Hadoop interface.

4. Run the MapReduce job.

5. Display the output.

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Demonstration (Interactive Slide)

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Course Summary

Introduction

to Hadoop

Benefits of

Integration

NoSQL

Database

APIs

Case Study:

Tunes.fm

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Quiz

Oracle NoSQL Database provides critical linear scaling and

extremely low latency for those operations closest to the user’s

browser.

a. True

b. False

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Quiz

The KVInputFormat class enables you to modify data in

NoSQL Database.

a. True

b. False

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Quiz

NoSQL Database provides high availability of data storage.

a. True

b. False

PROPERTIES

Allow user to leave interaction: Anytime

Show ‘Next Slide’ Button: Don't show

Completion Button Label: Next Slide

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

Credits

Author and Narrator Prathima Trivedi

Technical Contributor and Reviewer

Anuj Sahni

Editor

Susan Moxley

Producer

Swarnapriya Shridhar

Thanks for watching!

Credits

Copyright © 2014, Oracle and/or its affiliates. All rights reserved.

top related