topic chosen by: q ing sun harini chilamantula sivagami nachiyappan

12
Research Paper Title: An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics Author : Taylor, Ronald C Published at: The 11th Annual Bioinformatics Open Source Conference (BOSC) 2010 Boston, MA, USA. 9-10 July 2010 Academic Journal, BMC Bioinformatics. DOI: 10.1186/1471-2105-11-S12-S1. Academic Search Topic Chosen by: Qing Sun Harini Chilamantula Sivagami Nachiyappan 05/16/22

Upload: shaina

Post on 22-Jan-2016

27 views

Category:

Documents


0 download

DESCRIPTION

Topic Chosen by: Q ing Sun Harini Chilamantula Sivagami Nachiyappan. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Research Paper Title: An overview of the Hadoop/MapReduce/HBase framework

and its current applications in bioinformatics

Author : Taylor, Ronald C

Published at:The 11th Annual Bioinformatics Open Source Conference

(BOSC) 2010 Boston, MA, USA. 9-10 July 2010

Academic Journal, BMC Bioinformatics.DOI: 10.1186/1471-2105-11-S12-S1.

Academic Search Topic Chosen by:

Qing SunHarini Chilamantula

Sivagami Nachiyappan

04/21/23

Page 2: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Background Information

Nowadays, data centers are consuming a lot of energy for big data to

store and to maintain, but not in an efficient fashion.

There are several types of waste at different levels

o space for keeping large servers (data centers),

o effort for maintain,

o infrastructure,

o machine,

o system level waste (resource waste) .

Page 3: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

About Hadoop

Hadoop Map/Reduce is a software framework for easily writing applications which process vast amount of data in parallel on large clusters of commodity hardware.

Hadoop is a large scale distributed file system modeled after the Google File System.

The key feature of Hadoop is fault- tolerance to the hardware failures.

By using Hadoop, we can run terabytes of data and applications on thousands of nodes in the network.

Hadoop implements MapReduce, a programming model, using the Hadoop Distributed File System (HDFS).

Page 4: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

MapReduce in Big Data Analysis

MapReduce is used to divide the large applications into small blocks and distribute them to the other nodes in the network.

Master node will collect all the solutions back.

Page 5: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Benefits of Hadoop

Easily process vast amount of data in parallel on large clusters

It provides more scalability

Volume – Terabytes, petabytes and beyond.

Velocity – Speed access, Real-Time Data Analytics.

Variety – Centralized (Data moves to Analytics), Distributed (Analytics

moves to Data).

Value – Graph Algorithm, predictive Machine Learning, Commodity

Hardware.

Page 6: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Hadoop is already a key to delivering

on the promise of bioinformatics.

The Hadoop is also in the process of

providing a platform in which it is

easy to analyze and integrate the

various large, disparate data sources

into one data warehouse.

In near feature Hadoop hold some

more incridible contributions,

regarding store and process complex

data

It would have made easy to

understand if they have any

Pictorial representations.

When you have OLTP needs.

MR is not suitable for a large

number of short on-line

transactions .

Using MR is time consuming.

Conclusion Suggestions

Page 7: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Term Project Proposal

Title: APP’s Search Application

Team Members: Qing Sun

Harini Chilamantula Sivagami Nachiyappan

Faculty Advisor – Dr. Meiliu LuCSC Department – Fall 2013

California State University, Sacramento

Page 8: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Project Motivations

This search application is designed for iphone, Android, Windows, Tizen, VBM, Meego and ipad apps

It includes utility applications, performance applications, gamming applications.

It is a Search- based application for an app.

It is the faster and easier way to search different Operating System supportable apps at one common place.

Page 9: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Goals of Project

This application can display the requirements of a user required app and can also connect user to appropriate web page to download the app from the app store.

By using app search application user can know the features, details, how to use and availability of particular apps.

Page 10: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

How we reach our goals for a successful project

Design an Online Transactional Processing (OLTP) application to get details of various applications from their respective websites and display the result of the query in a webpage.

Design an Online Analytical Processing (OLAP) to integrate the data from various data sources, create our own data mart and display the results of the customer’s query on a webpage.

Extract data from various data sources, transform the data and present the data.

Page 11: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

Project Schedule

WEEK OBJECTIVES

10 Project Proposal

11 Group Task Assignments / Data collection

12 Progress Peport / Design Task

13 Create Presentation Slides / Build initial website

14 Presentation Practice / Project Presentation

15 Prepare Final Written Repoet

Page 12: Topic Chosen by: Q ing  Sun Harini Chilamantula Sivagami Nachiyappan

References:

http://spectrum.ieee.org/automaton/robotics/robotics-software/cloud-robotics

https://developers.facebook.com/docs/guides/appcenter/

http://apphelp.copilotlive.com/copilot/en-US/?platform=android

http://copilotlive.com/us/personal/android.asp

http://copilotlive.com/us/store/android.asp

http://www.biomedcentral.com/1471-2105/11/S12/S1

http://link.springer.com/article/10.1186%2F1471-2105-11-S12-S1

http://www.roadsideamerica.com/mobile/roadside/ios/

http://www.roadsideamerica.com/mobile/roadside/ios/faq

http://stackoverflow.com/questions/18585839/what-are-the-disadvantages-of-mapreduce