Download - Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Large-Scale Data and Computation

Yifu Huang

School of Computer Science, Fudan [email protected]

COMP620003 Advanced Computer Networks Report, 2013

Yifu Huang (FDU CS) COMP620003 Report 2013/12/5 1 / 27

Page 2: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Outline

1 Motivation

2 GFS

3 MapReduce

4 Bigtable

5 Discussion

Page 3: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Motivation

Why we choose these papers?

Big dataGoogle

Why we need GFS, MapReduce, Bigtable?

A scalable distributed file system for large distributed data-intensiveapplicationsA programming model for processing and generating large datasets thatis amenable to a broad variety of real-world tasksA distributed storage system for managing structured data that isdesigned to scale to a very large size

What can we learn from these papers?

The design and implementation ideas behind GFS, MapReduce,BigtableGet ready to enjoy open source equivalents HDFS, YARN, HBase

Page 4: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Motivation

Big dataGoogle

Page 5: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Motivation

Big dataGoogle

Page 6: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Motivation

Ecosystem

Page 7: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

The Google File System

Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung

Google, Inc.

Page 8: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Design Overview

The system is built from many inexpensive commodity componentsthat often failThe system stores a modest number of large filesThe workloads primarily consist of two kinds of reads: large streamingreads and small random readsThe workloads also have many large, sequential writes that appenddata to filesThe system must efficiently implement well-defined semantics ormultiple clients that concurrently append to the same fileHigh sustained bandwidth is more important than low latency

Page 9: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Design Overview

Page 10: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Design Overview

Page 11: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Design Overview

Page 12: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Design Overview

Page 13: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Design Overview

Page 14: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Design Overview (cont.)

Architecture

Page 15: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Chunk Size: 64 MBMetadata

The file and chunk namespacesThe mapping from files to chunksThe locations of each chunk’s replicas

Chunk LocationsOperation LogConsistency Model

Guarantees by GFSImplications for Applications

Page 16: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Page 17: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Page 18: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Page 19: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Page 20: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

System Interactions

Leases and Mutation Order

Page 21: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

System Interactions (cont.)

Data Flow

We decouple the flow of data from the flow of control to use thenetwork efficientlyOur goals are to fully utilize each machine’s network bandwidth, avoidnetwork bottlenecks and high-latency links, and minimize the latency topush through all the data

Atomic Record Appends

GFS provides an atomic append operation called record append

Snapshot

The snapshot operation makes a copy of a file or a directory treealmost instantaneously, while minimizing any interruptions of ongoingmutations

Page 22: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Data Flow

Snapshot

Page 23: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Data Flow

Snapshot

Page 24: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

Fault Tolerance And Diagnosis

High Availability

Fast RecoveryChunk ReplicationMaster Replication

Data IntegrityDiagnostic Tools

Page 25: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

High Availability

Page 26: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

GFS

High Availability

Page 27: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

MapReduce

MapReduce: Simplied Data Processing on Large Clusters

Jeffrey Dean and Sanjay Ghemawat

Google, Inc.

Page 28: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

MapReduce

Model

Map

Takes an input pair and produces a set of intermediate key/value pairsThe MapReduce library groups together all intermediate valuesassociated with the same intermediate key and passes them to thereduce function

Reduce

Accepts an intermediate key and a set of values for that key andmerges these values together to form a possibly smaller set of values

Page 29: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

MapReduce

Model

Map

Takes an input pair and produces a set of intermediate key/value pairsThe MapReduce library groups together all intermediate valuesassociated with the same intermediate key and passes them to thereduce function

Reduce

Accepts an intermediate key and a set of values for that key andmerges these values together to form a possibly smaller set of values

Page 30: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

MapReduce

Implementation

User ProgramMasterWorker (Map/Reduce)Fault toleranceLocalityTask granularityBackup tools

Page 31: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

MapReduce

Performance

Grep

Page 32: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

MapReduce

Performance (cont.)

Sort

Page 33: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Bigtable: A Distributed Storage System for Structured Data

Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A.Wallach Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber

Google, Inc.

Page 34: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Model

Rows

AtomicTablets

Column Families

Family : qualifierAccess control

Timestamps

Decreasing orderGarbage collect

Cell

(row : string, column : string, time : int64) -> string

Page 35: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Model

Rows

AtomicTablets

Column Families

Timestamps

Cell

Page 36: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Model

Rows

AtomicTablets

Column Families

Timestamps

Cell

Page 37: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Model

Rows

AtomicTablets

Column Families

Timestamps

Cell

Page 38: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Model (cont.)

Page 39: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Storage

Region–Store—memStore—StoreFileFile–BlocksLog–Write ahead log

Page 40: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Implementation

Tablet Location

Page 41: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Implementation (cont.)

Tablet Serving

Page 42: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Evaluation

Benchmarks

Page 43: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Evaluation (cont.)

Scaling

Page 44: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Bigtable

Evaluation (cont.)

Applications

Page 45: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Discussion

Contributions of GFS, MapReduce, Bigtable

Supporting large-scale data processing workloads on commodityhardwareEasy to use, hides details of parallelization, fault tolerance, localityoptimization, and load balancingScale well both in terms of data size (from URLs to web pages tosatellite imagery) and latency requirements (from backend bulkprocessing to real-time data serving)

Drawbacks of GFS, MapReduce, Bigtable

Single master may be still a potential bottleneckDisk I/O, time consumingDo not support many database operations (multi-row transaction,secondary index . . . )

Page 46: Large-Scale Data and Computation - Fudan Universityadmis.fudan.edu.cn/~yfhuang/files/LSDC_slide.pdfLarge-Scale Data and Computation YifuHuang School of Computer Science, Fudan University

Discussion

Contributions of GFS, MapReduce, Bigtable

Supporting large-scale data processing workloads on commodityhardwareEasy to use, hides details of parallelization, fault tolerance, localityoptimization, and load balancingScale well both in terms of data size (from URLs to web pages tosatellite imagery) and latency requirements (from backend bulkprocessing to real-time data serving)

Drawbacks of GFS, MapReduce, Bigtable

Single master may be still a potential bottleneckDisk I/O, time consumingDo not support many database operations (multi-row transaction,secondary index . . . )