Post on 06-Jan-2018
Page 1
Go Over
Minqi Zhou (周敏奇), mqzhou@sei.ecnu.edu.cn
Room 111 (East) Mathematics Building, 021-32204750-167
Distributed Systems
Page 2
Overview
• Why distributed systems
• Naming
• Communication
• Synchronization
• Security
Page 3
What can we do now that we could not do before?
Page 4
Technology advances
• Processors
• Memory
• Networking
• Storage
• Protocols
Page 5
Building and classifying distributed systems
Page 6
Flynn’s Taxonomy (1972)
Classifies architectures by the number of instruction streams and the number of data streams:
• SISD – traditional uniprocessor system
• SIMD – array (vector) processor. Examples:
  – APU (attached processor unit in the Cell processor)
  – SSE3: Intel’s Streaming SIMD Extensions
  – PowerPC AltiVec (Velocity Engine)
• MISD – generally not used and doesn’t make sense; sometimes applied to classifying redundant systems
• MIMD – multiple computers, each with its own program counter, program (instructions), and data
  – covers parallel and distributed systems
Page 7
Subclassifying MIMD
Memory:
– shared memory systems: multiprocessors
– no shared memory: networks of computers, multicomputers
Interconnect:
– bus
– switch
Delay/bandwidth:
– tightly coupled systems
– loosely coupled systems
Page 8
You know you have a distributed system when the crash of a computer you’ve never heard of stops you from getting any work done.
– Leslie Lamport
Page 9
Coupling
Tightly versus loosely coupled software
Tightly versus loosely coupled hardware
Page 10
Design issues: Transparency
High level: hide distribution from users
Low level: hide distribution from software
– Location transparency: users don’t care where resources are
– Migration transparency: resources may move at will
– Replication transparency: users cannot tell whether there are copies of resources
– Concurrency transparency: users share resources transparently
– Parallelism transparency: operations take place in parallel without the user’s knowledge
Page 11
Design issues
Reliability
– Availability: fraction of time the system is usable
  • achieve with redundancy
– Reliability: data must not get lost
  • includes security
Performance
– The communication network may be slow and/or unreliable
Scalability
– Distributable vs. centralized algorithms
– Can we take advantage of having lots of computers?
Page 12
Service Models
Page 13
Centralized model
• No networking
• Traditional time-sharing system
• Direct connection of user terminals to the system
• One or several CPUs
• Not easily scalable
• Limiting factor: number of CPUs in the system
  – contention for the same resources
Page 14
Client-server model
The environment consists of clients and servers.
• Service: a task a machine can perform
• Server: a machine that performs the task
• Client: a machine that requests the service
Typical servers: directory server, print server, file server, each serving multiple clients.
Workstation model: assume each client is used by one user at a time.
Page 15
Peer-to-peer model
• Each machine on the network has (mostly) equivalent capabilities
• No machines are dedicated to serving others
• E.g., a collection of PCs:
  – access other people’s files
  – send/receive email (without a server)
  – Gnutella-style content sharing
  – SETI@home computation
Page 16
Processor pool model
What about idle workstations (computing resources)?
– Let them sit idle
– Run jobs on them
Alternatively…
– A collection of CPUs that can be assigned to processes on demand
– Users won’t need heavy-duty workstations
  • GUI on the local machine
– Computation model of Plan 9
Page 17
Grid computing
Provide users with seamless access to:
– Storage capacity
– Processing
– Network bandwidth
Heterogeneous and geographically distributed systems.
Page 18
Naming
Page 19
Naming things
• User names
  – login, email
• Machine names
  – rlogin, email, web
• Files
• Devices
• Variables in programs
• Network services
Page 20
Naming Service
Allows you to look up names
– Often returns an address as the response
Might be implemented as:
– a search through a file
– a client-server program
– a database query
– …
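At its simplest, a naming service is a lookup table mapping names to addresses. A minimal sketch (the class and the registered names below are hypothetical, standing in for any of the implementations listed above):

```python
# Toy naming service sketch: maps human-readable names to addresses.
# The class and the registered name "printsvc" are illustrative only.
class NameService:
    def __init__(self):
        self._table = {}

    def register(self, name, address):
        self._table[name] = address

    def lookup(self, name):
        # Returns an address as the response, or None if unbound.
        return self._table.get(name)

ns = NameService()
ns.register("printsvc", ("10.0.0.7", 515))
print(ns.lookup("printsvc"))    # → ('10.0.0.7', 515)
```

A real implementation would back the table with a file, a database, or a client-server protocol, but the interface stays the same.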
Page 21
What’s a name?
• Name: identifies what you want
• Address: identifies where it is
• Route: identifies how to get there
• Binding: associates a name with an address
  – “choose a lower-level implementation for a higher-level semantic construct”
RFC 1498: Inter-network Naming, Addresses, Routing
Page 22
Names
Need names for:
– Services: e.g., time of day
– Nodes: a computer that can run services
– Paths: a route
– Objects within a service: e.g., files on a file server
A naming convention can take any format
– ideally one that suits the application and the user
– e.g., human-readable names for humans, binary identifiers for machines
Page 23
Flat naming
Problem: given an essentially unstructured name (e.g., an identifier), how can we locate its associated access point?
Approaches:
– Simple solutions (broadcasting)
– Home-based approaches
– Distributed Hash Tables (structured P2P)
– Hierarchical location services
Page 24
RPC
Page 25
Problems with sockets
The sockets interface is straightforward:
– [connect]
– read/write
– [disconnect]
BUT … it forces a read/write mechanism, while we usually program with procedure calls.
To make distributed computing look more like centralized computing, I/O is not the way to go.
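The read/write style being criticized can be seen in a minimal sketch; here `socketpair` stands in for a real connect/accept exchange over a network:

```python
import socket

# Sketch of the raw read/write style: the caller must frame every
# request and reply as bytes itself. socketpair() stands in for a
# real connect()/accept() pair over a network.
client, server = socket.socketpair()
client.sendall(b"what's the time?")   # write
request = server.recv(1024)           # read
server.sendall(b"3:42:19")
reply = client.recv(1024)
client.close()
server.close()
print(reply.decode())                 # → 3:42:19
```

Nothing here looks like calling a function; the programmer manages buffers and message boundaries by hand, which is exactly what RPC hides.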
Page 26
RPC
1984: Birrell & Nelson
– a mechanism to call procedures on other machines
Remote Procedure Call
Goal: it should appear to the programmer that a normal call is taking place.
Page 27
Implementing RPC
The trick:
Create stub functions to make it appear to the user that the call is local.
The stub function contains the function’s interface.
Page 28
Figure: RPC call path. The client code calls the client stub, which marshals the arguments and passes them to the network routines; the server’s network routines deliver the message to the server stub (skeleton), which unmarshals it and calls the server function. The return value is marshaled and returned to the client code along the same path.
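The stub idea can be sketched in a few lines (all names are hypothetical; a byte string stands in for the network hop):

```python
import json

def server_add(a, b):
    # The real server function.
    return a + b

def server_dispatch(message):
    # Server stub (skeleton): unmarshal, call the function, marshal.
    call = json.loads(message.decode())
    result = {"add": server_add}[call["fn"]](*call["args"])
    return json.dumps({"result": result}).encode()

def client_stub_add(a, b):
    # Client stub: marshal the arguments, "send" them, unmarshal the reply.
    message = json.dumps({"fn": "add", "args": [a, b]}).encode()
    reply = server_dispatch(message)   # stands in for the network hop
    return json.loads(reply.decode())["result"]

print(client_stub_add(2, 3))   # → 5, looks like a local call
```

From the caller's point of view `client_stub_add(2, 3)` is an ordinary function call; the marshaling and the "network" are hidden inside the stubs.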
Page 29
Parameter passing
• Pass by value
  – easy: just copy the data into the network message
• Pass by reference
  – makes no sense without shared memory
Page 30
Representing data
There are no incompatibility problems on a local system, but a remote machine may have:
– different byte ordering
– different sizes of integers and other types
– different floating-point representations
– different character sets
– different alignment requirements
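The byte-ordering problem in miniature, using Python's `struct` module: the same 32-bit integer serialized in big-endian (network) and little-endian order:

```python
import struct

# One source of incompatibility: the same 32-bit integer has
# different byte layouts on big-endian and little-endian machines.
n = 0x01020304
big = struct.pack(">I", n)      # network (big-endian) byte order
little = struct.pack("<I", n)   # little-endian (e.g., x86) byte order
print(big.hex(), little.hex())  # → 01020304 04030201
```

This is why RPC marshaling layers define a canonical wire format (e.g., network byte order) instead of copying raw memory.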
Page 31
Concurrency
Page 32
Schedules
• Transactions must be scheduled so that the data is serially equivalent
• Use mutual exclusion to ensure that only one transaction executes at a time
• or…
• Allow multiple transactions to execute concurrently
  – but ensure serializability
  – concurrency control
• Schedule: a valid order of interleaving
Page 33
Methods
• Two-phase locking
• Strict two-phase locking
• Read/write locks
• Two-version locking
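A minimal two-phase-locking sketch (illustrative only, not a lock manager): a transaction acquires all its locks before releasing any, which is what guarantees a serializable schedule.

```python
import threading

# Two-phase locking sketch: growing phase (acquire everything),
# then the transaction body, then shrinking phase (release everything).
locks = {"x": threading.Lock(), "y": threading.Lock()}

def transfer(data, amount):
    held = []
    for name in sorted(locks):    # fixed acquisition order avoids deadlock
        locks[name].acquire()     # growing phase
        held.append(name)
    data["x"] -= amount           # the transaction body
    data["y"] += amount
    for name in held:
        locks[name].release()     # shrinking phase: no more acquires

data = {"x": 100, "y": 0}
transfer(data, 10)
print(data)                       # → {'x': 90, 'y': 10}
```

Strict two-phase locking additionally holds all locks until commit, so other transactions never see uncommitted data.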
Page 34
Synchronization
Page 35
Physical clocks in computers
Real-time clock: a CMOS clock (counter) circuit driven by a quartz oscillator
– battery backup to continue measuring time when the power is off
The OS generally programs a timer circuit to generate an interrupt periodically
– e.g., 60, 100, 250, or 1000 interrupts per second (Linux 2.6+ adjustable up to 1000 Hz)
– Programmable Interval Timer (PIT): Intel 8253, 8254
– The interrupt service procedure adds 1 to a counter in memory
Page 36
Problem
Getting two systems to agree on time:
– Two clocks hardly ever agree
– Quartz oscillators oscillate at slightly different frequencies
Clock drift: clocks tick at different rates, creating an ever-widening gap in perceived time.
Clock skew: the difference between two clocks at one point in time.
Page 37
RPC
The simplest synchronization technique:
– issue an RPC to obtain the time
– set the local time to the reply
Does not account for network or processing latency.
(Client asks the server “what’s the time?”; the server replies “3:42:19”.)
Page 38
Cristian’s algorithm
Compensate for delays
– Note the times:
  • request sent: T0
  • reply received: T1
– Assume network delays are symmetric
Figure: timeline with the client sending a request at T0, the server stamping the reply at Tserver, and the reply arriving at T1.
Page 39
Cristian’s algorithm
The client sets its time to:
Tnew = Tserver + (T1 - T0)/2
where (T1 - T0)/2 is the estimated overhead in each direction.
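The rule in code (times in milliseconds; the numbers are made up for illustration):

```python
# Cristian's algorithm: set the clock to the server's timestamp plus
# half the measured round-trip time (assumes symmetric delays).
def cristian(t0, t1, t_server):
    rtt = t1 - t0
    return t_server + rtt / 2

# Request sent at t=0 ms, reply received at t=40 ms,
# server stamped the reply at t=25 ms.
print(cristian(0, 40, 25))   # → 45.0
```

The 20 ms correction is the estimated one-way delay; if delays are asymmetric, the result is off by half the asymmetry.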
Page 40
Time synchronization
• Berkeley algorithm
• NTP
• SNTP
Page 41
Logical clocks
Assign sequence numbers to messages
– All cooperating processes can agree on the order of events
– vs. physical clocks, which track the time of day
Assume no central time source
– Each system maintains its own local clock
– No total ordering of events
  • no concept of happened-when
Page 42
Happened-before
Lamport’s “happened-before” notation:
a → b: event a happened before event b
e.g., a: a message being sent, b: receipt of that message
Transitive: if a → b and b → c, then a → c
Page 43
Lamport’s algorithm
• Each message carries a timestamp of the sender’s clock
• When a message arrives:
  – if the receiver’s clock < the message timestamp:
    set the system clock to (message timestamp + 1)
  – else do nothing
• The clock must be advanced between any two events in the same process
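The rules above, sketched as a small class (names are illustrative):

```python
# Lamport clock sketch: the clock advances on every local event, and
# on message arrival it jumps past the message's timestamp if behind.
class LamportClock:
    def __init__(self):
        self.time = 0

    def event(self):
        # Advance the clock between any two events (includes sends).
        self.time += 1
        return self.time

    def on_receive(self, msg_time):
        # If the receiver's clock < message timestamp,
        # set it to (message timestamp + 1); else do nothing.
        if self.time < msg_time:
            self.time = msg_time + 1
        return self.time

p1, p2 = LamportClock(), LamportClock()
ts = p1.event()            # P1 sends a message stamped 1
print(p2.on_receive(ts))   # → 2: P2's clock jumps past the timestamp
```
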
Page 44
Lamport’s algorithm
The algorithm allows us to maintain a time ordering among related events
– a partial ordering
Page 45
Event counting example
Figure: three processes P1, P2, and P3 exchanging messages; events a through k are labeled with Lamport timestamps 1 through 7, with a receiver’s clock jumping forward whenever a message arrives bearing a higher timestamp.
Page 46
Problem: Detecting causal relations
If L(e) < L(e′), we cannot conclude that e → e′.
From Lamport timestamps alone we cannot conclude which events are causally related.
Solution: use a vector clock.
Page 47
Vector clocks
Rules:
1. The vector is initialized to 0 at each process: Vi[j] = 0 for i, j = 1, …, N
2. A process increments its own element of the vector before timestamping an event: Vi[i] = Vi[i] + 1
3. A message is sent from process Pi with Vi attached to it
4. When Pj receives a message, it compares the vectors element by element and sets its local vector to the higher of the two values: Vj[i] = max(Vi[i], Vj[i]) for i = 1, …, N
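The four rules sketched in code (here the receive is also counted as a local event, a common convention the rules leave implicit):

```python
# Vector clock sketch following rules 1-4 above.
class VectorClock:
    def __init__(self, pid, n):
        self.pid = pid
        self.v = [0] * n                    # rule 1: all zeros

    def event(self):
        self.v[self.pid] += 1               # rule 2: bump own element
        return list(self.v)

    def send(self):
        return self.event()                 # rule 3: attach a copy

    def receive(self, other):
        # rule 4: element-wise maximum of the two vectors.
        self.v = [max(a, b) for a, b in zip(self.v, other)]
        self.event()                        # count the receive as an event

p0, p1 = VectorClock(0, 2), VectorClock(1, 2)
msg = p0.send()   # p0's vector becomes [1, 0]
p1.receive(msg)
print(p1.v)       # → [1, 1]
```

Unlike Lamport timestamps, comparing two vectors element by element now does reveal whether one event causally precedes the other.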
Page 48
Group Communication
Page 49
Modes of communication
• unicast
  – 1 → 1
  – point-to-point
• anycast
  – 1 → nearest one of several identical nodes
  – introduced with IPv6; used with BGP
• netcast
  – 1 → many, one at a time
• multicast
  – 1 → many
  – group communication
• broadcast
  – 1 → all
Page 50
Groups
Groups are dynamic
– created and destroyed
– processes can join or leave
– a process may belong to 0 or more groups
Send a message to one entity
– deliver it to the entire group
Deal with a collection of processes as one abstraction.
Page 51
For multicast
• atomic
• reliable
• unreliable
• ordering
Page 52
Multicasting considerations
Figure: the multicast design space along two axes. Reliability: atomic, reliable, unreliable. Message ordering: unordered, FIFO, causal, sync, total/global.
Page 53
Distributed shared memory
Page 54
Motivation
SMP systems
– run parts of a program in parallel
– share a single address space
  • share data in that space
– use threads for parallelism
– use synchronization primitives to prevent race conditions
Can we achieve this with multicomputers?
– All communication and synchronization must be done with messages
Page 55
Distributed Shared Memory (DSM)
Goal: allow networked computers to share a region of virtual memory.
• How do you make a distributed memory system appear local?
• Physical memory on each node is used to hold pages of the shared virtual address space; processes address it like local memory.
Page 56
Issues
• Access (MMU)
• Caching
• Replication
• Consistency
Page 57
Security
Page 58
Terms: types of ciphers
• Restricted cipher
• Symmetric algorithm
• Public-key algorithm
Page 59
Classic Cryptosystems
• Substitution ciphers
• Transposition ciphers
• Combined ciphers
• Rotor machines
• One-time pads
Page 60
Public-key algorithms
• Diffie-Hellman exponential key exchange
• RSA algorithm
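Diffie-Hellman exponential key exchange in miniature, with deliberately tiny toy numbers (a real exchange needs large primes and a vetted cryptography library):

```python
# Diffie-Hellman sketch with toy parameters; do not use small numbers
# like these in practice.
p, g = 23, 5              # public: prime modulus and generator
a, b = 6, 15              # private keys chosen by each side
A = pow(g, a, p)          # Alice publishes A = g^a mod p
B = pow(g, b, p)          # Bob publishes B = g^b mod p
k_alice = pow(B, a, p)    # each side raises the other's public value
k_bob = pow(A, b, p)      # to its own private key
print(k_alice, k_bob)     # → 2 2 (both derive the same shared secret)
```

Both sides compute g^(ab) mod p without ever transmitting a private key; an eavesdropper sees only p, g, A, and B.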
Page 61
Digital signatures
• Arbitrated protocols
• Integrity of the document
Page 62
Authentication
Three factors:
– something you have: a key, a card
  • can be stolen
– something you know: passwords
  • can be guessed, shared, or stolen
– something you are: biometrics
  • costly, can (sometimes) be copied
Page 63
Passwords
• Reusable passwords
• One-time passwords
• S/Key authentication
• SKID2/SKID3 authentication
• Kerberos authentication
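An S/Key-style one-time password comes from a hash chain; a sketch (hash choice and counts are illustrative, not the original MD4-based scheme):

```python
import hashlib

# S/Key-style hash chain sketch: the server stores h^n(secret); each
# login reveals the previous link, which the server verifies by
# applying the hash one more time.
def h(x: bytes) -> bytes:
    return hashlib.sha256(x).digest()

secret, n = b"a user secret", 100

chain = secret
for _ in range(n):
    chain = h(chain)
server_stored = chain        # server keeps h^100(secret)

otp = secret                 # user logs in with h^99(secret)
for _ in range(n - 1):
    otp = h(otp)

print(h(otp) == server_stored)   # → True; server then stores otp
```

After a successful login the server replaces its stored value with the revealed link, so a captured password is useless for the next login.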
Page 64
The end.