disks and files vivek pai princeton university. 2 gedankenagain what is one-millionth of the...

27
Disks and Files Vivek Pai Princeton University

Post on 19-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

Disks and Files

Vivek Pai

Princeton University

Page 2: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

2

Gedankenagain

What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical world have the

same solution when the problem changes by six orders of magnitude?

If two machines are cooperating, how can one tell when the other finishes a task? If the “controller” is managing several “workers”? If controller can’t proceed until worker is done?

Page 3: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

3

Mechanics

Today’s goals: finish up last time’s issues

Start going through some disk infoDisk as a precursor to filesystem

Next week’s reading listed on home page (well, linked from lecture name)

First: any questions?

Page 4: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

4

Communicating Machines

Controller sends task

Worker performs task

Worker = network card, disk, keyboard

Controller Worker

Page 5: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

5

Polling Versus Interrupts

PollingCheck “constantly”Wastes resources – why?Simpler design

InterruptsController free to do other workMore mechanism needed

Page 6: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

6

When Are They Appropriate?

PollingLow cost systemsLow-delay environmentsHigh-performance systems

InterruptsMultiprogrammed systemsPower-conscious environments

Page 7: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

7

Why Interrupts For Syscalls?

Interrupts have to existHardware communication (IRQs)Must be delivered to OSHave to be in privileged mode

Software interrupts for syscallsSame infrastructureSimilar requirements

Page 8: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

8

Signals

Notification mechanism to programUsed by OS/hardware to alert programAsynchronous – like an interrupt

What can program do?Default action (signal-specific) Ignore itPerform some other action

Man: signal, sigprocmask

Page 9: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

9

Some Signals

SIGHUP terminate process terminal line hangup

SIGINT terminate process interrupt program

SIGILL create core image illegal instruction

SIGFPE create core image floating-point exception

SIGKILL terminate process kill program

SIGSEGV create core image segmentation violation

SIGPIPE terminate process write on a pipe with no reader

SIGALRM terminate process real-time timer expired

SIGURG discard signal urgent condition present on socket

SIGSTOP stop process stop (cannot be caught or ignored)

SIGCONT discard signal continue after stop

Page 10: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

10

Disks First, Then Filesystems

Disk properties shape filesystems

None of it matters for correctnessWe’re not about correctnessCorrectness is easyWe’re about performance

Moving target

Page 11: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

11

Today’s Typical Disks

Form factor: .5-1” 4” 5.7”Storage: 18-73GB

Form factor: .4-.7” 2.7” 3.9”Storage: 4-27GB

Form factor: .2-.4” 2.1” 3.4”Storage: 170MB-1GB

Page 12: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

12

Magnetic Disk Capacity

1

10

100

1,000

10,000

100,000

MBytes

80 84 88 92 96 00

1.8”

2.5”

3.5”

5.25”

8-14”

80%/year

Page 13: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

13

Disk Technology TrendsDisks are getting smaller for similar capacity Spin faster, less rotational delay, higher bandwidth Less distance for head to travel (faster seeks) Lighter weight (for portables)

Disk data is getting denser More bits/square inch Tracks are closer together Doubles density every 18 months

Disks are getting cheaper ($/MB) Factor of ~2 per year since 1991 Head close to surface

Page 14: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

14

Disk Organization

Disk surface Circular disk coated with

magnetic material

Tracks Concentric rings around

disk surface, bits laid out serially along each track

Sectors Each track is split into

arc of track (min unit of transfer)

sector

Page 15: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

15

Disk Organization As Fiction

Fixed arc implies inefficiencyshort inner sectors,

long outer sectors

RealityMore sectors on

outer tracksDisks map

transparently

sector

Page 16: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

16

More on Disks

CD’s and floppies come individually, but magnetic disks come organized in a disk packCylinder Certain track of the

platter

Disk arm Seek the right cylinder

seek a cylinder

Page 17: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

17

Disk Examples (Summarized Specs)Seagate Barracuda IBM Ultrastar 72ZX

Capacity, Interface & ConfigurationFormatted Gbytes 28 73.4

Interface Ultra ATA/66 Ultra160 SCSIPlatters / Heads 4 / 8 11/22Bytes per sector 512 512-528

PerformanceMax Internal transfer rate (Mbytes/sec) 40 53

Max external transfer rate (Mbytes/sec) 66.6 160Avg Transfer rate( Mbytes/sec) > 15 22.1-37.4Multisegmented cache (Kbytes) 512 16,384Average seek, read/write (msec) 8 5.3

Average rotational latency (msec) 4.16 2.99Spindle speed (RPM) 7,200 10,000

Page 18: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

18

More on Disk PerformanceSeek Position heads over cylinder, typically 5.3 8 ms

Rotational delay Wait for a sector to rotate underneath the heads Typically 8.3 6.0 ms (7,200 – 10,000RPM) or ½ rotation takes

4.15-3ms

Transfer bytes Average transfer bandwidth (15-37 Mbytes/sec)

Performance of transfer 1 Kbytes Seek (5.3 ms) + half rotational delay (3ms) + transfer (0.04 ms) Total time is 8.34ms or 120 Kbytes/sec!

What block size can get 90% of the disk transfer bandwidth?

Page 19: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

19

Disk BehaviorsThere are more sectors on outer tracks than inner tracks Read outer tracks:

37.4MB/sec Read inner tracks: 22MB/sec

Seek time and rotational latency dominates the cost of small reads A lot of disk transfer

bandwidth are wasted Need algorithms to reduce

seek time

Block Size (Kbytes)

% of Disk Transfer

Bandwidth

1Kbytes 0.5%

8Kbytes 3.7%

256Kbytes 55%

1Mbytes 83%

2Mbytes 90%

Page 20: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

20

Observations

Getting first byte from disk read is slow high latency

Peak bandwidth high, but rarely achieved

Need to mitigate disk performance impact Do extra calculations to speed up disk access

Schedule requests to shorten seeks

Move some disk data into main memory – filesystem caching

Page 21: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

21

FIFO (FCFS) order

Method First come first serve

Pros Fairness among requests In the order applications

expect

Cons Arrival may be on random

spots on the disk (long seeks)

Wild swing can happen

0 199

98, 183, 37, 122, 14, 124, 65, 67

53

Page 22: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

22

SSTF (Shortest Seek Time First)

Method Pick the one closest on disk Rotational delay is in

calculation

Pros Try to minimize seek time

Cons Starvation

Question Is SSTF optimal? Can we avoid starvation?

0 199

98, 183, 37, 122, 14, 124, 65, 67(65, 67, 37, 14, 98, 122, 124, 183)

53

Page 23: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

23

Elevator (SCAN)

Method Take the closest request in

the direction of travel Real implementations do

not go to the end (called LOOK)

Pros Bounded time for each

request

Cons Request at the other end

will take a while

0 199

98, 183, 37, 122, 14, 124, 65, 67(37, 14, 65, 67, 98, 122, 124, 183)

53

Page 24: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

24

C-SCAN (Circular SCAN)

Method Like SCAN But, wrap around Real implementation

doesn’t go to the end (C-LOOK)

Pros Uniform service time

Cons Do nothing on the return

0 199

98, 183, 37, 122, 14, 124, 65, 67(65, 67, 98, 122, 124, 183, 14, 37)

53

Page 25: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

25

History of Disk-related Concerns

When memory was expensiveDo as little bookkeeping as possible

When disks were expensiveGet every last sector of usable space

When disks became more commonMake them much more reliable

When processor got much fasterMake them appear faster

Page 26: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

26

Disk Versus Memory

Memory

Latency in 10’s of processor cycles

Transfer rate 300+MB/s

Contiguous allocation gains ~10x

Disk

Latency in milliseconds

Transfer rate 5-50MB/s

Contiguous allocation gains ~1000x

Page 27: Disks and Files Vivek Pai Princeton University. 2 Gedankenagain What is one-millionth of the coast-to-coast distance of the US? Does anything in the physical

27

On-Disk Caching

Method Put RAM on disk controller to cache blocks

Seagate ATA disk has .5MB, IBM Ultra160 SCSI has 16MB Some of the RAM space stores “firmware” (an OS)

Blocks are replaced usually in LRU order

Pros Good for reads if you have locality

Cons Expensive Need to deal with reliable writes