a performance comparison of raid-5 and log-structured-arrays

21
A performance comparison of RAID-5 and Log-Structured- Arrays Jai Menon

Upload: lore

Post on 16-Jan-2016

36 views

Category:

Documents


1 download

DESCRIPTION

A performance comparison of RAID-5 and Log-Structured-Arrays. Jai Menon. Outline of the talk. LSA Introduction Analyze RAID-5 performance using an analytical model. Examine LSA performance. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A performance comparison of RAID-5 and Log-Structured-Arrays

A performance comparison of RAID-5 and Log-Structured-

Arrays

Jai Menon

Page 2: A performance comparison of RAID-5 and Log-Structured-Arrays

Outline of the talk

• LSA Introduction• Analyze RAID-5 performance using an

analytical model.• Examine LSA performance.• Sensitivity of LSA performance to free

space and compression ratio achieved.• RAID5 with compression.• Results.

Page 3: A performance comparison of RAID-5 and Log-Structured-Arrays

LSA

• Borrows heavily from Log-Structured-File systems.

• Combines LFS,RAID,Compression and Non-volatile cache.

• Updated data is written into new disk locations.

Page 4: A performance comparison of RAID-5 and Log-Structured-Arrays

Raid-5 performance analysis

• RAID-5 controller built in from Non-Volatile-Storage(NVS).

• N+1 disks (N data+1 parity).• Fast write.• Dirty disk blocks.• Clean disk blocks.• Dirty blocks are written to disk using destaging.• NVS contains only data blocks.(not parity)

Page 5: A performance comparison of RAID-5 and Log-Structured-Arrays

• Read hit:block requested is in cache.

• Read miss:block requested not in cache.

• Write hit: block written by a write request is in cache.

• Write miss: block written by a write request is not in cache.

Page 6: A performance comparison of RAID-5 and Log-Structured-Arrays

• Write: data block is placed in the Non-Volatile cache, and destaged later.

• N+1 disks each is represented by 2 queues (read misses,destages).

• Read misses have higher priority.

• Non-preemptive priority.

Page 7: A performance comparison of RAID-5 and Log-Structured-Arrays

• r. h., w. h., w. m. satisfied directly by cache.

• R. m. pass through the system then enter the read miss queue.

• A destage is 3 or 4 disk operations.

Page 8: A performance comparison of RAID-5 and Log-Structured-Arrays

M/M/P system

P1

P2

Pn

Page 9: A performance comparison of RAID-5 and Log-Structured-Arrays

LSA

• Data is stored in compressed form.• The system must keep a directory to locate

data items.• The host system writes records to the

subsystem, record is compressed and stored in controller cache.

• Logical track appears as a contiguous entity on disk.

Page 10: A performance comparison of RAID-5 and Log-Structured-Arrays

LSA

• N+1 disks. Each is divided into segment columns.• A segment is the corresponding segment- columns

from n+1 disks.• One segment-column contains the parity of the

remaining segment-columns.• Parity segment columns are rotated among disks.• Logical devices are mapped and stored in the

LSA.

Page 11: A performance comparison of RAID-5 and Log-Structured-Arrays

LSA

• The location of a logical track (LSA directory) changes over time.

• LSA directory is in NVS in disk controller.• When the fraction of cache occupied by

modified track exceeds a threshold, some number of modified tracks moved to a memory segment, from where they destaged to disk.

Page 12: A performance comparison of RAID-5 and Log-Structured-Arrays

LSA

• Garbage collection.• If data can be written to any one of the disk

arrays, skew flattening can be better than RAID5(skew flattening within an array).

• Seek affinity is worse .• Disks see read misses,destage writes,reads

due to write misses and garbage collection reads and writes.

Page 13: A performance comparison of RAID-5 and Log-Structured-Arrays

Effect of seek affinity

• Loss of seek affinity have a negative impact on performance.

• Performance of LSA without seek affinity is 50% higher at low I/O rates and 100% higher at high I/O rates than .5 seek affinity.

Page 14: A performance comparison of RAID-5 and Log-Structured-Arrays

Effect of Free Space

• More free space implies lower average segment occupancy and hence lower garbage collection rate.

• If space allocated is 0.9 then garbage collection is responsible for 30% of the device utilization.

• If it is 0.7 then garbage collection takes 10%. The device can handle 25% more I/Os.

• Preferable 0.8 occupancy or less.

Page 15: A performance comparison of RAID-5 and Log-Structured-Arrays

Effect of Compression Ratio

• The better the compression ratio the greater the free space and hence the better the performance.

• The better the compression ratio the better the cache hit ratio.

Page 16: A performance comparison of RAID-5 and Log-Structured-Arrays

RAID5 with compression in cache

• A version of RAID5 in which data is stored compressed in cache.

• To write a record to disk, store it in compressed form but leave enough pad after it.

• We can always do update-in-place.• No disk space is saved.• No LSA directory is needed.• Improved performance or lower cost is possible.

Page 17: A performance comparison of RAID-5 and Log-Structured-Arrays

Assumptions

• Storing of old data in RAID5 and LSA directory takes 17% of the cache.

• Systems with compressed cache have larger cache.

• RAID5 has flat skew within an array and some skew between arrays.LSA has no skew.

• Only LSA gets better transfer times.• RAID5 has better seek affinity.

Page 18: A performance comparison of RAID-5 and Log-Structured-Arrays

results

• With no compression, LSA has a worse response time compared to RAID5, and about the same cost.

• With 3:1 compression LSA has better response time and a lower cost.However it has worse performance and better cost than RAID5 with compression.

Page 19: A performance comparison of RAID-5 and Log-Structured-Arrays

Summary

• RAID5 with compression has better response time and throughput than RAID5.

• If data is already pre-compressed then LSA has worse response time and better throughput.

• If data is not pre-compressed then we can build an LSA 2X cheaper that has much worse response time but better throughput.

Page 20: A performance comparison of RAID-5 and Log-Structured-Arrays

Comparison on a workload with flat skew

• With no data compression LSA and RAID5 have the same response time and almost the same throughput.

• With 3:1 compression, LSA has slightly worse response time and better throughput than RAID5 with compression.

• RAID5 with compression has better performance.

Page 21: A performance comparison of RAID-5 and Log-Structured-Arrays

Questions

• What is LSA?

• Compare RAID5,LSA and RAID5 with compression with respect to response time and throughputs.

• What is skew , and how can it affect the performance of a disk array?