lustre* with zfs* sc16 presentation - intel · accelerating genomics analysis – use case •...

12

Upload: others

Post on 25-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •
Page 2: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

Lustre* with ZFS*

Keith Mannthey, Lustre Solutions Architect

Intel High Performance Data Division

Page 3: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

Legal Information• All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps

• Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit http://www.intel.com/performance.

• Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at http://www.intel.com/content/www/us/en/software/intel-solutions-for-lustre-software.html.

• Intel technologies may require enabled hardware, specific software, or services activation. Check with your system manufacturer or retailer.

• You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein.

• No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.

• The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.

• Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade.

• This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps.

• A "Mission Critical Application" is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL'S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS' FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS.

• Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined". Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information.

• Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

• * Other names and brands may be claimed as the property of others.

• © 2016 Intel Corporation3

Page 4: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

Lustre with ZFS

• Motivations for Lustre w/ZFS

• Lustre w/ZFS – Unique Features

• Industry Use Cases

• Industry Momentum

• Intel’s Commitment to Lustre w/ZFS

4

Page 5: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

5

Motivation

Machine Learning

Genomics

Video/Animation

Simulation

LUSTRE with OpenZFS

EXAscale Computing

Technical NeedsUsage Models

Performance

Rapid Scalability

Security & Compliance

Manageability

Reliability / Availability

Extreme Performance at Scale

Integrated Security

SW Management Stack

Data Integrity and Recovery

Open Source and ExtensibleDEEP Integration

Page 6: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

ZFS – Unique Features

• Incredible reliability– Data is always consistent on disk; silent data corruption is detected and corrected;

smart rebuild strategy

• Compression– Maximize usable capacity for increased ROI

• Snapshot – support built into Lustre– Consistent snapshot across all the storage targets without stopping the file system.

• Hybrid Storage Pool– Data is tiered automatically across DRAM, SSD/NVMe and HDD accelerating random

& small file read performance

• Manageability– Powerful storage pool management makes it easy to assemble and maintain Lustre

storage targets from individual devices6

Page 7: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

Accelerating Genomics Analysis –Use Case

• Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability.

• Solution: Storage System based on Intel® Enterprise Edition for Lustre* Software w/ZFS.

• Results:

• 20X improvement in data r/w capacity, lower costs, faster speed with supplying data to high-performance computing clusters.

• High efficiency from full use of computing resources

• Full confidence in meeting the demand for continuous performance and capacity upgrades caused by increasingly complex genetic information research.

• Summary: Gene Sequencing, Data Archiving and Storage Clusters based on Intel® Enterprise Edition for Lustre* software have improved data throughput performance and accelerated value mining and insights into genetic information.

7

Page 8: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

ZFS Enhancements in the path of Exascale

• Changes for using ZFS more efficiently

• Improved file create performance

• Snapshots of whole file system

• Changes to core ZFS code

• Inode quota accounting

• Multi-mount protection for safety

• System and fault monitoring improvements

• Large dnodes for improved extended attribute performance

• Reduce CPU usage with hardware-assisted checksums, compression

• Declustered parity & distributed hot spaces to improve re-silvering

• Metadata allocation class to store all metadata on SSD/NVRAM

8

Page 9: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

• Path to Exascale

• CORAL and future follow-on architectures are scoped with ZFS.

• LLNL Sequoia1 (55PB File System)

• Cheaper, less complex, higher performance file system for Sequoia

• With Intel, Lustre and ZFS continue to advance

• Collaborate with OpenZFS community on new features.

• Breakthrough metadata performance: LAD’16 Talk

Industry Adoption

1 http://computation.llnl.gov/projects/zfs-lustre 9

Page 10: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

Intel

IPCC-L

OpenZFS

Intel’s Commitment to Lustre w/ZFS

Native Encryption Built-

in encryption for data at

rest to provide enhanced

storage security.

Persistent Read Cache

Update of existing L2ARC

read cache to persist data

across reboots.

Performance Enhancements

ZFS improvements for increased

metadata performance.

Fault Management

Enhanced fault monitoring and

management architecture for ZFS.

D-RAID –

De-clustered RAIDZ provides

massively improved rebuild

performance after a drive failure.

Parity acceleration – Using AVX instructions to

accelerate parity calculation 10

Page 11: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •

Wrap up

• Lustre w/ZFS provides • Data Integrity

• Compression

• Snapshots

• Learn More• www.intel.com/Lustre

11

Page 12: Lustre* with ZFS* SC16 Presentation - Intel · Accelerating Genomics Analysis – Use Case • Challenge: Improve data r/w performance; Reduce TCO & Enhance System Scalability. •