Cluster Software Overview

Upload: donna-grant

Post on 17-Jan-2016

Cluster Software Stack

Linux Kernel/Environment
  Red Hat, SuSE, Debian, etc.

Cluster Software Stack

HPC Device Drivers
  Interconnect drivers (e.g., Myrinet, Infiniband, Quadrics)
  Storage drivers (e.g., PVFS)

Cluster Software Stack

Job Scheduling and Launching
  Sun Grid Engine (SGE)
  Portable Batch System (PBS)
  Load Sharing Facility (LSF)

Cluster Software Stack

Cluster Software Management
  E.g., Rocks, OSCAR, Scyld

Cluster Software Stack

Cluster State Management and Monitoring
  Monitoring: Ganglia, Clumon, Nagios, Tripwire, Big Brother
  Management: Node naming and configuration (e.g., DHCP)
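
As an illustration of node naming and configuration through DHCP, the sketch below renders host entries in the style of an ISC dhcpd.conf file. The naming scheme (compute-<rack>-<rank>), the MAC addresses, the 10.1.0.x subnet, and both function names are assumptions made for this example; they are not taken from any of the tools named on the slide.

```python
def make_nodes(rack, macs, subnet_prefix="10.1.0.", first_host=10):
    """Assign a hypothetical name and IP address to each MAC address, in order."""
    nodes = []
    for rank, mac in enumerate(macs):
        name = f"compute-{rack}-{rank}"          # assumed naming scheme
        ip = f"{subnet_prefix}{first_host + rank}"
        nodes.append((name, mac, ip))
    return nodes


def dhcp_host_entries(nodes):
    """Render one dhcpd.conf-style host block per (name, mac, ip) tuple."""
    blocks = []
    for name, mac, ip in nodes:
        blocks.append(
            f"host {name} {{\n"
            f"    hardware ethernet {mac};\n"
            f"    fixed-address {ip};\n"
            f"}}"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    # Made-up MAC addresses, for illustration only.
    print(dhcp_host_entries(make_nodes(0, ["00:11:22:33:44:55", "00:11:22:33:44:56"])))
```

Keeping the name, MAC, and IP in one table means the same data could also feed a hosts file or a monitoring configuration, which is one way to keep node identity consistent.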

Cluster Software Stack

Message Passing and Communication Layer
  E.g., Sockets, MPICH, PVM
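
To make the lowest level of this layer concrete, here is a minimal sketch of message passing over plain sockets, using only the Python standard library. The length-prefixed framing and the helper names (send_msg, recv_msg, ping_pong) are choices made for this example; MPICH and PVM provide far richer interfaces on top of transports like this.

```python
import socket
import struct


def send_msg(sock, payload):
    """Send one message: a 4-byte big-endian length followed by the payload."""
    sock.sendall(struct.pack("!I", len(payload)) + payload)


def recv_exact(sock, n):
    """Read exactly n bytes, looping because recv may return fewer."""
    buf = b""
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            raise ConnectionError("peer closed the connection")
        buf += chunk
    return buf


def recv_msg(sock):
    """Receive one length-prefixed message."""
    (length,) = struct.unpack("!I", recv_exact(sock, 4))
    return recv_exact(sock, length)


def ping_pong():
    """Exchange a request and a reply over a connected socket pair."""
    left, right = socket.socketpair()
    try:
        send_msg(left, b"ping")
        request = recv_msg(right)
        send_msg(right, request.upper())
        return recv_msg(left)
    finally:
        left.close()
        right.close()


if __name__ == "__main__":
    print(ping_pong())
```

The socket pair stands in for a connection between two nodes; on a real cluster each end would be a TCP socket on a different machine.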

Cluster Software Stack

Parallel Code / Web Farm / Grid / Computer Lab
  Locally developed code

Cluster Software Stack

Questions:
  How to deploy this stack across every machine in the cluster?
  How to keep this stack consistent across every machine?
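
One way to think about the consistency question is as a comparison of each node's installed packages against a reference list. The sketch below assumes the package inventories have already been collected into dictionaries; the node names, package names, versions, and the find_drift function are all hypothetical and serve only to illustrate the idea.

```python
def find_drift(reference, nodes):
    """Return {node: {package: (expected, found)}} for nodes that differ from reference.

    A value of None means the package is missing on that side.
    """
    drift = {}
    for node, packages in nodes.items():
        diffs = {}
        for pkg in sorted(set(reference) | set(packages)):
            expected = reference.get(pkg)
            found = packages.get(pkg)
            if expected != found:
                diffs[pkg] = (expected, found)
        if diffs:
            drift[node] = diffs
    return drift


if __name__ == "__main__":
    # Made-up inventories, for illustration only.
    reference = {"mpich": "1.2.5", "pvm": "3.4.4"}
    nodes = {
        "compute-0-0": {"mpich": "1.2.5", "pvm": "3.4.4"},
        "compute-0-1": {"mpich": "1.2.4"},
    }
    for node, diffs in find_drift(reference, nodes).items():
        print(node, diffs)
```

The deployment methods discussed next differ mainly in how they avoid this drift in the first place: by reinstalling nodes from a known description or image rather than patching them individually.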

Software Deployment

Known methods:
  By hand
  “Add-on” method
    Bring up a frontend, then add cluster packages
    OpenMosix, OSCAR, Warewulf
  Integrated
    Cluster packages are added at frontend installation time
    Rocks, Scyld

OpenMosix

Overview
  Single system image - all nodes look like one large multiprocessor
  Jobs migrate from machine to machine (based on machine load)
  No changes required for apps to use system
Processors supported
  x86
  Opteron port in progress
  Itanium port unclear
Interconnects supported
  All IP-based networks
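
The idea of jobs migrating according to machine load can be illustrated with a toy policy. The sketch below is not OpenMosix's actual algorithm, which runs inside the kernel and weighs more than a single load figure; it simply moves one job at a time from the busiest node to the idlest node while the gap exceeds a threshold. The function names, the threshold, and the load values are assumptions made for this example.

```python
def pick_migration(loads, threshold=1.0):
    """Return (source, target) if the load gap exceeds threshold, else None."""
    source = max(loads, key=loads.get)
    target = min(loads, key=loads.get)
    if loads[source] - loads[target] > threshold:
        return source, target
    return None


def balance(loads, threshold=1.0, max_steps=100):
    """Repeatedly migrate one job from the busiest to the idlest node."""
    loads = dict(loads)  # work on a copy; the caller's mapping is untouched
    moves = []
    for _ in range(max_steps):
        move = pick_migration(loads, threshold)
        if move is None:
            break
        source, target = move
        loads[source] -= 1
        loads[target] += 1
        moves.append(move)
    return loads, moves


if __name__ == "__main__":
    # Made-up job counts per node, for illustration only.
    print(balance({"n0": 4, "n1": 0, "n2": 1}))
```

Because the policy only looks at node load, the jobs themselves need no changes, which mirrors the slide's point that applications run unmodified.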

OpenMosix Installation

Install all nodes with Linux
  Can use Red Hat, SuSE, Debian, Gentoo, etc.
Download and install OpenMosix add-on packages on each node (or configure and compile custom kernel)
  Kernel patches
Supports diskful and diskless
Reboot all nodes
  Now running OpenMosix kernel

OSCAR

Overview
  Image-based installation
    Bring up frontend by hand
    Add OSCAR packages to frontend
    Construct image for compute nodes within a directory on the frontend
  Red Hat 9.0 and Mandrake 9.0 support
  Includes LAM/MPI, MPICH, PVM, Maui/PBS, PVFS, Clumon
Processors supported: x86, Itanium (in beta)
Interconnects: Ethernet, Myrinet

OSCAR

Diskless support in development
Opteron support in development
High-availability support in alpha testing

Warewulf Overview

Install frontend first
  Recommend using RPM-based distribution
Image-based installation
  “Virtual node filesystem”
  Attacks problem of generic slave node management
Standard cluster software not included
  Added separately
  Use ‘chroot’ commands to add in extra software
Supports diskful and diskless
Processors supported: x86, Opteron
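
The ‘chroot’ step can be sketched as building a command line that runs an installer with the node image as its root directory. The image path /vnfs/default, the package path, and the helper names below are hypothetical; the sketch only constructs and prints the command and does not run it, since chroot requires root privileges and an image that actually exists.

```python
import shlex


def chroot_command(image_root, command):
    """Build the argv that runs `command` with `image_root` as its root directory."""
    if not command:
        raise ValueError("command must not be empty")
    return ["chroot", image_root, *command]


def render(argv):
    """Format an argv list as a shell-safe string for display or logging."""
    return " ".join(shlex.quote(part) for part in argv)


if __name__ == "__main__":
    # Hypothetical image root and package; the package path is resolved
    # inside the image, not on the frontend's own filesystem.
    argv = chroot_command("/vnfs/default", ["rpm", "-ivh", "/tmp/extra.rpm"])
    print(render(argv))
```

Passing the resulting list to subprocess.run would execute it; keeping construction separate from execution makes a dry run easy.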

Scyld

Commercial distribution
Single system image design
  “Bproc”
Based on Red Hat Enterprise Server 3.0
Processors: x86 and Opteron
  Itanium in development
Interconnects: Ethernet and Infiniband
MPI and PVM
Diskful and diskless support

Rocks

Cluster on a CD
Based on Red Hat Enterprise Workstation 3.0
Processors: x86, Opteron, Itanium
Interconnects: Ethernet and Myrinet
  Infiniband in development
Compute node management via Red Hat’s kickstart mechanism
Diskful only

Many Others

Clustermatic
  Uses LinuxBIOS
Clic
  From MandrakeSoft
Fully Automatic Installation
  Debian-based
Callident
  Commercial version of Rocks
Qlusters
  Commercial version of openMosix