Utilizing SSHFS for Distributed Computing
What is SSHFS?
• SSHFS allows you to mount data through an ssh connection.
  – http://fuse.sourceforge.net/sshfs.html
  – http://wikipedia.org/wiki/SSH_Filesystem
• Simple command line
  – sshfs remoteuser@remotehost:/path/to/remote_dir local_dir
• Performance is limited by the speed of your ssh connection.
• Works on Linux (i386, x86_64, ia64).
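As a hedged sketch of the command line above: the script below prints the mount and unmount commands rather than running them, so it can be tried without a remote host. The mount point `$HOME/remote_data` and the helper names `sshfs_mount_cmd` and `sshfs_umount_cmd` are illustrative assumptions, not from the slides. `fusermount -u` is the usual FUSE unmount command on Linux.

```shell
#!/bin/sh
# Placeholder values for illustration; substitute your own.
REMOTE="remoteuser@remotehost:/path/to/remote_dir"
MOUNT_POINT="${MOUNT_POINT:-$HOME/remote_data}"

# Print the mount command for a remote spec and a local directory.
sshfs_mount_cmd() {
    printf 'sshfs %s %s\n' "$1" "$2"
}

# Print the matching unmount command (fusermount ships with FUSE).
sshfs_umount_cmd() {
    printf 'fusermount -u %s\n' "$1"
}

# Dry run: show the commands instead of executing them.
sshfs_mount_cmd "$REMOTE" "$MOUNT_POINT"
sshfs_umount_cmd "$MOUNT_POINT"
```

Running the two printed commands by hand performs the real mount and unmount. The local directory must already exist before mounting.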
Why use sshfs?
• Allows you to use local applications on remote data.
  – e.g. using ParaView to look at data processed on the TeraGrid and stored on the GPFS-WAN.
• You access the remote file directly, so your changes are seen by everyone.
• SSHFS allows you to tune performance.
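One way to tune performance is through mount options passed with `-o`. The sketch below assembles a comma-separated option list and prints the resulting command. The specific choices (`reconnect`, `ServerAliveInterval=15`, `compression=yes`) are assumptions for illustration, not settings from the slides. Option availability varies between sshfs versions, so check `sshfs --help` on your system.

```shell
#!/bin/sh
# Join arguments with commas, the form that -o expects.
join_opts() {
    (IFS=,; printf '%s\n' "$*")
}

# Illustrative choices only:
#   reconnect               reconnect if the ssh link drops
#   ServerAliveInterval=15  ssh keepalive, passed through to ssh
#   compression=yes         same as -C; may help on slow links
OPTS=$(join_opts reconnect ServerAliveInterval=15 compression=yes)

# Dry run: print the command instead of mounting.
echo "sshfs -o $OPTS remoteuser@remotehost:/path/to/remote_dir local_dir"
```

Compression trades CPU time for bandwidth, so it is worth measuring with your own data before settling on it.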
Easy to install on Linux
• Ubuntu
  – sudo apt-get install sshfs
• CentOS (x86_64 example)
  – download fuse (http://fuse.sourceforge.net)
    • ./configure --prefix=/usr --libdir=/usr/lib64
  – download sshfs
    • ./configure --prefix=/usr --libdir=/usr/lib64
  – /etc/rc.d/rc.local
    • mknod /dev/fuse -m 0666 c 10 229
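Before the first mount it can help to confirm that the FUSE device node from the rc.local step exists. A minimal sketch, assuming a helper named `fuse_dev_status` (hypothetical, not part of FUSE or sshfs):

```shell
#!/bin/sh
# Report whether a path is a character device, as /dev/fuse must be.
fuse_dev_status() {
    dev="${1:-/dev/fuse}"
    if [ -c "$dev" ]; then
        echo "present"
    else
        echo "missing"
    fi
}

echo "/dev/fuse is $(fuse_dev_status /dev/fuse)"
```

If the node is reported missing, loading the kernel module with `modprobe fuse` or running the mknod line from rc.local as root are the usual remedies.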
Advanced Sys. Adm. Tricks
• Utilize SSHFS with autofs to mount remote data for local users
• Utilize Samba to share SSHFS mounts with Windows users
• Utilize SMBWebClient to share data through a web portal
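For the autofs case, a commonly documented map entry form uses `-fstype=fuse` with an `sshfs#` location. The sketch below prints such an entry. The key `data`, the option set, and the helper name `autofs_sshfs_entry` are assumptions for illustration and are not the configuration used by the author. Verify the syntax against your autofs version before relying on it.

```shell
#!/bin/sh
# Print one autofs map line for an sshfs mount.
#   $1 = map key (directory name under the autofs mount point)
#   $2 = user@host
#   $3 = remote path
autofs_sshfs_entry() {
    # autofs expects '#' and ':' in the location to be backslash-escaped.
    printf '%s -fstype=fuse,rw,allow_other :sshfs\\#%s\\:%s\n' "$1" "$2" "$3"
}

# Example with placeholder values.
autofs_sshfs_entry data remoteuser@remotehost /path/to/remote_dir
```

The printed line would go in an indirect map file referenced from /etc/auto.master. autofs typically performs the mount as root, so the ssh login it uses has to work without a password prompt, for example with a key.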
What Does 87TB + 220TB look like?
• 220TB available through the CIS portal using autofs, samba, and smbwebclient.
• CIS has 87TB of local storage.
• /cis/net lists network drives.