a massively parallel architecture for bioinformatics

17
A Massively Parallel Architecture for Bioinformatics Presented by Md Jamiul Jahid

Upload: darena

Post on 24-Feb-2016

58 views

Category:

Documents


0 download

DESCRIPTION

A Massively Parallel Architecture for Bioinformatics. Presented by Md Jamiul Jahid. Introduction. Bioinformatics algorithms are demanding in scientific computing In general most of the bioinformatics algorithms are fairly simple Dealing with huge amount of data - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A Massively Parallel Architecture for Bioinformatics

A Massively Parallel Architecture for Bioinformatics

Presented by Md Jamiul Jahid

Page 2: A Massively Parallel Architecture for Bioinformatics

Introduction

• Bioinformatics algorithms are demanding in scientific computing

• In general most of the bioinformatics algorithms are fairly simple

• Dealing with huge amount of data• The size of DNA sequence database doubles

every year

Page 3: A Massively Parallel Architecture for Bioinformatics

Introduction

• A typical DNA contains 3.4 billion base pairs• Maximum algorithms use only simple

operations with input data like – Arithmetic operation– String matching– String comparison

Page 4: A Massively Parallel Architecture for Bioinformatics

Introduction

• Standard CPUs are designed for providing a good instruction mix for almost all commonly used algorithm

• For a target class of algorithm they are not effective

• Results– High runtime– Energy– Money

Page 5: A Massively Parallel Architecture for Bioinformatics

Contribution

• Present a massively parallel architecture • Using low cost FPGA(Field Programmable Gate

Array)• They called it COPACOBANA 5000– Meaning Cost-Optimized Parallel Code Braker ANd

Analyzer

Page 6: A Massively Parallel Architecture for Bioinformatics

COPACOBANA 1000• This machine is for cryptanalysis: fast code

breaking• 120 low cost FPGAs• 20 subunits• Each has Xilinx Spartan -3 XC3S1000 FPGAs

Page 7: A Massively Parallel Architecture for Bioinformatics

COPACOBANA 1000

• Assumptions– Programs are

parallelizable– Demand of data

transfer is low– All node needed

very little local memory which can be served from on-chip RAM of FPGAs

Page 8: A Massively Parallel Architecture for Bioinformatics

COPACOBANA 5000

• Bus Concepts– Point to point connection two neighboring FPGA-

cards– Point to point connection contain 8 pairs of wire– Each 250MHz, total 2Gbit/s

Page 9: A Massively Parallel Architecture for Bioinformatics
Page 10: A Massively Parallel Architecture for Bioinformatics

COPACOBANA 5000

• Controller– Root entity of control is running on a remote host

computer– Connected to COPACOBANA5000 by LAN– Two scenario• Data on remote host• Data on COPACOBANA5000

Page 11: A Massively Parallel Architecture for Bioinformatics

COPACOBANA 5000

• FPGA-Card– Xilinx Spartan-3 5000 is used– Contains 8 FPGAs– All FPGAs are globally clocked

Page 12: A Massively Parallel Architecture for Bioinformatics

Performance Estimation

• Between– PC– COPACOBANA1000– COPACOBANA5000

Page 13: A Massively Parallel Architecture for Bioinformatics

Performance Estimation

Page 14: A Massively Parallel Architecture for Bioinformatics

Conclusion

• In this paper a new hardware for running bioinformatics algorithm is proposed

• The hardware are– Cheap– Low power consumption– Efficient

Page 15: A Massively Parallel Architecture for Bioinformatics

Questions

?

Page 16: A Massively Parallel Architecture for Bioinformatics

Thank You

Page 17: A Massively Parallel Architecture for Bioinformatics

Reference• Gerd Pfeiffer, Stefan Baumgart, Jan Schröder, and Manfred Schimmler,

A Massively Parallel Architecture for Bioinformatics, 9th International Conference on Computational Science (ICCS 2009).