b i o i n f o r m a t i c s an intro
TRANSCRIPT
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Bioinformatics – an Introduction
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
“Bioinformatics addresses problems related to thestorage, retrieval and analysis of information about
biological structure, sequence and function.
- National Institute of Health
Definition
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Bioinformatics deals with
• Design and implementation of new algorithms and statistics which assess relationship among members of large data sets.
• Analysis and interpretation of various data types, which includes nucleotide and amino acid sequences and structure of protein.
• To develop computational tools and databases that enables efficient analysis, access and management of biologically significant information.
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Bioinformatics and Human Genome Project
• The Human Genome Project is a 13-year effort coordinated by the U.S. Department of Energy and the National Institutes of Health.
• The project budget was 3 billion and it was completed three years before the proposed time due to the technological advancements in biological data storage, management and analysis.
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Necessary Skill Sets
• Knowledge in Molecular Biology• Statistics• Mathematics (algorithm development)• Communicate biological problems to computer
scientists• Working knowledge in bioinformatics tools• Computer proficiency (windows/command line)• Programming Skills• Data administration
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Specialized fields
• Computational Biology• Genomics• Proteomics• Bioprogramming• Cheminformatics• Structural Biology• Systems Biology• Pharmacogenomics
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Applications
• Sequence analysis and similarity searches
• Protein structure prediction• Phylogenetics• Molecular docking
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Sequence Analysis and Similarity Searches
• Finding the (protein-coding) gene Sequence alignment Sequence Comparison and functional annotation Domain and pattern analysis
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Finding the (protein-coding) gene?
Protein
mRNA
DNA
transcription
translation
CCTGAGCCAACTATTGATGAA
PEPTIDE
CCUGAGCCAACUAUUGAUGAA
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Pairwise Sequence Alignment
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Multiple Sequence Alignment
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Pfam analysis
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Structure Prediction
P. Paulsharma Chakravarthy
BIO
INFO
RMAT
ICS
Molecular Docking