bioinf workshop2a (1)

8/8/2019 Bioinf Workshop2a (1) http://slidepdf.com/reader/full/bioinf-workshop2a-1 1/20 Bioinformatics Workshop 2

Upload: sylwia-bzdyra

Post on 10-Apr-2018


Bioinformatics Workshop 2

The background to today's task: HER oncogenes

Nomenclature: EGFR, HER, erb-b

Tasks

Download the answer sheet (2) document from Blackboard

Go to the GenBank homepage: http://www.ncbi.nlm.nih.gov/

Search for all the human EGFR protein sequences

Remember to select "Protein" in the drop-down menu
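
The same search can also be expressed as a query against NCBI's E-utilities service. The sketch below only builds the search URL and does not contact NCBI; the function name `build_protein_search_url` and the `retmax` default are illustrative assumptions, and the field tags `[Gene Name]` and `[Organism]` are one reasonable way to phrase the query, not the only one.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities "esearch" endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_protein_search_url(gene: str, organism: str, retmax: int = 20) -> str:
    """Return an esearch URL that queries the Protein database.

    Hypothetical helper for illustration; it only assembles the URL.
    """
    # db=protein plays the role of choosing "Protein" in the drop-down menu.
    term = f"{gene}[Gene Name] AND {organism}[Organism]"
    params = {"db": "protein", "term": term, "retmax": retmax}
    return f"{ESEARCH_URL}?{urlencode(params)}"


url = build_protein_search_url("EGFR", "Homo sapiens")
print(url)
```

Pasting the printed URL into a browser should return an XML list of matching protein record IDs.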

Click on EGFR (Homo sapiens): epidermal growth factor receptor

Have a look around the site!

Click on HPRB

REC 57 - 168

FU 228 - 270

REC 361 - 481

FU 496 - 547

FU 552 - 601

TM 646 - 668

Tyr_Kinase 712 - 968
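
The domain coordinates above can be kept as a small lookup table, which makes it easy to ask which domain a given residue falls in. This is a minimal sketch: the coordinates are copied from the slide and assumed to be 1-based and inclusive, and the helper names `domain_at` and `domain_lengths` are made up for illustration.

```python
# Domain coordinates copied from the slide.
# Assumption: residue numbers are 1-based and inclusive.
DOMAINS = [
    ("REC", 57, 168),
    ("FU", 228, 270),
    ("REC", 361, 481),
    ("FU", 496, 547),
    ("FU", 552, 601),
    ("TM", 646, 668),
    ("Tyr_Kinase", 712, 968),
]


def domain_at(position: int):
    """Return the name of the domain containing a residue, or None."""
    for name, start, end in DOMAINS:
        if start <= position <= end:
            return name
    return None


def domain_lengths():
    """Return (name, start, end, length) rows sorted by start position."""
    ordered = sorted(DOMAINS, key=lambda d: d[1])
    return [(name, start, end, end - start + 1) for name, start, end in ordered]


for row in domain_lengths():
    print("%-10s %4d-%4d  %3d residues" % row)
```

A lookup like `domain_at(650)` is handy later in the workshop, when variable positions in the alignment need to be related to protein motifs.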

Go "Back"

Click on Reference Sequence Details

Click on Swiss-Prot P00533

Then click on FASTA

Open the file and copy the sequence on to your clipboard

Click back and then the NCBI logo
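
A FASTA file is plain text: a header line starting with ">" followed by one or more lines of sequence. The sketch below shows one simple way to read such a file into memory. The function name `parse_fasta` is an assumption, and the example record is a made-up toy sequence, not the real P00533 entry.

```python
def parse_fasta(text: str) -> dict:
    """Parse FASTA text into {header: sequence}.

    Headers are stored without the leading '>'; sequence lines
    belonging to one record are joined into a single string.
    """
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {h: "".join(parts) for h, parts in records.items()}


# Made-up toy record for illustration only.
example = ">toy_protein example record\nMKTAYIAK\nQRQISFVK\n"
records = parse_fasta(example)
print(records)
```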

Click on "BLAST"

"Human"

Database: "RefSeq"

Programme: BLASTP

Then click away

Click on "Select all" and then "Distance Tree of Results"

Copy
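
A distance tree is built from pairwise distances between sequences. As a rough illustration of what a "distance" can mean, the sketch below computes the p-distance (the proportion of aligned sites that differ) for a few made-up toy sequences. This is an assumption chosen for simplicity: NCBI's tree viewer offers its own distance models, and the function names here are hypothetical.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Columns with a gap ('-') in either sequence are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = differing = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x != y:
            differing += 1
    return differing / compared if compared else 0.0


def distance_matrix(aligned: dict) -> dict:
    """Return {(name1, name2): p-distance} for every pair of sequences."""
    return {
        (m, n): p_distance(aligned[m], aligned[n])
        for m, n in combinations(aligned, 2)
    }


# Made-up, pre-aligned toy sequences.
toy = {"seqA": "MKTAYIAK", "seqB": "MKTAYLAK", "seqC": "MRT-YLSK"}
matrix = distance_matrix(toy)
for pair, dist in matrix.items():
    print(pair, round(dist, 3))
```

Sequences with small pairwise distances end up close together in the tree; sequences with large distances sit on separate branches.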

Go back to BLAST: this time search the whole database (not just human)

(get the sequence from your saved EGFR FASTA file)

Putative conserved domains have been detected, click on the image below for detailed results.

Tasks

Next you will need to go to http://www.megasoftware.net/mega41.html

When you get the MEGA link, download the Windows version on to your D drive.

Launch MEGA

Download "Sequences for class" from the Blackboard site

File > Open Data

Select "Sequences for class"

Ignore the error message

Utilities > Convert to MEGA format
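
To give an idea of what the conversion step produces, the sketch below turns FASTA text into a minimal MEGA-style text file: a `#mega` line, a title, a format line, then each sequence name prefixed with `#`. This is a simplified assumption about the layout, not a reimplementation of MEGA's converter; the function name, the default title, and the toy sequences are all illustrative.

```python
def fasta_to_mega(fasta_text: str,
                  title: str = "Sequences for class",
                  data_type: str = "Protein") -> str:
    """Convert FASTA text to a minimal MEGA-style text file (sketch)."""
    lines = ["#mega", f"!Title {title};", f"!Format DataType={data_type};", ""]
    for line in fasta_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first word of the FASTA header as the sequence name,
            # since names in this layout should not contain spaces.
            parts = line[1:].split()
            name = parts[0] if parts else "unnamed"
            lines.append(f"#{name}")
        else:
            lines.append(line)
    return "\n".join(lines) + "\n"


# Made-up toy input for illustration only.
mega_text = fasta_to_mega(">seq1 toy record\nMKT\n>seq2\nMRT\n")
print(mega_text)
```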

Back to MEGA main page

Alignment >

Alignment Explorer/CLUSTAL >

Create a new alignment >

Click No for protein >

Then...

Data > Open >

Retrieve Sequences from File

Click on "Sequences for class"

Edit > Select All

Alignment > Align by CLUSTAL

OK to all "DEFAULT"

File > Exit

Save data to MEGA file

Open data in MEGA file
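
CLUSTAL builds a multiple alignment progressively from pairwise alignments. The sketch below is not CLUSTAL itself: it swaps in a plain Needleman-Wunsch global alignment of two sequences, with a simple match/mismatch/gap scoring scheme chosen as an assumption (real protein aligners use substitution matrices and more refined gap penalties). The toy sequences are made up.

```python
def needleman_wunsch(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -2):
    """Return (score, aligned_a, aligned_b) for a global pairwise alignment."""
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    # First row and column: aligning a prefix against nothing costs gaps.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    # Fill the table with the best score for each pair of prefixes.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one best alignment.
    out_a, out_b = [], []
    i, j = len(a), len(b)
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[-1][-1], "".join(reversed(out_a)), "".join(reversed(out_b))


nw_score, aligned_a, aligned_b = needleman_wunsch("MKTAYIAK", "MKTYIAK")
print(nw_score)
print(aligned_a)
print(aligned_b)
```

The gap character in the second output line marks where a residue has to be inserted or deleted to line the two sequences up, which is the same thing the gaps in the MEGA alignment window represent.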

Click Data > Data Explorer

Display Colour Cells

Highlight "Conserved Sites"

Download to XL (Excel)

Where are the variable and conserved regions?
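
A conserved site is an alignment column in which every sequence carries the same residue. The sketch below finds such columns in a small made-up alignment and prints a marker line, loosely mimicking what the highlighting shows. The function names are hypothetical, and treating gap-containing columns as not conserved is an assumption.

```python
def conserved_sites(aligned: list) -> list:
    """Return 1-based column numbers where all sequences share one residue.

    Assumption: a column containing a gap ('-') is not counted as conserved.
    """
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all sequences must have the same aligned length")
    sites = []
    for col in range(length):
        column = {seq[col] for seq in aligned}
        if len(column) == 1 and "-" not in column:
            sites.append(col + 1)
    return sites


def mark_conserved(aligned: list) -> str:
    """Return a marker line: '*' for conserved columns, '.' for variable ones."""
    conserved = set(conserved_sites(aligned))
    return "".join("*" if i + 1 in conserved else "."
                   for i in range(len(aligned[0])))


# Made-up, pre-aligned toy sequences.
toy_alignment = ["MKTAYIAK", "MKTAYLAK", "MRT-YLSK"]
for seq in toy_alignment:
    print(seq)
print(mark_conserved(toy_alignment))
print(conserved_sites(toy_alignment))
```

Runs of `*` correspond to conserved regions and runs of `.` to variable regions.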

Go back to the main MEGA page and click on Phylogeny

...and play

Tasks

Looking at the tree diagram, write a short paragraph explaining what you think the tree means. You may wish to include things such as the basal taxon, a description of who is related to whom, whether there are any major groupings or clades, etc. This section should be no more than 150 words.

What is bootstrapping?

Which do you think are the oncogenic forms?
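
As a starting point for the bootstrapping question: in phylogenetics, the bootstrap resamples alignment columns with replacement to create pseudo-replicate alignments, rebuilds the tree from each one, and reports how often each grouping reappears. The sketch below shows only the resampling step, on made-up toy sequences; the function name is hypothetical and the tree-building step is deliberately left out.

```python
import random


def bootstrap_alignment(aligned: dict, rng: random.Random) -> dict:
    """Build one pseudo-replicate by resampling columns with replacement.

    The same sampled column indices are applied to every sequence,
    so each replicate column is an intact column of the original.
    """
    length = len(next(iter(aligned.values())))
    columns = [rng.randrange(length) for _ in range(length)]
    return {name: "".join(seq[c] for c in columns)
            for name, seq in aligned.items()}


# Made-up, pre-aligned toy sequences.
toy_seqs = {"seqA": "MKTAYIAK", "seqB": "MKTAYLAK", "seqC": "MRT-YLSK"}
rng = random.Random(0)  # fixed seed so repeated runs give the same replicates
replicates = [bootstrap_alignment(toy_seqs, rng) for _ in range(100)]
print(replicates[0])
```

In a full analysis a tree would be inferred from each of the 100 replicates, and the percentage of trees containing a given clade becomes that clade's bootstrap support.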

Redo the "alignments" (on the main MEGA page), but highlighting only the human normal and oncogenic forms

What are the conserved and variable sequences?

How do these relate to protein motif functions?

Submission

Send your answer sheet to me by next week.

Include your Excel spreadsheets if you wish!