Internet Databases II Bert Gold, Ph.D., F.A.C.M.G.

Post on 01-Jan-2016

Page 1: Internet Databases II

Internet Databases II

Bert Gold, Ph.D., F.A.C.M.G.

Page 2: Internet Databases II

The Santa Cruz Website

• The Golden Path

– How Constructed

– Reliance upon the NCBI assembly

– Additional Wash U. data and data from The SNP Consortium

Page 3: Internet Databases II

Santa Cruz Continued

• Add a track, annotation

• Utility of the Browser

• Demonstrating the use of the Browser

• Showing the Repeats, annotation

• Looking at Genes, coding regions and introns

• BLAT
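To make the exon/intron relationship in a gene track concrete, here is a minimal sketch that derives introns as the gaps between consecutive exons. The function name, the example coordinates, and the 0-based half-open convention are assumptions for illustration; nothing here reads real browser data.

```python
def introns_from_exons(exons):
    """Return the gaps between consecutive exons as (start, end) intervals.

    Coordinates are assumed to be 0-based and half-open, so an exon
    (100, 200) covers positions 100..199.
    """
    ordered = sorted(exons)
    introns = []
    # Walk over neighbouring exon pairs and record the gap between them.
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        if next_start > prev_end:  # skip touching or overlapping exons
            introns.append((prev_end, next_start))
    return introns


# Hypothetical three-exon gene (made-up coordinates)
example_exons = [(100, 200), (350, 420), (900, 1000)]
print(introns_from_exons(example_exons))
```

A gene with n non-overlapping exons yields n - 1 introns; coding regions would need the CDS start and end as well, which this sketch leaves out.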

Page 4: Internet Databases II

Polymorphism

• STRs

• Segmental Duplications

• Cytogenetics

• Contiguous Gene Syndromes

• Deletion by ‘illegitimate recombination’ (non-homologous recombination)
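As an illustration of what an STR looks like at the sequence level, the sketch below scans a DNA string for short units repeated in tandem. It is a simple regular-expression approach, not the method of any tool named in these slides; the function name, the default unit sizes (2 to 6 bases), and the minimum of 4 repeats are assumptions.

```python
import re


def find_strs(seq, min_unit=2, max_unit=6, min_repeats=4):
    """Find perfect tandem repeats; return (start, unit, repeat_count) tuples."""
    # Group 1 is the repeat unit (shortest candidate tried first);
    # \1{n,} requires at least n further copies right after it.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        hits.append((match.start(), unit, len(match.group(0)) // len(unit)))
    return hits


# Made-up sequence: six CA repeats and four GATA repeats
example = "GGT" + "CA" * 6 + "TT" + "GATA" * 4 + "CC"
for start, unit, count in find_strs(example):
    print(start, unit, count)
```

Only perfect repeats are found, and a long homopolymer run is reported as a dinucleotide repeat (unit "AA", for example), so real data would need extra filtering.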

Page 5: Internet Databases II

More Polymorphism

• Mapping

• Validating

• The Utility of Spidey

• Acembly

Page 6: Internet Databases II

Automated polymorphism scoring software

• An excursion away from Internet

• Cluster analysis

– Threshold

– QA/QC

– Binning

– Outliers

• SDS 2.0 and Automated calls
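The threshold and binning steps can be sketched for a two-dye assay, where each sample has one intensity per allele. This is an illustrative rule only and is not the algorithm used by SDS 2.0; the function name, the signal threshold, and the 30 and 60 degree bin edges are all assumptions.

```python
import math


def call_genotype(x, y, min_signal=1.0, low_deg=30.0, high_deg=60.0):
    """Call a genotype from allele-1 (x) and allele-2 (y) intensities."""
    # Threshold: samples with too little total signal get no call.
    if math.hypot(x, y) < min_signal:
        return "no call"
    # Binning: the angle from the x axis separates the three clusters.
    angle = math.degrees(math.atan2(y, x))
    if angle < low_deg:
        return "1/1"
    if angle > high_deg:
        return "2/2"
    return "1/2"


# Made-up intensity pairs
for x, y in [(5.0, 0.3), (2.5, 2.6), (0.2, 4.8), (0.1, 0.2)]:
    print(x, y, call_genotype(x, y))
```

Outlier handling and QA/QC (for example, flagging samples that sit close to a bin edge) are left out of this sketch.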

Page 7: Internet Databases II

Sequencing for Polymorphism Detection

• Polyphred

Page 8: Internet Databases II

Microsatellite Software

• GeneScan

• Genotyper

• EXPORT as EXCEL

• IMPORT as VNTR

• How ‘Phenotype’ is ‘Genotype’ for Software developers

Page 9: Internet Databases II

Genome Assembly Programs

• Phred, Phrap, Consed, Autofinish

• Polyphred as a spin-off

• Free: The Staden Package

• http://www.angis.org.au/Staden/staden_home.html

• Free: http://www.generunner.com

• Soren Rasmussen’s DNA Tools: http://www.dnatools.dk

• The Wisconsin Package

• Lasergene

• Sequencher

Page 10: Internet Databases II

Database Management Programs

• Cyrillic http://www.exetersoftware.com/cat/cyrillic.html

• Genomica

• Finch Server

• More Generic

– Microsoft ACCESS

– FoxPro

– FileMaker

– MySQL

• Industrial Strength

– Microsoft SQL

– Oracle

Page 11: Internet Databases II

Linkage analysis Generally

• Jurg Ott’s Lab: http://linkage.rockefeller.edu

• Daniel Weeks’s Lab at Pittsburgh: http://www.hgen.pitt.edu/ (then click through on his name)

• Duke University Center for Human Genetics http://www.chg.duke.edu/software/index.html

Page 12: Internet Databases II

More Linkage Analysis Sites

• Genetic Data Analysis Software (Bruce Weir) http://lewis.eeb.uconn.edu/lewishome/software.html
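The quantity most linkage programs report is the LOD score. As a minimal sketch, assuming the simplest textbook case of phase-known, fully informative meioses, it compares the likelihood of the data at recombination fraction theta against free recombination (theta = 0.5). The function name and example counts are illustrative; the packages at the sites above handle far more general pedigrees.

```python
import math


def lod_score(recombinants, meioses, theta):
    """LOD score for k recombinants observed in n phase-known meioses."""
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must be in (0, 0.5]")
    k, n = recombinants, meioses
    # log10 likelihood under linkage at recombination fraction theta
    log_l_theta = k * math.log10(theta) + (n - k) * math.log10(1.0 - theta)
    # log10 likelihood under no linkage (theta = 0.5)
    log_l_null = n * math.log10(0.5)
    return log_l_theta - log_l_null


# Made-up family data: 1 recombinant in 10 meioses
for theta in (0.05, 0.1, 0.2, 0.3, 0.5):
    print(theta, round(lod_score(1, 10, theta), 3))
```

A LOD score of 3 or more is the conventional threshold for declaring linkage, and the score is 0 by construction at theta = 0.5.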

Page 13: Internet Databases II

Comparative Genomics

• HOMOPHILA: HUMAN TO DROSOPHILA DATABASE

• http://homophila.sdsc.edu/

• JOINT GENOME INSTITUTE (FUGU HOME)

• http://jgi.doe.gov

• FRUITFLY GENOME PROJECT HOME

• http://www.fruitfly.org

Page 14: Internet Databases II

Whole Genome Projects

• Human (www.nhgri.nih.gov)

• Drosophila (www.fruitfly.org)

• Mouse (www.informatics.jax.org)

• Worm (www.wormbase.org)

• Yeast (www-genome.stanford.edu)

• Arabidopsis (www.arabidopsis.org)

• Maize (www.agron.missouri.edu)

Page 15: Internet Databases II

EBI is responsible for human gene annotation (officially)

• Ensembl Project

• Hosted at Sanger Center/EBI

– The Sanger Center itself is at: www.sanger.ac.uk

• Automatic annotation – www.ensembl.org

Page 16: Internet Databases II

How to automate HTML (HTML Forms)

• Function calls in HTML

• PERL

• JAVA

• C

• Applets

• Other possibilities (Mathematica)
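Automating an HTML form usually comes down to sending the same field names and values the browser would send. The slide lists Perl, Java and C; the sketch below uses Python instead, purely for illustration. The endpoint http://example.org/cgi-bin/search and the field names db and term are hypothetical placeholders, not a real service.

```python
from urllib.parse import urlencode


def build_form_url(action, fields):
    """Build the URL a browser would request when submitting a GET form."""
    # urlencode escapes spaces and special characters in the field values.
    return action + "?" + urlencode(fields)


# Hypothetical form with two fields
url = build_form_url(
    "http://example.org/cgi-bin/search",
    {"db": "nucleotide", "term": "BRCA1 AND human"},
)
print(url)
```

Against a real server, the resulting URL could then be fetched with urllib.request.urlopen(url); a form that uses POST would send the encoded fields in the request body instead of the URL.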