biomedical databases & tools rolando garcia-milian [email protected] biomedical &...

17
Biomedical Databases & Tools Rolando Garcia-Milian [email protected] Biomedical & Health Information Services Department Health Sciences Center Library October 3, 2013

Upload: lela

Post on 25-Feb-2016

47 views

Category:

Documents


0 download

DESCRIPTION

Biomedical Databases & Tools Rolando Garcia-Milian [email protected] Biomedical & Health Information Services Department Health Sciences Center Library October 3, 2013. Problem – Rapid Growth of Biomedical data . - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Biomedical Databases & Tools

Rolando [email protected]

Biomedical & Health Information Services DepartmentHealth Sciences Center Library

October 3, 2013

Page 2: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Problem – Rapid Growth of Biomedical data

GenBank Statistics http://www.ncbi.nlm.nih.gov/genbank/genbankstats-2008/

0.00

500,000.00

1,000,000.00

1,500,000.00

2,000,000.00

2,500,000.00

3,000,000.00

3,500,000.00

Samples Submitted to Gene Ex-pression Omnibus Database

Compiled from GEO historic data http://www.ncbi.nlm.nih.gov/geo/summary/?type=history

Page 3: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Compiled by from PubMed http://www.ncbi.nlm.nih.gov/pubmed

1940 1950 1960 1970 1980 1990 2000 2010 20200.00

5,000,000.00

10,000,000.00

15,000,000.00

20,000,000.00

25,000,000.00

Number of Records in PubMed

Biomedical Literature

Problem – Growth of the Biomedical Literature

• Huge volume (PubMed 23132342 citations)

• High diversity

• High quality (peer review)

• Users overwhelmed by long list of search results

• 1/3 of Pubmed queries result in 100 or more citations (Islamaj, 2009)

Page 4: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Querying the biomedical literature becomes more difficult

Medical Subject HeadingsFiltersBoolean operators

Problem – Querying the Biomedical Literature

Page 5: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Alternative Mining Tools for the Biomedical Literature

Main gene queryProtein/gene associated

SynonymMedical terminology

Page 6: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

• Based on Universal Medical Language System

• Repository of semantic predications (subject-predicate-object triples)

• 57.6 million predications from all of PubMed citations (Rindflesch, 2011)

Alternative Mining Tools for the Biomedical Literature

Page 7: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Alternative Mining Tools for the Biomedical Literature

Linked to Entrez Gene database

Page 8: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Workshop- Novel Online Tools for Mining the Biomedical Literature

Page 9: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Case 1 – Few Results in the Biomedical Literature

• Searching for novel genes

Page 10: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Case 2 – Few Results in the Biomedical Literature

• Searching for side effects of drugs: Cerebyx – respiratory failure

Page 11: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Phenotypic information can be used to infer molecular interactions and hinting at new uses of marketed drugs (Campillos, 2008)

Case 2 – Few Results in the Biomedical Literature

Page 12: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Freely Available Up-To-Date Discovery Tools

National Center for Biotechnology Information, USA

European Bioinformatics Institute, UK

Page 13: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Proprietary Tools

Page 14: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Annotation/ Visualization Tools – Genome Browsers

Page 15: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Workshop- Introduction to Genome Browsers

Page 16: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

References

Campillos M*, Kuhn M*, Gavin AC, Jensen LJ, Bork P. Drug target identification using side-effect similarity. Science. 2008 Jul 11;321(5886):263-6. http://www.ncbi.nlm.nih.gov/pubmed/18621671

Islamaj Dogan R, Murray GC, Névéol A, Lu Z. (2009) Understanding PubMed user search behavior. Database (Oxford) http://www.ncbi.nlm.nih.gov/pubmed/20157491

Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Res. 2002 Jun;12(6):996-1006. http://www.ncbi.nlm.nih.gov/pubmed/12045153

Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P. A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol. 2010;6:343. Epub 2010 Jan 19. http://sideeffects.embl.de/drugs/56338/

Rindflesch, T.C. et al. (2011) Semantic MEDLINE: An advanced information management application for biomedicine. Information Services & Use, 31, 15-21. http://lhncbc.nlm.nih.gov/system/files/pub-lhncbc-2011-109.pdf

Page 17: Biomedical Databases & Tools Rolando Garcia-Milian Rolando.milian@ufl.edu Biomedical & Health Information Services Department Health  Sciences Center  Library

Data ResourcesData Management Resources, UF Libraries: http://library.ufl.edu/datamgmt

Surveys on Your Needs• UF Research Data Needs Assessment:

http://bit.ly/UFdatasurvey

• Big Data, Little Data Workshop Evaluation:  http://bit.ly/dataeval