automatic concept indexing and classification for improved retrieval in the hazardous substances...

15
Automatic Concept Indexing and Classification for Improved Retrieval in the Hazardous Substances Data Bank Doszkocs, Tamas; Chang, Hua Florence; Aronson, Alan; Thomas, Phillip, National Library of Medicine Wilder, Dean, MSD, Inc. Zamora, Antonio, Consultant

Upload: salma-pinkett

Post on 15-Dec-2015

219 views

Category:

Documents


1 download

TRANSCRIPT

Automatic Concept Indexing and Classification

for Improved Retrieval in the

Hazardous Substances Data Bank

Doszkocs, Tamas; Chang, Hua Florence; Aronson, Alan; Thomas, Phillip,

National Library of Medicine 

Wilder, Dean, MSD, Inc. Zamora, Antonio, Consultant 

Hazardous Substances Data Bank Search

HSDB “radon” record for query “small-cell carcinomas”

HSDB Automatic MeSH Indexing

HSDB MeSH Indexing and Search Prototype

“small-cell carcinomas” query result

Focus Results by MeSH Headings

View Results by MeSH headings

Narrow Search by “air pollutants, radioactive”

View Results by Semantic Types (MeSH)

Radon and “Air pollution, indoor”

“Radon” HSDB Record

Future Plans

• Scale HSDB Automatic UMLS/MeSH Indexing Experiment

• Refine MetaMap Selection Algorithm• Utilize Noun Phrase Parser(s) for non-

UMLS/MeSH Concepts• Incorporate AI Algorithms• Test/Evaluate• Integrate into TOXNET Production System

Automatic Concept Indexing and Classification

for Improved Retrieval in the

Hazardous Substances Data Bank

Doszkocs, Tamas; Chang, Hua Florence; Aronson, Alan; Thomas, Phillip,

National Library of Medicine 

Wilder, Dean, MSD, Inc. Zamora, Antonio, Consultant