development of a semantic-based search system for immunization knowledge li-hui lee 1, hsing-yi chu...

18
Development of a Semantic-based Search System for Immunization Knowledge Li-Hui Lee 1 , Hsing-Yi Chu 2 , Der-Ming Liou 3 1 Institute of Public Health, 2 Institute of Health Informatics and Decision Making 3 Institution of Biomedical Informatics National Yang-Ming University, Taiwan (R.O.C) 21 August, 2013

Upload: damian-bryan

Post on 29-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Development of a Semantic-based

Search System for Immunization

KnowledgeLi-Hui Lee1, Hsing-Yi Chu2, Der-Ming Liou3

1 Institute of Public Health,2 Institute of Health Informatics and Decision Making

3 Institution of Biomedical Informatics National Yang-Ming University, Taiwan (R.O.C)

21 August, 2013

2

• Introduction• Research Aim• Methods• Results• Limitations and Future Work• Conclusion and Contribution

Outline

3

Introduction

• Children’s Immunization Information Search– Keyword-based Search

Google, Yahoo, Microsoft Bing, Taiwan CDC, BabyHome, NTUH, Everyday Health, etc.

Popular keyword combinations

– Semantic-based SearchEnglish and Simplified Chinese onlySemantic-based search in Traditional Chinese is

needed

4

Research Aim

• To develop an integrated immunization information system including English and Traditional Chinese information search.

• To compare the search results between our system and other search tools

5

Methods

• System Functionalities and Architecture– Immunization ontological structure

design– Flowchart of search phrases pre-

processing and execution

• System Evaluation

6

Jena Java API

System Functionalities and Architecture

7

Data Used for Building Immunization Ontology

• Ontology Data Source– Keywords got from CDC Children

Healthcare Handbook• Vaccine information for public • Related disease information for public• Inoculation documents

– Relations and synonyms from UMLS metathesaurus• Vaccine• Related Disease

Flowchart of Search Phrases Pre-processing and

Execution

8

to lowercase

Found or Not FoundYESYES

NONO

System Evaluation

9

• The immunization ontology design• System design and performance

– Execution time– Relevance of results– Completeness of results– Synonym searches

• 31 users were invited to use the system and provide feedback and suggestions for improvement.

Immunization Ontological Structure

10

Prevented_By

Prevents

Is_A

rdfs: subClassOfowl: ObjectProperty

Is_A

Disease SymptomVaccine

Live Attenuated Vaccine Killed Vaccine

Class Lengend

AgeIs_A Is_A

Point of Time Period

Occurs_In Related_To

System Design: Information Search

11

System Design: Semantic-based Search Results

12

Comparisons among Search Tools using the Term “BCG”

13

ComparisonsThe

StudyHealthMash

Google

Yahoo

Microsoft Bing

Execution Time of Search (microsecond)

188-220 ungettable 80 170 ungettable

The Relevance of Resultswhich result is related to search phrases in the result list.

The 1st The 1stThe 7th

The 8th

The 6th

The Completeness of Results

(1)Functions Difference

The Study:The ontological structure is shown in Traditional Chinese.The target resource for search is the same as Google’s search engine. HealthMash:The ontological structure is shown in English.The target resource for search includes limited Taiwanese websites or Chinese articles are more likely in Simplified Chinese, not in Traditional Chinese.

Google:To filter the published date or duration of target resource To present the definition of search phrases from dictionariesTo show the result according to the read difficulty levelYahoo & Microsoft Bing:To configure the number of search results per page

(2) To Show Synonyms Yes No No No No

Comparisons of the Related Search Results Number among Taiwanese

Health Information Websites

14

Websites

Keywords

The Study

Taiwan CDC

BabyHome

Health Education Center of

NTUH

BCG 127,000 1,320 0 22

卡介苗 127,000 1,730 94 15

15

Users’ Feedback and Suggestions

• 31 users– 4 mothers, 8 medical staffs and 19 IT

students

• Usefulness of searched Information– Basic information> related terms

(synonyms and relations) > ontological structure > related articles

• 67.7% respondents prefer to the semantic-based search.

16

Limitations and Future Work

• The relationship sets may restrict the information search target.

• Information written in other languages is excluded from the search.

• The pre-processing does not concern with lengthy search phrases.

17

Conclusion and Contribution

• An Immunization ontology is created to help semantic-based information search.

• Our immunization ontology presented in OWL and referred to UMLS metathesaurus can be easily integrated and applied to other systems.

18

Thanks for Your Attention!

Any Questions?