a massive repository for the national medical knowledge...

28
A Massive Repository for the A Massive Repository for the National Medical Knowledge Bank IEEE S i M St S t IEEE Symposium on Mass Storage Systems San Diego, CA April 19, 2001 Warren Sterling, PhD [email protected]

Upload: others

Post on 01-Apr-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

A Massive Repository for the A Massive Repository for the National Medical Knowledge Bank

IEEE S i M St S tIEEE Symposium on Mass Storage SystemsSan Diego, CAApril 19, 2001

Warren Sterling, [email protected]

Page 2: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Agenda

• National Medical Knowledge Bank (NMKB) Project• NMKB Features• Virtual Conferences• Virtual Conferences• Intelligent Agent-Based Active Learning Framework• Case Finder• Data Warehouse Requirements• Data Warehouse Requirements• Object Relational Database Repository Requirements• Repository Architecture• Summary & Conclusions• Summary & Conclusions

5/24/2009 NCR ConfidentialNCR Confidential

Page 3: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

National Medical Knowledge BankNational Medical Knowledge Bank

An Advanced Technology Program Joint VentureAn Advanced Technology Program Joint VentureAn Advanced Technology Program Joint Venture An Advanced Technology Program Joint Venture

Allegheny-Singer Research Institute, Pittsburgh PAMillennium Healthcare Solutions, Edison NJ

NCR Corporation, Dayton OHMCP Hahnemann University Philadelphia PAMCP Hahnemann University, Philadelphia PAAT&T Government Markets, McLeansville NC

S d bSponsored by:The United States Department of CommerceNational Institute of Standards and Technology

5/24/2009 NCR ConfidentialNCR Confidential

Page 4: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

National Medical Knowledge Bank National Medical Knowledge Bank Features

• Web-based Architecture• Targeted to healthcare practitionersTargeted to healthcare practitioners• Integrated applications

– Virtual Medical Conferences– Nursing TrainingNursing Training– Case-Based Retrieval for Diagnosis/Treatment

Determination– Disease Domain-specific Literature SearchDisease Domain specific Literature Search

• Offers Continuing Education credits• Designed to use an object relational database as a

scalable, parallel data warehouse

5/24/2009 NCR ConfidentialNCR Confidential

, p

Page 5: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

NMKB Web Architecture

Users

LOGIN

CNIDUS WEBCNIDUS WEBAPPLICATION APPLICATION LOGIN

MGMT SERVERSERVER

Continuing Education: Clinician Support:Virtual PresentationsPrograms for CE creditActive Learning

Case FinderEvidence Finder

M diODBC

Text SearchCYBERCASH MediaServer Object/Relational

Database

Text SearchEngine

File SystemCYBERCASH

Page 6: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Virtual ConferencesVirtual Conferences

Page 7: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Brain Attack Conference - Video i h Slid I d i

• Asynchronous,

with Slide Indexing

discretionary viewing

• Lower cost option (travel, time)(travel, time)

• Concept searching

• CME credit, including JE/JIT

• Indexed presentation outlines for fastoutlines for fast navigation

• Streaming id / di ith

5/24/2009 NCR ConfidentialNCR Confidential

video/audio with synchronized slides

Page 8: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Intelligent Agent-based Active Learning Frameworkg

Page 9: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Case Studies

• Delivers personalized, active education experience

• Problem-based learningProblem based learning

• At convenience of student schedule; can complete in multiple sessions

• Certified CNE credit

• Reusable framework; lowers production cost

• Agent technology - lesson planning; student model; tutor; ontology

A li bl h

5/24/2009 NCR ConfidentialNCR Confidential

• Applicable to other domains

Page 10: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Activity Menu

5/24/2009 NCR ConfidentialNCR Confidential

Page 11: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

HPI Interview

5/24/2009 NCR ConfidentialNCR Confidential

Page 12: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Intelligent Agent-based Continuing Education - HPI Video InterviewEducation HPI Video Interview

• Patient interview id i tt

• Patient interview id i ttvideo vignettes

(streaming video)

• Student selects

video vignettes (streaming video)

• Student selects questions and orders them

• Student evaluated

questions and orders them

• Student evaluated on question choice on question choice

5/24/2009 NCR ConfidentialNCR Confidential

Page 13: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Intelligent Agent-based Continuing Education – Review

Patient Chart

5/24/2009 NCR ConfidentialNCR Confidential

Page 14: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

C Fi dCase Finder(Case-based Reasoning)

Page 15: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Target Selector

• Uses Case-BasedReasoning to Find “Similar” Cases

• Eases clinician’s burden of reading and recalling cases

• Saves clinician time• Web-based; works

with any ODBC t a y O Cdatabase

• Matches complex data

5/24/2009 NCR ConfidentialNCR Confidential

Page 16: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Neurology Cases #5

5/24/2009 NCR ConfidentialNCR Confidential

Page 17: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Medical Ontologygy

Case Finder uses the Unified Medical Case Finder uses the Unified Medical Language System (UMLS) with >700K concepts and 1 5M concept namesconcepts and 1.5M concept names– allows matching at the concept level

• stroke = brain attack

– identifies relationships between concepts• embolic stroke is a kind of stroke

5/24/2009 NCR ConfidentialNCR Confidential

Page 18: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

What the Warehouse HoldsWhat the Warehouse Holds

• NMKB Content Index for all current and potential content for all applications - based on Dublin Core

• Evidence-based medicine abstracts and “pearls”• All exemplar cases for CaseFinder application• Digital video and slides/images for all virtual events• All content for Active Learning Framework training

modules• Unified Medical Language System ontology• Student history• User data• Potential content for the NMKB

5/24/2009 NCR ConfidentialNCR Confidential

Page 19: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Repository (Database) Requirements

• Store standard datatypes, medical imagery, yp , g y,video, text - up to 2 Gbyte objects

• Support application-specific User Defined F ti fFunctions for:– image analysis and image manipulation– cbr similarity metrics calculation

• Support DICOM3 datatype• Support high concurrent usage• Support terabyte-size databases• Support scalability

5/24/2009 NCR ConfidentialNCR Confidential

Page 20: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

What is an Object Relational Database?Database?

New Data Traditional Data+

Alphanumeric System/User Defined Functions (SDFs/UDFs)System Defined Types (SDTs)

• Integer

• Character

• Audio

• Image

• Word Spotting; Voice Recognition for Identification

• Tumor Classification in MRI Scans; Color • Character

• Date

• Float

• Image

• Video

• Geospatial

;Histogram

• Extraction of Video Segment (start and end points)

• Map/Image Overlay; Distance Between Points; Polygon Overlap

• Document/TextPoints; Polygon Overlap

• Language translation; Word/Phase Matching and Counting

Object Relational Database Table RowjAlphanumeric Attributes “Object” Columns (SDTs)

Char (n) Integer Float Image Audio Video Point Text

NCR Confidential

Char (n) Integer Float Image Audio Video Point TextPatientName

PatientAge

AccountBalance

MRIScans

DoctorComm.

Angio-gram

WorkLocation

TranscribedDr. Comm

Page 21: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Parallel Architecture

Q erClient

Queries (via ODBC)

Massively ParallelProcessor (MPP)

QueryCoordinator

ClientApplication

( )

Result Tuples

ExecutionBYNET

Data Data Data Data

ExecutionSteps

BYNET

Server Server Server Server

CloudCover

...

...LandUse

...

...

CloudCover

...

...LandUse

...

...

CloudCover

...

...LandUse

...

...

CloudCover

...

...LandUse

...

...

5/24/2009 NCR ConfidentialNCR Confidential

Disk Arrays

Page 22: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Type HierarchyType Hierarchy

BaseType

nD Arrayspatial integer char... File ref yp g

1D A 2D R ti l l i t 1D Array 2D Rastercircle polygon point

Text Video Blob Audiopolyline swiss-cheese polygon

5/24/2009 NCR ConfidentialNCR Confidential

Page 23: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Parallel ExecutionParallel Execution

Pipelined Execution Partitioned Execution

AGGAGG

JOINJOIN

SELECT SELECTSELECT SELECT

5/24/2009 NCR ConfidentialNCR Confidential

Page 24: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Execution Architecture

Query LanguageParser

Relational Engine“Complexity”

Q Pl S t C t lOptimizer

Type CheckingComplexity

Query Plan System CatalogsAccess, Cost, Methods

baseAdtType Information

“Intelligence”MethodInvocations

AbstractBase Class

QueryExec.

Point IntegerArray

Scheduler / Operators

5/24/2009 NCR ConfidentialNCR Confidential

Derived ADTs

Page 25: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

CloudCover (date: date, density: ( , y2Draster, instr: string)

Row 1 Row 2 Row n

Row 1

date instrdensity

...

BoundingTile OID of Raster

date instrdensityDISK

...

Raster Header

BoundingBox

TileSize

OID of RasterImage

5/24/2009 NCR ConfidentialNCR Confidential

Page 26: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Sample Spatial Query2 mile radius2 mile radius

Hospital

Query: Find all patients that live within a 2 mile radius of Greenwich Hospital.

Tables: Patients (name:string home:point)

5/24/2009 NCR ConfidentialNCR Confidential

Tables: Patients (name:string, home:point) Hospitals (name:string, location:point)

Page 27: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

Sample Spatial Query2 mile radius

Hospital

Select patients.name from patients, hospitalswhere circle (hospitals location 2)

5/24/2009 NCR ConfidentialNCR Confidential

where circle (hospitals.location, 2)contains patients.home and hospitals.name = “Greenwich”

Page 28: A Massive Repository for the National Medical Knowledge Bankstorageconference.us/2001/presentations/paper-15.pdf · 2014-04-22 · A Massive Repository for the National Medical Knowledge

S d C l iSummary and Conclusions

• The NMKB project sponsored by NIST ATP was successfully completed

• The NMKB will support innovative interactive multimedia enabled medical applications multimedia-enabled medical applications.

• Goal is to commercialize the NMKB or pieces of it. • The data warehouse of a commercial NMKB must be

supported by object relational database technologysupported by object relational database technology• Scalable growth is required as content base grows.• Parallel database operation is required for sophisticated

CBR searching and concurrent access by large numbers CBR searching and concurrent access by large numbers of users.

5/24/2009 NCR ConfidentialNCR Confidential