distributed knowledge-based abstraction, visualization, and exploration of time-oriented clinical...

53
Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time- Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research Center Ben Gurion University, Beer Sheva, Israel & Stanford University, CA, USA

Upload: clifford-austin

Post on 16-Jan-2016

220 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Distributed Knowledge-Based Abstraction, Visualization, and

Exploration of Time-Oriented Clinical Data

Yuval Shahar, M.D., Ph.D.

Medical Informatics Research CenterBen Gurion University, Beer Sheva, Israel

&Stanford University, CA, USA

Page 2: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The Need for Intelligent Integration of Multiple Time-Oriented Clinical Data

• Many medical tasks, especially those involving chronic patients, require extraction of clinically meaningful concepts from multiple sources of raw, longitudinal, time-oriented data– Example: “Modify the standard dose of the drug, if during treatment,

the patient experiences a second episode of moderate anemia that has persisted for at least two weeks”

• Examples of clinical tasks:– Diagnosis

• Searching for “a gradual increase of fasting blood-glucose level”– Therapy

• Following a treatment plan based on a clinical guideline– Quality assessment

• Comparing observed treatments with those recommended by a guideline– Research

• Detection of hidden dependencies over time between clinical parameters

Page 3: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The Need for Intelligent Mediation:The Gap Between Raw Clinical Data and Clinically Meaningful Concepts

• Clinical databases store raw, time-stamped data• Care providers and decision-support applications

reason about patients in terms of abstract, clinically meaningful concepts, typically over significant time periods

• A system that automatically answers queries or detects patterns regarding either raw clinical data or concepts derivable from them over time, is crucial for effectively supporting multiple clinical tasks

Page 4: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The Temporal-Abstraction Task

• Input: time-stamped clinical data and relevant events (interventions)

• Output: interval-based abstractions

• Identifies past and present trends and states

• Supports decisions based on temporal patterns, such as: “modify therapy if the patient has a second episode of Grade II bone-marrow toxicity lasting more than 3 weeks”

• Focuses on interpretation, rather than on forecasting

Page 5: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal Abstraction:The Bone-Marrow Transplantation

Domain

.

0 40020010050

1000

2000( )

100K

150K

( )

•••

• • • ••

• •

•••

Granu-locytecounts

• • •

Time (days)

Plateletcounts

PAZ protocol

M[0] M[1] M[2] M[3] M[1] M[0]

BMT

Expected CGVHD

M[i]= Myelotoxicity (bone-marrow toxicity) of severity Grade i

Page 6: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The Bone-Marrow Transplantation Example, Revisited

Page 7: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Uses of Temporal Abstractions• Therapy planning and monitoring (e.g., to support guideline-based

care)

• Creating high-level summaries of time-oriented medical records

• Supporting explanation modules for a medical DSS

• Representing goals of therapy guidelines for quality assurance at runtime and quality assessment retrospectively; guideline intentions

regarding both the (care provider) process and (patient) outcomes can be captured as temporal patterns to be achieved or avoided

• Visualization and exploration of time-oriented clinical data: the KNAVE project

Page 8: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The Temporal-Abstraction Ontology• Events (interventions) (e.g., insulin therapy) - part-of, is-a relations

• Parameters (measured raw data and derived concepts) (e.g., hemoglobin values; anemia levels) - abstracted-into, is-a relations

• Patterns (e.g., crescendo angina; chronic GVHD) - component-of, is-a relations

• Abstraction goals (user views)(e.g., therapy of diabetes) - is-a relations

• Interpretation contexts (effect of regular insulin) - subcontext, is-a relations

• Interpretation contexts are induced by all other entities

Page 9: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal-Abstraction Output Types

• State abstractions (LOW, HIGH)• Gradient abstractions (INC, DEC)• Rate Abstractions (SLOW, FAST) • Pattern Abstractions (CRESCENDO)

- Linear patterns- Periodic patterns

Page 10: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal-Abstraction Knowledge Types

• Structural (e.g., part-of, is-a relations) - mainly declarative/relational

• Classification (e.g., value ranges; patterns) - mainly functional

• Temporal-semantic (e.g., “concatenable” property) - mainly logical • Temporal-dynamic (e.g., interpolation functions) - mainly probabilistic

Page 11: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Dynamic Induction of Contexts

+2W+4W

AZT-toxicity interpretation context

AZT administration

ss

ee

sees

Page 12: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Induction of Interpretation Contexts

+2W+4W

CCTG-522_AZT-toxicity interpretation context

AZT-administration event

(a)

+4W+-6W

-2W

Hepatitis B

HB prodrome Chronic active hepatitis

(b)

CCTG-522 protocol

Page 13: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The Benefits of Interpretation Contexts

• Context intervals serve as a frame of reference for interpretation of clinical data; abstractions are meaningful only in a particular clinical context

• Context intervals focus and limit the computations to only those relevant to a particular context

• Contexts enable the use of context-specific medical knowledge• contexts support maintenance of several concurrent views (or

interpretations) of the data• Facilitate maintenance of clinical knowledge

– The same context-forming entity (e.g., a hepatitis episode) can induce several clinical contexts

– The same context (e.g., “a chemotherapy effect”) might be induced by several entities (e.g., multiple medication types)

Page 14: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Local and Global Persistence Functions

1

0

I1 I2

t

1 2

th

Time

Bel()

Page 15: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temperature

Hemoglobin Level

Linear Component

Week 2 Week 3Week 1

Anemia

Fever Fever

Anemia Anemia

FeverFever

Anemia

Fever

Linear ComponentLinear Component Linear Component

Periodic Pattern

Abstraction of Periodic Patterns

Page 16: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The RÉSUMÉ System Architecture

Temporal-abstraction mechanisms

Temporal fact baseE v e n t s

C o n t e x t s

A b s t r a c t e d i n t e r v a l s

P r i m i t i v e d a t a •

Domain knowledge base

Event ontology

Parameter ontology

Primitive data

Events

••

Context ontology

External patient database

+ +

+

•+

Page 17: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Test Domains for the RÉSUMÉ System

• Medical domains:– Guideline-based care

• AIDS therapy

• Oncology

– Monitoring of children’s growth

– Therapy of insulin-dependent diabetes patients

• Non-medical domains:– Evaluation of traffic-controllers actions

– summarization of meteorological data

Page 18: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Acquisition of Temporal-Abstraction Knowledge

Page 19: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Editing The Ontology Using Protégé Tools

Page 20: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Knowledge-Based Visualization andExploration of Time-Oriented Data:

The KNAVE-I and KNAVE-II Projects

(Shahar and Cheng, 1999, 2000; Shahar et al., 2003)• KNAVE = Knowledge-Based Navigation of Abstractions for Visualization and Explanation

• Interactive queries regarding both raw data and multiple levels of time-oriented abstractions derivable from these data

• Visualization and manipulation of query results• Dynamic exploration of the results using the domain’s temporal-

abstraction ontology• The semantics of all operators do not depend on any specific

domain, but the interface uses each domain’s ontology to compute and display specific terms and explore their relations

• KNAVE accesses the data through the IDAN temporal-abstraction mediator, which uses the ALMA system, a constraint-based re-implementation of RESUME, for temporal-reasoning

Page 21: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The IDAN Temporal-Abstraction Mediator(Boaz and Shahar, 2003)

Temporal-Abstraction

Controller

Knowledge- acquisition tool

Standard Medical Vocabularies Service KNAVE-II

Knowledge Service

Temporal-Abstraction

Service (ALMA)

Data Access Service

Medical Expert

Clinical User

Page 22: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The KNAVE-II Browsing and Exploration Interface[Shahar et al., AIM 2006]

Overall pattern

Raw clinical data

Intermediate abstractionsMedical

knowledge browser

Concept search

Page 23: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Moving Data Panels Around

Page 24: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Global Temporal-Granule Zoom (I)

Page 25: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Global Temporal-Granule Zoom (II)

Page 26: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Global Calendar-Based Zoom

Page 27: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Global Content-Based Zoom (I)

Page 28: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Global Content-Based Zoom (II)

Page 29: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Local Time-Sensitive Zoom

Page 30: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Exploration Operators• Motion across semantic links in the domain’s knowledge base by

using the semantic explorer; in particular, relations such as:

- part-of - is-a - abstracted-from - subcontext • Motion across abstraction types: state, gradient, rate, pattern

• Application of aggregation operators such as mean and distribution

• Dynamic change of temporal-granularity (e.g., days, months)

• Explanation by context-sensitive display of relevant knowledge

• “What- if” queries allow hypothetical assertion or retraction of data and examination of resultant patterns

Page 31: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Semantic Exploration of Temporal Abstractions

Page 32: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Explanation: A Classification Function

Page 33: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Explanation: A Persistence Function

Page 34: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Functionality and Usability Evaluation of KNAVE-II(Palo Alto Veterans Administration Health Care System)

Eight clinicians with varying medical/computer use backgrounds A second study used six additional clinicians and more difficult queries

Each user was given a 15 minute demonstration of the interface and two warm-up queries to answer

The evaluation used an online database of more than 1000 bone-marrow transplantation patients followed for 2 to 4 years

Each user was asked to answer 10 queries common in oncology protocols, about individual patients, at increasing difficulty levels

A cross-over study design compared the KNAVE-II module versus two existing methods (in the 2nd study, control-group users chose which one): Paper charts An electronic spreadsheet (ESS)

Measures: Quantitative: time to answer and accuracy of responses Qualitative: the Standard Usability Score (SUS) and comparative ranking

Page 35: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The KNAVE-II Evaluation Results(Martins et al., MEDINFO 2004; AIM 2008)

• Direct Ranking comparison: KNAVE-II ranked first in preference by all users • Detailed Usability Scores: The Standard Usability Scale (SUS) mean scores:

KNAVE-II 69, ESS 48, Paper 46 (P=0.006) (more than 50 is user friendly)• Time to answer:

– In the first study, users were significantly faster using KNAVE-II as the level of difficulty increased, up to a mean of 93 seconds difference versus paper, and 27 seconds versus Excel, for the hardest query (p = 0.0006)

– The second evaluation, using more difficult queries and more advanced features of KNAVE-II, emphasized the differences even further: The comparison with Excel showed a similar trend for moderately difficult queries (P=0.007) and for hard queries (p=0.002); on the average, study participants answered each of the two hardest queries 277 seconds faster using KNAVE than when using the ESS

• Correctness: – In the first study, using KNAVE-II significantly enhanced correctness versus

using paper, especially as level of difficulty increased (P=0.01) – In the second study, 91.6% (110/120) of all of the questions asked within

queries of all levels produced correct answers using KNAVE-II, versus only 57.5% (69/120) using the ESS (p<0.0001); the correctness scores for KNAVE-II versus the ESS were significantly higher for all queries

Page 36: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The VISITORS System:Interactive Exploration of Multiple Patients

• Supports, among other types, aggregation queries:• Who [Patients] Query

– Find all male patients who were older than 50 and were treated by Dr. Johnson; and, starting within the two weeks following a BMT procedure, had a one week episode of WBC_State whose value was lower than “Normal”

– Show all patients whose HGB_State can be interpreted as “moderate anemia” during more than 5% of February 2006

• When [Time intervals] Query– Select time intervals during which more than 10% of the specific

patients have HGB state value greater than “normal”

Page 37: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

• Demographic Constraints:

– Young (age≤20) or Old (age≥70) Male patients

The VISITORS Query-Builder InterfaceThe VISITORS Query-Builder Interface

Page 38: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

• Knowledge-based constraints– Hemoglobin [HGB] State was abstracted as Normal or higher for at least seven

days after the two weeks period starting from the allogenic bone marrow transplantation,

– WBC counts were abstracted as Gradient = Increasing during the same period

Page 39: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The VISITORS Main Display InterfaceThe VISITORS Main Display Interface

Multiple-subjects raw dataMedical knowledge

browser

Subject groups

Distribution of derived patterns over time

Concept search

Page 40: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The VISITORS System: Zooming into a PanelThe VISITORS System: Zooming into a Panel

Page 41: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Modifying Display Parameters for a Raw-Data Concept

Page 42: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Displaying the Distribution of an Abstract Concept over Time

Page 43: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Interactive Temporal Data Mining:Temporal Association Charts

[Klimov & Shahar, IDAMP 2007]Abstractions for the same subject group are connected; support and confidence indicated by width and hue

The data of each subject are

connected by a line

Page 44: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal Association Charts: Support and Confidence

Links

Three data mining parameters displayed for each temporal association link:

support = 55.60% of patient group

confidence = 61.00% probability

actual number of patients = 25

Page 45: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal Association Charts:Direct Manipulation Using a Time-Value Lens

Current WBC minimal value 1.4 103 cells/ml

91% of patients have the “moderately_low” value of the HGB_STATE_BMT

Page 46: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

New WBC minimal value 5.43 103

cells/ml

Now only 44.4% of patients have the “moderately_low” value of the HGB_STATE_BMT

Temporal Association Charts:Direct Manipulation Using a Time-Value Lens (Cont)

Page 47: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal Association Charts: Using a Relative Time Line

Time line is

relative to the BMT_Al procedure

Page 48: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal Association Charts: Using a Relative Time Line (Cont)

During the first month after BMT_Al

During the second month after the BMT_Al procedure

Page 49: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Temporal Association Charts: Using a Relative Time Line (Cont)

1st month1st month2nd month 2nd month

Page 50: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Adding a New Clinical Database to The IDAN Temporal Mediator Architecture

• Due to local variations in terminology and data structure, linking to a new clinical database requires creation of– A term-mapping table– A unit-mapping table– A schema-mapping table

• The mapping tools use a vocabulary-server search engine that organizes and searches within several standard controlled medical vocabularies (ICD-9-CM , LOINC, CPT, SNOMED, NDF)

• Clinical databases are mapped into the standard terms and structure that are used by the clinical knowledge base, thus making the knowledge base(s) highly generic

Page 51: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

The LOINC Server Search Engine

Page 52: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

LOINC Search Results

Page 53: Distributed Knowledge-Based Abstraction, Visualization, and Exploration of Time-Oriented Clinical Data Yuval Shahar, M.D., Ph.D. Medical Informatics Research

Summary:Abstraction,Visualization, & Exploration

of Time-Oriented Clinical Data

• Automated, intelligent interpretation of longitudinal clinical data is useful for monitoring, therapy planning, data summarization and visualization, explanation, and quality assessment

• Interactive query, visualization, and exploration requires runtime access to the medical domain’s ontology

• The visualization and exploration semantics are specific to the intelligent-exploration task, but not to any particular medical domain