scoring systems in the intensive care unit and time series ... · scoring systems in the intensive...

48
Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital Health Winter 2018

Upload: trannguyet

Post on 12-Aug-2019

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Scoring Systems in the Intensive Care Unit and Time Series Preprocessing

Orhan Konak

Data Management for Digital Health

Winter 2018

Page 2: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Where are we?

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

2

DataSources

DataFormats

Oncology Nephrology Intensive CareAdditional

Topics

Dat

a M

anag

em

en

t&

Fo

un

dat

ion

sR

eal

-wo

rld

U

se C

ase

s

BiologyRecap

BusinessProcesses

Processingand Analysis

Page 3: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Recap

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

3

ICU

• Patients with life threatening illness

• Supporting failing organ systems

• Highly specialized environment

Equipment

• Organ support equipment

• Intravenous lines, feeding, suction, drains, catheter

• Constant monitoring of bodily functions

Monitoring

• Observing vital signs

• Key to improve patients survival

• Generates data like waveforms

http://telemedicinamorsch.blogspot.com/2018/11/como-colocar-eletrodos-no-paciente-para.html

Page 4: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Patient Data Management System (PDMS)

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

4

■ Document vital parameters sampled by monitors

■ Demands on PDMS have increased immensely

■ PDMS are currently expected to assist clinicians at every level of

intensive care, e.g.

□ Strategic level of physician orders and prescriptions

□ Operational level

□ Administrative level

https://www.getinge.com/de/Produktkatalog/metavision-perfusion/

Page 5: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Does Introduction of a PDMS Improve the ICU?

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

5

■ PDMS implementation costs range from 15k to 20 kEUR per

bed

■ Costs and revenues increased continuously over the years

■ No clear evidence for cost savings after the PDMS introduction

■ PDMS has resulted in better patient outcomes

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3847636/

https://www.imd-soft.com/

Page 6: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

PDMSIT Infrastructure

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

6

htt

ps:/

/spectr

um

.ieee.o

rg/t

he-

hum

an-

os/b

iom

edic

al/

dia

gnostics/i

n-

hospital-

inte

nsiv

e-c

are

-units-a

i-could

-pre

dic

t-w

hic

h-p

atients

-are

-likely

-to-d

ie

htt

ps:/

/t3.f

tcdn.n

et/

jpg/0

0/6

5/6

3/8

2/2

40_F_65638294_K7fT

E7w

Pn

pO

kYjs

SBqXA3zVAXTQ

rbcdm

.jpg

Patient Data Management System

HIS

htt

ps:/

/ww

w.g

etinge.c

om

/de

/Pro

duktk

ata

log/m

eta

vis

ion-

perf

usio

n/

htt

ps:/

/ww

w.c

linic

alp

ain

advis

or.

com

/tra

um

a-p

ain

/spect-

vs-m

ri-

to-d

iagnose-

tbi/

art

icle

/579126/

htt

ps:/

/ww

w.s

hirtlabo

r.de/m

otiv-

gale

rie/k

arr

iere

-beru

f/bio

loge.h

tml

Page 7: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Scoring Systems in the ICU

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

7

■ Scoring system as clinical decision support

■ Severity scales important to predict

□ Patient outcome,

□ Comparing quality-of-care, and

□ Stratification for clinical trials.

■ Essential part of improvement in clinical decisions and in

identifying patients with unexpected outcomes

■ Using logistic regression models

https://www.digitalhealth.net/2017/02/papworth-hospital-goes-paperless-icu-metavision/

Page 8: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Scoring Systems in the ICU

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

8

■ Scoring system usually comprises of two parts

□ a score (a number assigned to disease severity) and

□ a probability model (equation giving the probability of

hospital death of the patients)

Number

Probability model

Score

P 𝑨

A =10

Page 9: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Types of Scoring SystemsCommonly Used Adult ICU Scoring Systems

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

9

■ First-day scoring systems

□ Acute Physiology and Chronic Health Evaluation (APACHE)

□ Simplified Acute Physiology Score (SAPS)

□ Mortality Prediction Model (MPM)

■ Repetitive scoring systems

□ Organ System Failure (OSF)

□ Sequential Organ Failure Assessment (SOFA)

□ Organ Dysfunction and Infection System (ODIN)

□ Multiple Organ Dysfunction Score (MODS)

□ Logistic Organ Dysfunction (LOD)

http://scoringexpert.pl/2017/01/01/model-scoringowy-troche-teorii/

Page 10: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Severity Scores in Medical & Surgical ICUTimeline

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

10

•APACHE

•SAPS

•APACHE II

1980-85

•SAPS II

•MPM

1986-90•APACHE III

•MODS

•MPM II

•ODIN

1990-95

•SOFA

•CIS

1996-2000•SAPS III

•APACHE IV

2000-current

Page 11: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Glasgow Coma Score (GCS)Let’s Take a Closer Look at One Score

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

11

■ Neurological scale

■ Give a reliable and objective way of recording the conscious

■ Initially used to assess a person's level of consciousness after a head

injury

■ Now used by first responders, EMS, nurses, and doctors

■ Part of several ICU scoring systems, including APACHE II, SAPS II, and

SOFA https://nurse.org/articles/glasgow-coma-scale/

Page 12: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

GCSCalculation

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

12

Eye Opening Response

•4 Spontaneously

•3 To speech

•2 To pain

•1 No response

Verbal Response

•5 Oriented to time, person and place

•4 Confused

•3 Inappropriate words

•2 Incomprehensible sounds

•1 No response

Motor Response

•6 Obeys command

•5 Moves to localized pain

•4 Flex to withdraw from pain

•3 Abnormal flexion

•2 Abnormal extension

•1 No response

Behavior Response

Total Score

Mild 13 – 15

Moderate 9 – 12

Severe 3 – 8

htt

ps:/

/gunnar.

com

/htt

ps:/

/melibeeglo

bal.com

/blo

g/2

015/0

1/t

he-l

ack-o

f-non-

verb

al-

com

munic

ation-i

n-a

-dig

ital-

world/

htt

p:/

/ww

w.isfs

port

s.o

rg/

gym

nastics

Page 13: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

GCSCalculation

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

13

https://codehealth.io/library/article-1/glasgow-coma-scale/

E

V

M

Page 14: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

GCSExample 1

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

14

■ Infant, moves spontaneously towards objects and follows them, smiling and orienting

towards interesting sounds. The infant opens the eyes spontaneously.

E V M

4 5 6 15 https://www.thompsons-scotland.co.uk/serious-head-and-brain-injury/brain-injury-solicitors-scotland/brain-injury-claims-and-the-glasgow-coma-scale

Page 15: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

GCSExample 2

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

15

■ Adult, moves the hand away when applying pressure on the nail bed. The patient can make

words but not form sentences. The patient opens the eyes to pain, but not to speech.

E V M

2 3 4 9 https://www.thompsons-scotland.co.uk/serious-head-and-brain-injury/brain-injury-solicitors-scotland/brain-injury-claims-and-the-glasgow-coma-scale

Page 16: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

GCSExample 3

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

16

■ Adult, moves hand towards head when applying pressure above the eye socket. The patient

is disoriented but able to form sentences. The patient opens the eyes in response to speech.

E V M

3 4 5 12 https://www.thompsons-scotland.co.uk/serious-head-and-brain-injury/brain-injury-solicitors-scotland/brain-injury-claims-and-the-glasgow-coma-scale

Page 17: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Acute Physiology And Chronic Health Evaluation II (APACHE II)Calculation - Patient’s Age and 12 Routine Physiological Measurements

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

17http://www.scymed.com/en/smnxpw/pwfbd770.htm

AaDO2 or PaO2

(depending on FiO2)

Temperature (rectal)

Mean arterial pressure

pH arterial Heart rate

Respiratory rate

Sodium (serum)

Potassium (serum)

Creatinine

Hematocrit

White blood cell count

Glasgow Coma Scale

Page 18: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Comparison of ICU Scoring

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

18

ICU Scoring System Timing of data collected

Physiological values

Other required data Total data elements required

Original reported mortality prediction performance

SAPS III Prior to and within 1 hour of ICU admission

10 Age, six chronic health variables, ICU admission diagnosis, ICU admission source, LOS prior to ICU admission, emergency surgery, infection on admission, four variables for surgery type

26 AUC = 84.8% (n=16,784)

APACHE IV First ICU day (16-32 h depending on time of admission)

17 Age, six chronic health variables, ICU admission diagnosis, ICU admission source, LOS prior to ICU admission, emergency surgery, thrombolytic therapy, Fio2, mechanical ventilation

32 AUC = 88.0% (n=52,647)

MPM III Prior to and within 1 hour of ICU admission

3 Age, three chronic health variables, five acute diagnosis variables, admission type (e.g., medical-surgical) and emergency surgery, CPR within 1 h of ICU admission, mechanical ventilation, code status

16 AUC = 82.3% (n=50,307)

https://www.researchgate.net/figure/A-comparison-of-intensive-care-unit-ICU-scoring-systems-from-47-with-permission_tbl1_273059579

Page 19: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Which Score to Use?

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

19

■ APACHE, SAPS, MPM → only of historic significance

■ APACHE II →most widely used in USA

■ SAPS II → commonly used in Europe

■ APACHE III → not in public domain

■ SAPS III, APACHE IV → better design

■ MODS, LODS → uncommonly used ?

Page 20: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

The Ideal Scoring System

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

20

■ On the basis of easily/routinely recordable variables

■ Well calibrated

■ Applicable to all patient populations

■ Can be used in different countries

■ The ability to predict functional status or quality of life after ICU discharge

No scoring system currently incorporates all these features

Page 21: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Taking Off the Physician’s Glasses

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

21

Everything’s kind of blurry!

But I'm seeing things very clearly!

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/D

r._N

ick

https://www.brillenecke.eu/sitemap/

Page 22: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Selected Data SourceElectrocardiography (ECG)

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

22

Hospital Intensive Care Unit Monitoring

ECG Signal

?

htt

ps:/

/blo

g.u

ltim

o.c

om

/de/a

ktu

elles/o

ptim

ieru

ng-u

ltim

o-g

o/

htt

p:/

/str

ata

xrt

.com

/diffe

rent-

radia

tion-

thera

py-t

ypes/

htt

ps:/

/ww

w.w

ebm

d.c

om

/cancer/

vital-

sig

ns-

monitor#

1

https://en.wikipedia.org/wiki/Electrocardiography

Page 23: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Function of ECG

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

23

https://en.wikipedia.org/wiki/Electrocardiography

■ Process of recording the electrical activity of the heart

■ Electrodes placed over the skin

■ Electrodes detect the tiny electrical changes on the skin

■ Commonly performed to detect any cardiac problems

https://www.philips.de/healthcare/product/

Page 24: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Simplified ECG Signal Chain

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

24

Amplifier FilterAnalog-to-digital

converter

https://de.banggood.com/AD8232-Measurement-Pulse-Heart-Monitoring-Hearbeat-Sensor-Module-for-Arduino-Monitor-Devices-p-1402651.html?akmClientCountry=DE&gmcCountry=DE&currency=EUR&createTmp=1&utm_source=googleshopping&utm_medium=cpc_bgcs&utm_content=zouzou&utm_campaign=pla-deg-label3-0-30-pc&ad_id=324664290179&cur_warehouse=CN

https://www.boehm-elektromedizin-gmbh.de/shop/neue-produkte/philips-intellivue-mx550/

Page 25: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Amplifier

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

25https://www.adinstruments.com/tips/data-quality

■ Electronic device that increases the power of a signal. It does this by taking energy

from a power supply and controlling the output to match the input signal shape but

with a larger amplitude

https://www.conrad.de/de/conrad-components-stereo-verstaerker-bausatz-9-vdc-12-vdc-18-vdc-20-w-2-115592.html

Page 26: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Electronic Filter

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

26https://www.allaboutcircuits.com/technical-articles/an-introduction-to-filters/

■ Circuits which perform signal processing functions

■ Remove unwanted frequency components from the signal

■ Include the low-pass filter, the high-pass filter, the band-pass filter, and

the notch filter (or the band-reject or band-stop filter)

https://de.banggood.com/MAX262-Programmable-Filter-Bandpass-Band-Resistant-All-Pass-Low-Pass-High-Pass-p-1382154.html?akmClientCountry=DE&gmcCountry=DE&currency=EUR&createTmp=1&utm_source=googleshopping&utm_medium=cpc_bgcs&utm_content=zouzou&utm_campaign=pla-deg-ele-pc&cur_warehouse=CN

Page 27: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Analog-to-Digital Converter (ADC)

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

27

http://qqtrading.com.my/electrocardiogram-sensor-ecg-heart-rate-monitor-ad8232

■ Converts an analog voltage to a digital number

■ Converts the output data into a series of digital values by

approximating the signal with fixed precision

■ Detecting binary signals: is the button pressed or not? These

are digital signals

■ Converts voltage as a binary – 0 or 1

Page 28: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Sampling RateNyquist–Shannon Sampling Theorem

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

28

https://www.adinstruments.com/tips/data-quality

■ The minimum rate at which digital

sampling can accurately record an analog

signal is known as the Nyquist Frequency,

which is double the highest expected

signal frequency

■ Nyquist frequency = 2 x highest expected

frequency

■ E.g. you are recording ECG in humans that

has components which can reach up to 50

Hz as their highest expected frequency, so

the minimum sampling rate should be 100

Hz

Page 29: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Data Acquisition

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

29http://www.buykorea.org/product-details/SimDAQ-KIT--Biosignal-DAQ-8-channel-24bit-ADC-Isolation-DAQ-USB-DAQ--3107900.html

■ Biological signals recorded via a data acquisition unit (DAQ)

■ Converted to a digital signal by DAQ unit

■ Resulting digital signal is sampled at regular intervals by analysis

software

■ Data stored and displayed on computer

Page 30: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

What is a Time Series?

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

30

■ A time series is a collection of observations made sequentially in time

15:51:00 1 25.175015:51:01 1 25.225015:51:02 2 25.250015:51:03 3 25.250015:51:04 4 25.275015:51:05 5 25.325015:51:06 6 25.350015:51:07 7 25.350015:51:08 8 25.350015:51:09 9 25.3500

15:52:40 100 24.750015:52:41 101 24.755015:52:42 102 24.760015:52:43 103 24.770015:52:44 104 24.760015:52:45 105 24.5500

http://didawiki.cli.di.unipi.it/lib/exe/fetch.php/dm/time_series_2017.pdf

Time

Time series ID

Value

Time Series?

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/D

r._N

ick

Page 31: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Time Series are Ubiquitous!

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

31

■ Everything which is on the monitoring

screen (ECG, BP, RR, …)

■ Angela Merkel’s popularity rating

■ The weather in Berlin

■ German Stock Index DAX

People measure things …

… and things change over timehtt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

Page 32: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Time Series

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

32

■ 1 hour of ECG data: 1 GB

■ Typical weblog: 5 GB per week

■ Space Shuttle database: 200 GB and growing

■ Since most of the data lives on disk (or tape), we need

a representation of the data we can efficiently

manipulate

Why is working with time series so difficult?

How do we work with very large databases?

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/D

r._N

ick

Page 33: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

■ Time-series data applications are proliferating

■ As a result time-series databases are in fashion

■ Most adopt a NoSQL model

■ Developers preferred NoSQL to relational databases

for time-series data by over 2:1

■ Reason for adopting NoSQL time-series databases

comes down to scale

Time Series Database

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

33

https://www.percona.com/blog/2017/02/10/percona-blog-poll-database-engine-using-store-time-series-data/

Page 34: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Time Series

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

34

■ Differing data formats

■ Differing sampling rates

■ Noise, missing values, etc.

Why is working with time series so difficult?

Miscellaneous data handling problems

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/D

r._N

ick

Page 35: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Time SeriesLet’s Compare ECG Signals

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

35htt

ps:/

/ww

w.c

vphysio

logy.c

om

/Arr

hyth

mia

s/A

009.h

tm

I’m comparing the curves and

try to find similarities, respectively

abnormalities.

What are you doing there?

Let me show you how to do it.

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/D

r._N

ick

Page 36: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Euclidean Distance MetricComparing to Time Series

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

36

■ Let’s assume we want to compare two time series

About 80% of published work in data mining uses

Euclidean distance

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

http://didawiki.cli.di.unipi.it/lib/exe/fetch.php/dm/time_series_2017.pdf

Page 37: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Preprocessing the Data Before Distance Calculations

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

37

■ 4 most common distortions

□ Offset Translation

□ Amplitude Scaling

□ Linear Trends

□ Noise

Euclidean distance is very sensitive to some “distortions”

in the data. For most problems

these distortions are not meaningful → should remove

them

If we naively try to measure the distance between two

“raw” time series, we may get very unintuitive results

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/D

r._N

ick

Page 38: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Preprocessing the DataOffset Translation

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

38

http://didawiki.cli.di.unipi.it/lib/exe/fetch.php/dm/time_series_2017.pdf

Page 39: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Preprocessing the DataAmplitude Scaling

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

39

■ Zero-mean

■ Unit-variance

■ Widely used for normalization in many machine learning algorithms

http://didawiki.cli.di.unipi.it/lib/exe/fetch.php/dm/time_series_2017.pdf

Page 40: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Preprocessing the DataOffset Translation

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

40

■ Removing linear trend:

□ Fit the best fitting straight line to the

time series, then

□ subtract that line from the time

■ Remove linear trend

■ Removed offset translation

■ Removed amplitude scaling

http://didawiki.cli.di.unipi.it/lib/exe/fetch.php/dm/time_series_2017.pdf

Page 41: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Preprocessing the DataNoise

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

41

The intuition behind removing noise is

Average each data points value with

its neighborshttp://didawiki.cli.di.unipi.it/lib/exe/fetch.php/dm/time_series_2017.pdf

Page 42: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Software Filter

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

42

Low Pass Filter High Pass Filter

https://www.adinstruments.com/tips/data-quality

Page 43: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

■ Fourier showed that any periodic signal s(t) can be written as a sum of sine waves with

various amplitudes, frequencies and phases

■ For example, the Fourier expansion of a square wave can be written as

Discrete Fourier Transform

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

43http://mriquestions.com/fourier-transform-ft.html

Page 44: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Discrete Fourier Transform

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

44

https://de.wikipedia.org/wiki/Joseph_Fourier

=

−=

1

0

2N

n

ni

nkNk

exX

http

://mriq

uestio

ns.c

om

/fourie

r-transfo

rm-ft.h

tml

Fourier series in 1822

Page 45: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Discrete Fourier Transform

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

45

■ Important signal processing tool

■ Used to decompose a signal into its sine and cosine

components

■ Output of the transformation represents the signal in the

Fourier or frequency domain

■ Apply mathematical operations to eliminate certain

frequency domains very easily

■ Applying the inverse Fourier transform to recover the

original time signal

https://slideplayer.com/slide/4173668/

Page 46: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

Summary of Preprocessing

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

46

The “raw” time series may have distortions which we

should remove before clustering, classification

etc.

Of course, sometimes the distortions are the most interesting thing about the data, the above is only a general

rule

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/Pro

fessor_

Frink

htt

ps:/

/en.w

ikip

edia

.org

/wik

i/D

r._N

ick

Page 47: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

What Do We Want to Do with the Time Series Data?

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

47http://didawiki.cli.di.unipi.it/lib/exe/fetch.php/dm/time_series_2017.pdfhttp://amid.fish/anomaly-detection-with-k-means-clustering

Page 48: Scoring Systems in the Intensive Care Unit and Time Series ... · Scoring Systems in the Intensive Care Unit and Time Series Preprocessing Orhan Konak Data Management for Digital

What to take home?

Data Management for Digital Health, Winter 2018

Use Case Intensive Care

48

■ Patient Data Management Systems are currently expected to assist clinicians at every level

of intensive care

■ Currently, decision support via ICU scoring systems

■ Data is generated by sensors

■ Preprocessing of time series

■ Chance to improve decision support with the help of machine learning