extrac’ngfeaturesfrom** human2readablesemistructuredtextelaine/pubs/angelino-structure... ·...

46

Extrac’ng features from humanreadable semistructured text Elaine Angelino 4/16/2012

Upload: phungkiet

Post on 27-Oct-2018

214 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Extrac'ng features from human-‐readable semistructured text

Elaine Angelino 4/16/2012

Page 2: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Outline

•  Mo'va'on: electronic medical records (EMRs) •  What is semistructured data?

•  Extrac'ng structured features – Methods – Evalua'on – Applica'ons – Future work

Page 3: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Outline

•  Mo'va'on: electronic medical records (EMRs) •  What is semistructured data?

•  Extrac'ng structured features – Methods – Evalua'on – Applica'ons

•  Future work

Page 4: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

What can we do with all the data in (electronic) medical records?

Page 5: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

EMR data mining opportuni'es

•  Real-‐'me error detec'on & quality control •  Measure treatment effec'veness “in the wild”

•  Epidemiological trends & outbreaks

•  New hypotheses & correla'ons •  Combine with other data sources – Medical literature, (personal) genomics, Web

*

* With collaborators J. Rassen, S. Schneeweiss P. Wahl (Brigham) and M. Rosen (Regenstrief)

Page 6: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Retrospec've drug effec'veness study

Effec'veness of drug X vs.

drug Y

Treatment data (drug X vs. drug Y) Outcome data (heart a]ack vs. ok)

Protec've < 1.0 Harmful > 1.0

Page 7: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Retrospec've drug effec'veness study

Effec'veness of drug X vs.

drug Y

Treatment data (drug X vs. drug Y) Outcome data (heart a]ack vs. ok)

Protec've < 1.0 Harmful > 1.0

2.19 Crude result

True < 1.0

Page 8: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Clinical trial vs. retrospec've study

•  Randomized clinical trial – Par'cipants are randomly assigned to different courses of treatment

•  Retrospec've study – No control over who gets what, so we must adjust for confounding differences

Page 9: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Example confounder: Age

•  Older pa'ents tend to •  Receive drug X more oden than drug Y •  Tend to have more heart a]acks

•  This makes drug X appear less effec've than drug Y

Page 10: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Effec'veness of drug X vs.

drug Y Confounder adjustment

Pa'ent medical & demographic data

Retrospec've drug effec'veness study

2.19 Treatment data (drug X vs. drug Y) Outcome data (heart a]ack vs. ok)

Protec've < 1.0 Harmful > 1.0

What does this data look like?

Crude result

True < 1.0

?

Page 11: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Example EMR entry: ECG report EMPI: XXXX MRN TYPE: BWH MRN: XXXX REPORT NUMBER: 0000000W REPORT DATE TIME: 01/01/01 14:02 REPORT DESCRIPTION: CARDIOLOGY REPORT STATUS: FINAL REPORT TYPE: EKG REPORT TEXT: Report Number: 0000000W Report Status: FINAL Type: EKG Date: 01/01/01 14:02 Electrocardiogram Report # 00-00000W 01/01/01 14:02 VENT. RATE 99 BPM PR INTERVAL 188 ms QRS DURATION 100 ms QT/QTC 368 472 ms P-R-T AXES 66 -10 95 NORMAL SINUS RHYTHM LEFT VENTRICULAR HYPERTROPHY WITH REPOLARIZATION ABNORMALITY REFERRED BY: AARDVARK, M.D.,ALICE. REVIEWED BY: BEAGLE, M.D.,BOB [report_end]

Page 12: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Example EMR entry: ECG report EMPI: XXXX MRN TYPE: BWH MRN: XXXX REPORT NUMBER: 0000000W REPORT DATE TIME: 01/01/01 14:02 REPORT DESCRIPTION: CARDIOLOGY REPORT STATUS: FINAL REPORT TYPE: EKG REPORT TEXT: Report Number: 0000000W Report Status: FINAL Type: EKG Date: 01/01/01 14:02 Electrocardiogram Report # 00-00000W 01/01/01 14:02 VENT. RATE 99 BPM PR INTERVAL 188 ms QRS DURATION 100 ms QT/QTC 368 472 ms P-R-T AXES 66 -10 95 NORMAL SINUS RHYTHM LEFT VENTRICULAR HYPERTROPHY WITH REPOLARIZATION ABNORMALITY REFERRED BY: AARDVARK, M.D.,ALICE. REVIEWED BY: BEAGLE, M.D.,BOB [report_end]

Structured database fields

Semistructured text blob

Page 13: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Example EMR entry: ECG report EMPI: XXXX MRN TYPE: BWH MRN: XXXX REPORT NUMBER: 0000000W REPORT DATE TIME: 01/01/01 14:02 REPORT DESCRIPTION: CARDIOLOGY REPORT STATUS: FINAL REPORT TYPE: EKG REPORT TEXT: Report Number: 0000000W Report Status: FINAL Type: EKG Date: 01/01/01 14:02 Electrocardiogram Report # 00-00000W 01/01/01 14:02 VENT. RATE 99 BPM PR INTERVAL 188 ms QRS DURATION 100 ms QT/QTC 368 472 ms P-R-T AXES 66 -10 95 NORMAL SINUS RHYTHM LEFT VENTRICULAR HYPERTROPHY WITH REPOLARIZATION ABNORMALITY REFERRED BY: AARDVARK, M.D.,ALICE. REVIEWED BY: BEAGLE, M.D.,BOB [report_end]

Key-‐value pairs

Quan?ta?ve data

Unstructured text

Page 14: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Effec'veness of drug X vs.

drug Y Confounder adjustment

Pa'ent medical & demographic data

Retrospec've drug effec'veness study

Age/sex

2.19

2.05

Treatment data (drug X vs. drug Y) Outcome data (heart a]ack vs. ok)

Structured data

Protec've < 1.0 Harmful > 1.0

Crude result

True < 1.0

Page 15: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured & unstructured text

Retrospec've drug effec'veness study

Age/sex 2.05

? BP, LDL, HDL

Features A

Features B

Features C

?

?

?

Treatment data (drug X vs. drug Y) Outcome data (heart a]ack vs. ok)

Structured data

2.19 Crude result

True < 1.0

Page 16: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Outline

•  Mo'va'on: electronic medical records (EMRs) •  What is semistructured data?

•  Extrac'ng structured features – Methods – Evalua'on – Applica'ons – Future work

Page 17: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured data

•  Peter Buneman, PODS ’97

•  “In semistructured data, the informa'on that is normally associated with a schema is contained within the data, which is some'mes called self-‐describing.”

•  Our view of semistructured data is broader!

Page 18: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured text: Exhibit A

Page 19: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured text: Exhibit A

-‐ Explicit structure -‐  Seman'cally meaningful tags -‐  Not for humans

Page 20: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured text: Exhibit B

Page 21: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured text: Exhibit B

-‐ Li]le structure -‐  HTML for visual formakng -‐  Automa'c extrac'on difficult

Page 22: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Electronic medical records (EMRs) are human-‐readable semistructured text

NORMAL SINUS RHYTHM

NONSPECIFIC T WAVE ABNORMALITY

PROLONGED QT INTERVAL OR T-U FUSION, CONSIDER MYOCARDIAL DISEASE, ELECTROLYTE IMBALANCE, OR DRUG EFFECTS

ABNORMAL ECG

WHEN COMPARED WITH ECG OF 01-JAN-2001 14:02,

T WAVE INVERSION NOW EVIDENT IN LATERAL LEADS

REFERRED BY: AARDVARK, M.D.,ALICE. REVIEWED BY: BEAGLE, M.D.,BOB

[report_end]

The two reports are clearly similar, sharing many of the same data items and formatting structure for key-value pairs, numerical data measurements and the statement, NORMAL SINUS RHYTHM. We can also noticeseveral structural differences:

(1) The first report is missing the value for QT/QTC, which can indicate that the EKG machine was unableto extract this measurement from the raw ECG signal.

(2) In the first report, the P-R-T AXES measurement includes three numerical values but the second reporton has two. The latter is incorrect and an example of human data entry error.

(3) The second report is longer than the first, containing more statements interpreting the EKG results,including statements comparing the two reports.

Based on our own experiences, we think that EMR system designers want to explicitly represent thefine-grained structure of EMR entries but that this is difficult or impractical to achieve with existing, in-usesystems. As evidence, we show an example of a de-identified EMR entry from the [Regenstrief/Indiana]EMR system; it is also an ECG report. The first text block below is a visual representation of the report.In contrast to the Partners EMR system, this text report is represented by XML rather than plain text. Weshow this XML directly below the visual representation.Stationary ECG Study: XXXX University XXXX XXXX XXXX

Test Date: XXXX

XXXX Name: XXXX

Patient XXXX: XXXX Room:

Gender: Male Technician: JG

DOB: XXXX

XXXX Number: XXXX XXXX MD: XXXX XXXX

Intervals Axis

XXXX: 46 P: 32

PR: 132 QRS: -19

QRSD: 112 T: -33

QT: 472

QTc: 448

Interpretive Statements

Sinus bradycardia

T XXXX inversions in leads #, aVF are not XXXX, but are more prominent

compared to prior studies

Subtle ST-T changes elsewhere are nonspecific

Low QRS voltages in precordial leads

Abnormal ECG

Electronically Signed On XXXX XXXX by XXXX XXXX

<text_report>

<text><title></title><p>Stationary ECG Study<br/>XXXX University XXXX<br/>XXXX XXXX</p></text>

<text><title>Test Date</title><p>XXXX<br/></p></text>

<text><title>XXXX Name</title><p>XXXX<br/></p></text>

<text><title>Patient XXXX</title><p>XXXX Room:<br/></p></text>

<text><title>Gender</title><p>Male Technician: JG<br/></p></text>

<text><title>DOB</title><p>XXXX<br/></p></text>

<text><title>XXXX Number</title><p>XXXX XXXX MD: XXXX XXXX</p>

<p>Intervals Axis<br/></p></text>

<text><title>XXXX</title><p>46 P: 32<br/></p></text>

<text><title>PR</title><p>132 QRS: -19<br/></p></text>

<text><title>QRSD</title><p>112 T: -33<br/></p></text>

<text><title>QT</title><p>472<br/></p></text>

<text><title>QTc</title><p>448</p>

<p>Interpretive Statements<br/>

4

Page 23: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured data

Self-‐describing

Focus on wrappers for HTML / XML

Methods depend on <tag> tree

Extract RDBMS records that can be queried

Extract features for informa'on retrieval tasks

Page 24: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured data

Self-‐describing

Focus on wrappers for HTML / XML

Methods depend on <tag> tree

Extract RDBMS records that can be queried

Extract features for informa'on retrieval tasks

Human-‐readable semistructured text

Structure is encoded via visual formakng, e.g., punctua'on & white spaces (no tags!)

Heuris'cs/sta's'cs vs. supervised techniques

Extrac'ng tables from plain text

Visual-‐based methods for Web pages & PDFs

Map mailing addresses to DB records

Page 25: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured data

Self-‐describing

Focus on wrappers for HTML / XML

Methods depend on <tag> tree

Extract RDBMS records that can be queried

Extract features for informa'on retrieval tasks

Unstructured text

NLP techniques -‐ Bags of words -‐ N-‐grams -‐  Domain-‐specific stack

IR techniques -‐  Segmenta'on -‐  Classifica'on -‐  Normaliza'on

Classify text w.r.t. topics or authors, content summariza'on

Recognize en''es, events, rela'onships

Human-‐readable semistructured text

Structure is encoded via visual formakng, e.g., punctua'on & white spaces (no tags!)

Heuris'cs/sta's'cs vs. supervised techniques

Extrac'ng tables from plain text

Visual-‐based methods for Web pages & PDFs

Map mailing addresses to DB records

Page 26: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured data

Self-‐describing

Focus on wrappers for HTML / XML

Methods depend on <tag> tree

Extract RDBMS records that can be queried

Extract features for informa'on retrieval tasks

Unstructured text

NLP techniques -‐ Bags of words -‐ N-‐grams -‐  Domain-‐specific stack

IR techniques -‐  Segmenta'on -‐  Classifica'on -‐  Normaliza'on

Classify text w.r.t. topics or authors, content summariza'on

Recognize en''es, events, rela'onships

Human-‐readable semistructured text

Structure is encoded via visual formakng, e.g., punctua'on & white spaces (no tags!)

Heuris'cs/sta's'cs vs. supervised techniques

Extrac'ng tables from plain text

Visual-‐based methods for Web pages & PDFs

Map mailing addresses to DB records

Page 27: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Outline

•  Mo'va'on: electronic medical records (EMRs) •  What is semistructured data?

•  Extrac'ng structured features – Methods – Evalua'on – Applica'ons – Future work

Page 28: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Example EMR text report Report Number: 0000000W Report Status: FINAL

Type: EKG

Date: 01/01/01 14:02

Electrocardiogram Report # 00-00000W 01/01/01 14:02

VENT. RATE 99 BPM

PR INTERVAL 188 ms

QRS DURATION 100 ms

P-R-T AXES 66 -10 95

NORMAL SINUS RHYTHM

LEFT VENTRICULAR HYPERTROPHY WITH REPOLARIZATION ABNORMALITY

REFERRED BY: AARDVARK, M.D.,ALICE. REVIEWED BY: BEAGLE, M.D.,BOB

[report_end]

Report Number: 0000000W Report Status: FINAL

Type: EKG

Date: 02/02/02 13:25

Electrocardiogram Report for Accession # 00-00000W 02/02/02 13:25

VENT. RATE 76 BPM

PR INTERVAL 154 ms

QRS DURATION 88 ms

QT/QTC 422 474 ms

P-R-T AXES 55 94

NORMAL SINUS RHYTHM

NONSPECIFIC T WAVE ABNORMALITY

PROLONGED QT INTERVAL OR T-U FUSION, CONSIDER MYOCARDIAL DISEASE,

ELECTROLYTE IMBALANCE, OR DRUG EFFECTS

ABNORMAL ECG

WHEN COMPARED WITH ECG OF 01-JAN-2001 14:02,

T WAVE INVERSION NOW EVIDENT IN LATERAL LEADS

REFERRED BY: AARDVARK, M.D.,ALICE. REVIEWED BY: BEAGLE, M.D.,BOB

[report_end]

Fig. 1. Two ECG reports from the Partners HealthCare LMR system. They contain several kinds of structured information and are similar but have some structural differences.

Stationary ECG Study: XXXX University XXXX XXXX XXXX

Test Date: XXXX

XXXX Name: XXXX

Patient XXXX: XXXX Room:

Gender: Male Technician: JG

DOB: XXXX

XXXX Number: XXXX XXXX MD: XXXX XXXX

Intervals Axis

XXXX: 46 P: 32

PR: 132 QRS: -19

QRSD: 112 T: -33

QT: 472

QTc: 448

Interpretive Statements

Sinus bradycardia

T XXXX inversions in leads #, aVF are not XXXX, but

are more prominent compared to prior studies

Subtle ST-T changes elsewhere are nonspecific

Low QRS voltages in precordial leads

Abnormal ECG

Electronically Signed On XXXX XXXX by XXXX XXXX

<text_report>

<text><title></title><p>Stationary ECG Study<br/>XXXX University XXXX<br/>XXXX XXXX</p></text>

<text><title>Test Date</title><p>XXXX<br/></p></text>

<text><title>XXXX Name</title><p>XXXX<br/></p></text>

<text><title>Patient XXXX</title><p>XXXX Room:<br/></p></text>

<text><title>Gender</title><p>Male Technician: JG<br/></p></text>

<text><title>DOB</title><p>XXXX<br/></p></text>

<text><title>XXXX Number</title><p>XXXX XXXX MD: XXXX XXXX</p>

<p>Intervals Axis<br/></p></text>

<text><title>XXXX</title><p>46 P: 32<br/></p></text>

<text><title>PR</title><p>132 QRS: -19<br/></p></text>

<text><title>QRSD</title><p>112 T: -33<br/></p></text>

<text><title>QT</title><p>472<br/></p></text>

<text><title>QTc</title><p>448</p>

<p>Interpretive Statements<br/>

Sinus bradycardia<br/>T XXXX inversions in leads #, aVF are not XXXX, but are more prominent<br/>

compared to prior studies<br/>Subtle ST-T changes elsewhere are nonspecific<br/>

Low QRS voltages in precordial leads<br/>

Abnormal ECG</p>

<p>Electronically Signed On XXXX XXXX by XXXX XXXX</p></text>

</text></text_report>

Fig. 2. Visual rendering and XML representation of an ECG report from the RMRS/INPC EMR system. The XML does not add meaningful structure beyond visual formatting.

4

Regenstrief Medical Record System / Indiana Network for Pa'ent Care (RMRS/INPC)

Visual rendering XML representa'on (!)

Page 29: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Feature extrac'on methods

•  Lightweight, domain-‐agnos'c •  Domain-‐ and language-‐specific

Page 30: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Lightweight feature extrac'on

•  Fast methods requiring li]le or no domain knowledge

•  Example features – Bags of words – Bags of n-‐grams

– Typed tokens – Discre'zed quan'ta've data

Page 31: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Medical text feature extrac'on

•  Natural language processing (NLP) tasks

•  MetaMap sodware by the Na'onal Library of Medicine (NLM)

•  Designed for processing medical text

•  Maps input text to preferred terms

Page 32: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Phrases and cons'tuents

(UMLS) and (8) word-sense disambiguation. The UMLS includes a dictionary of concepts, a thesaurusmapping these concepts onto preferred names and a semantic tree that places these concepts within broadersemantic categories. MetaMap’s different processing steps produce rich output that can be used to extractmany kinds of features. We highlight some of these through examples.

Table 4.4 shows features that come from MetaMap’s lexical and syntactic analysis steps. For the inputphrase ‘‘sinus rhythm has replaced electronic atrial pacemaker’’ (row A), we show how MetaMapbreaks this into phrases (row B) and further into constituents (row C). Each constituent maps to a lexicalcategory (e.g., noun or adjective, row D) and a syntax type (e.g., head or modifier, row E.).

Feature Example

A. Input text sinus rhythm has replaced electronic atrial pacemakerB. Phrases sinus rhythm, has, replaced electronic atrial pacemakerC. Constituents sinus rhythm, has, replaced, electronic, atrial, pacemakerD. Lexical categories (sinus rhythm, noun), (has, aux), (replaced, adj),(matched to constituents) (electronic, adj), (atrial, adj), (pacemaker, noun)E. Syntax types (sinus rhythm, head), (has, aux), (replaced, mod),(matched to constituents) (electronic, mod), (atrial, mod), (pacemaker, head)

Table IV. Examples of features extracted via MetaMap’s lexical and syntactic analysis pipeline.

Table 4.4 gives a few more examples of features we can extract via MetaMap’s lexical and syntacticanalysis. By combining phrase and constituent features, we can for example extract noun phrases, i.e.,phrases containing a head noun plus optional modifiers. We italicize constituents corresponding to headnouns within phrases. Breaking down the inputs into feature vectors of phrases or constituents suggestdifferent ways to compare or align related inputs (rows A-B and C-D).

Input text Phrases identified by MetaMap

A. electronic atrial pacemaker electronic atrial pacemakerB. electronic ventricular pacemaker electronic ventricular pacemakerC. ST \T\ T wave abnormality, consider anterolateral st t t wave abnormality, consider, anterolateral ischemia,

ischemia or digitalis effect or, digitalis effect

Table V. Phrases identified by MetaMap, where we have italicized constituents that are head nouns

Table 4.4 shows features corresponding to MetaMap’s final output for the input phrase ‘‘sinus rhythm’’.The Meta Candidates (rows A) show possible mappings between substrings, e.g., constituents or tokens, andconcepts in MetaMap’s dictionary. Each concept maps to a preferred name via MetaMap’s thesaurus and isa member of a broader semantic category. Each Meta Candidate is given a score, with 1000 beingthe maximum possible score. The Meta Mappings (rows B) represent the final mappings of the entirephrase. Each Meta mapping is given a score, again with 1000 being the maximum.

Output name Concept Preferred name Semantic category Score

A. Meta Candidates Sinus rhythm Sinus rhythm Finding 1000sinus rhythm electrocardiogram rhythm strip 3-lead: sinus rhythm Finding 1000Sinus pathologic fistula Anatomical Abnormality 861SINUS Nasal sinus Body Space or Junction 861Rhythm Rhythm Finding 861Sinus Sinus - general anatomical term Body Space or Junction 861rhythm physiologic rhythm Physiologic Function 861Rhythmicities Periodicity Temporal Concept 761

B. Meta Mappings Sinus rhythm Sinus rhythm Finding 1000sinus rhythm electrocardiogram rhythm strip 3-lead: sinus rhythm Finding 1000

Table VI. Examples of features extracted via MetaMap’s full pipeline for the input phrase ‘‘sinus rhythm’’.

Table 4.4 illustrates MetaMap’s ability to map different inputs to onto the same preferred name (rowsA-C). While some abbreviations are accepted (row C), others are not (rows D-E); notice that the latter

14

Page 33: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Meta Mappings

(UMLS) and (8) word-sense disambiguation. The UMLS includes a dictionary of concepts, a thesaurusmapping these concepts onto preferred names and a semantic tree that places these concepts within broadersemantic categories. MetaMap’s different processing steps produce rich output that can be used to extractmany kinds of features. We highlight some of these through examples.

Table 4.4 shows features that come from MetaMap’s lexical and syntactic analysis steps. For the inputphrase ‘‘sinus rhythm has replaced electronic atrial pacemaker’’ (row A), we show how MetaMapbreaks this into phrases (row B) and further into constituents (row C). Each constituent maps to a lexicalcategory (e.g., noun or adjective, row D) and a syntax type (e.g., head or modifier, row E.).

Feature Example

A. Input text sinus rhythm has replaced electronic atrial pacemakerB. Phrases sinus rhythm, has, replaced electronic atrial pacemakerC. Constituents sinus rhythm, has, replaced, electronic, atrial, pacemakerD. Lexical categories (sinus rhythm, noun), (has, aux), (replaced, adj),(matched to constituents) (electronic, adj), (atrial, adj), (pacemaker, noun)E. Syntax types (sinus rhythm, head), (has, aux), (replaced, mod),(matched to constituents) (electronic, mod), (atrial, mod), (pacemaker, head)

Table IV. Examples of features extracted via MetaMap’s lexical and syntactic analysis pipeline.

Table 4.4 gives a few more examples of features we can extract via MetaMap’s lexical and syntacticanalysis. By combining phrase and constituent features, we can for example extract noun phrases, i.e.,phrases containing a head noun plus optional modifiers. We italicize constituents corresponding to headnouns within phrases. Breaking down the inputs into feature vectors of phrases or constituents suggestdifferent ways to compare or align related inputs (rows A-B and C-D).

Input text Phrases identified by MetaMap

A. electronic atrial pacemaker electronic atrial pacemakerB. electronic ventricular pacemaker electronic ventricular pacemakerC. ST \T\ T wave abnormality, consider anterolateral st t t wave abnormality, consider, anterolateral ischemia,

ischemia or digitalis effect or, digitalis effect

Table V. Phrases identified by MetaMap, where we have italicized constituents that are head nouns

Table 4.4 shows features corresponding to MetaMap’s final output for the input phrase ‘‘sinus rhythm’’.The Meta Candidates (rows A) show possible mappings between substrings, e.g., constituents or tokens, andconcepts in MetaMap’s dictionary. Each concept maps to a preferred name via MetaMap’s thesaurus and isa member of a broader semantic category. Each Meta Candidate is given a score, with 1000 beingthe maximum possible score. The Meta Mappings (rows B) represent the final mappings of the entirephrase. Each Meta mapping is given a score, again with 1000 being the maximum.

Output name Concept Preferred name Semantic category Score

A. Meta Candidates Sinus rhythm Sinus rhythm Finding 1000sinus rhythm electrocardiogram rhythm strip 3-lead: sinus rhythm Finding 1000Sinus pathologic fistula Anatomical Abnormality 861SINUS Nasal sinus Body Space or Junction 861Rhythm Rhythm Finding 861Sinus Sinus - general anatomical term Body Space or Junction 861rhythm physiologic rhythm Physiologic Function 861Rhythmicities Periodicity Temporal Concept 761

B. Meta Mappings Sinus rhythm Sinus rhythm Finding 1000sinus rhythm electrocardiogram rhythm strip 3-lead: sinus rhythm Finding 1000

Table VI. Examples of features extracted via MetaMap’s full pipeline for the input phrase ‘‘sinus rhythm’’.

Table 4.4 illustrates MetaMap’s ability to map different inputs to onto the same preferred name (rowsA-C). While some abbreviations are accepted (row C), others are not (rows D-E); notice that the latter

14

sinus rhythm Input text:

(Higher scores indicate “be]er” mappings)

Page 34: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

MetaMap uses a thesaurus to map synonyms onto the same output

receive lower scores because their Meta Mappings only correspond to part of the input.

Input text Meta Mapping Preferred name Semantic category ScoreA. heart attack Heart attack Myocardial Infarction Disease or Syndrome 1000

B. myocardial infarction Myocardial Infarction Myocardial Infarction Disease or Syndrome 1000

C. myocardial infarct Myocardial Infarction Myocardial Infarction Disease or Syndrome 1000

D. myocardial infarc Myocardial Myocardium Tissue 694

E. myocard infarct Infarct Infarction Pathologic Function 861

Table VII. Examples of MetaMap’s output for similar inputs.

Thus with MetaMap, we can directly produce:

—Bags of phrases or constituents. Notice that in contrast to bi-grams and higher order n-grams, theseare non-overlapping grammatical objects. As before, we can apply optional normalization steps suchas stemming and stop word removal (§4.3.1) to further simplify these bags. Depending on our featureextraction task, we could also use the lexical or syntax information to filter the data, e.g., filter for nounsand adjectives.

—Bags of Meta Candidates or Meta Mappings. We note that each of the Meta Candidates andMappings is associated with a unique code which makes this output easier to work with. We can usethe associated scores and semantic categories to filter this data, e.g., filter for the highest scoring MetaMappings belonging to the “Finding” semantic type.

We note that while MetaMap does not recognize many tokens including numbers, we can combine featuresextracted via MetaMap with our methods for extracting typed data to produce “typed Meta Mappings”.For example, when MetaMap processes the input text ‘‘blood pressure 100/70 mmHg’’, it recognizes theentire input as a single phrase but its Meta Mappings only include the concepts for ‘‘blood pressure’’

and ‘‘mmHg’’, but we can extract the numerical data and bind it to this output to produce a compositefeature, e.g., ((blood pressure, mmHg), (100, 70), (int, int)).

[Some other examples of feature extraction techniques include named entity recognition and

detection, FSA, CFG, etc. sentiment analysis OpenCalais]

4.5 EMR-specific processing details

Here, we highlight some additional details and challenges of working with EMR data.

4.5.1 Horizontally partitioning EMR text reports. In working with data from multiple EMR systems,we found that splitting on line breaks was too naive because many – although not all – regions of verboseunstructured text have line breaks inserted for visual formatting purposes. Thus in practice, we first usedsome simple heuristics to identify and remove line breaks that appear to be in the middle of sentences orparagraphs, and then we were able to split the text of each EMR text report.

We found that MetaMap could not parse many raw EMR text entries because they do not conform toits expectations for input text, i.e., a series of sentences that can be split apart and individually parsedaccording to its model for natural language.

4.5.2 Redacted data. Since EMR data comes from real patients, researchers who analyze such data musttake great care to maintain patient privacy. One important step is to de-identify the data, which involvesremoving patient identifiers such as names, alternate names and IDs used within the EMR system. TheRegenstreif dataset was de-identified by other parties prior to our analysis of this data according to anaggressive and fairly naive algorithm. The redacted dataset uses the token ‘XXXX’ to replace all instances ofall substrings matching any patient name parts or IDs appearing anywhere within the dataset. We excludedany tokens containing the substring ‘XXXX’ from our analysis.

5. EVALUATION

We show that explicitly capturing this structure enables both increased accuracy in statistical analytic tasks(§5) and a broad range of applications and user interfaces (§6). In this section, we compare the performance

15

Page 35: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Evalua'on: Epidemiology case study

•  Retrospec've drug effec'veness study

•  Compare to “ground truth” from clinical trials

Page 36: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Retrospec've drug effec'veness study

•  Treatment = high or low intensity sta'n – Clinical trials => slight protec've effect w/high

•  Outcome = heart a]ack?

•  Must iden'fy & adjust for confounders – Variables associated w/both treatment & outcome – Do EMR text reports help iden'fy confounders?

Page 37: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Semistructured & unstructured text

Retrospec've drug effec'veness study

Age/sex 2.05

? Words

N-‐grams

Discre'zed data

MetaMap

?

?

?

Treatment data (high vs. low) Outcome data (heart a]ack vs. ok)

Structured data

2.19

True < 1.0

Page 38: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Use odds ra'o (OR) to compare drug effec'veness under different models

•  Odds = Pr(x | y) / (1 – Pr(x | y)) where x = outcome (heart a]ack) and y = treatment (high-‐intensity or low-‐intensity sta'n)

•  OR = odds(x | high-‐int) / odds(x | low-‐int)

•  Clinical trials show high-‐intensity sta'ns slightly more protec've => expect OR slightly < 1.0

•  Compare OR w/confounder adjustment vs. crude OR

Page 39: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

EMR dataset

•  9,906 individuals, about 6 years

•  Inclusion criteria – 12 months with no fills for sta'ns – Variable period of 'me using a par'cular sta'n X

•  607,668 EMR text reports

Regenstrief Medical Record System / Indiana Network for Pa'ent Care (RMRS/INPC)

Page 40: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Features extracted for hd-‐PS

•  Bags of: – words, n-‐grams –  phrases, cons'tuents – Meta Mappings

•  With and without: –  Stop words –  Stemming –  Special tokens removed – Discre'zed quan'ta've data

*

* *

Page 41: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Results

(Results with 4-‐ through 7-‐grams similar to 2-‐ and 3-‐grams)

Expected OR from clinical trials: slightly less than 1.0

Model Porter Numbers OR (95% CI)

stemming replaced

Crude - - 2.19 (1.72, 2.80)

Age/sex adjusted - - 2.05 (1.58, 2.64)

Age/sex + claims adjusted - - 1.21 (0.86, 1.71)

Age/sex + claims + 1-gram adjusted ✗ ✗ 1.09 (0.76, 1.57)

✓ ✗ 1.16 (0.81, 1.66)

✓ ✓ 1.14 (0.80, 1.63)

Age/sex + claims + 2-gram adjusted ✗ ✗ 1.03 (0.71, 1.49)

✓ ✗ 1.22 (0.85, 1.76)

✓ ✓ 1.24 (0.86, 1.77)

Age/sex + claims + 3-gram adjusted ✗ ✗ 1.08 (0.75, 1.57)

✓ ✗ 1.09 (0.76, 1.57)

✓ ✓ 1.15 (0.80, 1.64)

Table VIII. Odds ratio (OR) for various models reported with 95% confidence intervals (CI). Adjustment by decile, trimmed.

Results with 4-grams through 7-grams are similar to those with 2-grams and 3-grams.

For illustration purposes, we focus on the most prevalent kind of EMR entries in our dataset, the set of all2001 electrocardiogram (ECG) text reports. The first and second examples of EMR entries that we showed(§1.3) are representative of these ECG reports.

By extracting typed text and identifying the distinct instances of template text, we were able to quicklydetect inconsistencies and errors in the reports. For example, we detected these very similar instances: P-R-TAXES $n $n $n and P-R-T AXES $n $n. It turns out that while the latter occurred a substantial but smallnumber of the reports, it represents ambiguous and incomplete information likely due to human data entryerror. We imagine an EMR system that could instantly prevent data entry errors by flagging anomalous, i.e.,low-frequency, instances of template text and suggesting close alternatives in the spirit of a spell-checker.

From the ECG reports, our extraction scheme identified data tuples corresponding to standard mea-surements used to characterize an ECG signal. Figure 1 shows examples of one-dimensional histogramsautomatically plotted from this extracted data. We highlight outliers that might correspond to errors dueto human error in data entry, bugs in data analysis software or environmental interference causing faultymachine behavior. This kind of outlier detection could be incorporated into EMR systems to automaticallydetect anomalous data in real time, thus enabling one of our use cases for structured EMR data (§2.2).

For each pair of distinct template text instances, we also automatically generated a scatterplot of alldata values. Figure 2 shows examples of these two-dimensional scatter plots. We again highlight outliers,including a very suspicious apparent functional dependency between VENT. RATE and PR INTERVAL. Wewould not expect such a functional dependency to arise from random human data entry error and thussuspect a problem with the ECG machine hardware or signal processing software.

7. RELATED WORK

Our work uses and builds upon methods and applications of representing or extracting features from un-structured (§7.1) and semistructured text (§7.2) as well as medical text in particular (§7.3).

7.1 Unstructured text

Natural language processing (NLP) and information retrieval (IR) techniques for working with unstructuredtext are surveyed extensively elsewhere [Jurafsky and Martin 2008; Manning et al. 2008]. NLP in particularoffers many more sophisticated models of language that capture grammatical structure in unstructured text.Here, we highlight some basic methods and applications for extracting structured features from text.

Modeling unstructured text documents as bags of words or as n-grams serves as a foundation for manytasks such as categorizing documents with respect to one or more topics [Damashek 1995; Blei et al. 2003;Wang et al. 2007], modeling word order in natural language [Brown et al. 1992] and and determining au-thorship [Mosteller and Wallace 1964; Keselj et al. 2003]. Our work relied heavily on techniques related tobags of words and n-grams. We note that while the technique of replacing tokens such as numbers and dateswith special tokens is commonly used to decrease the number of distinct tokens, we do not know of priorwork similar to ours for investigating distributions of data values associated with tokens or n-grams (§4.3.3).

16

Page 42: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Real-‐'me applica'ons

•  Detec'ng human data entry errors – “Did you mean … ?”

•  Detec'ng outlier data points – Flag anomalies poten'ally due to hardware, sodware or human error

Partners HealthCare LMR System

Page 43: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

1D distribu'ons and outliers

Page 44: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

2D distribu'ons, correla'ons, outliers

Page 45: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Future work

•  Addi'onal feature extractors –  Combina'ons, e.g., Meta Mappings + quan'ta've

•  EMR-‐specific applica'ons – Detec'ng trends (temporal data analysis) –  Visualiza'ons for doctors, pa'ents, researchers

•  Other domains – Government data, digital humani'es

Page 46: Extrac’ng*features*from** human2readable*semistructured*textelaine/pubs/angelino-structure... · example*emrentry:*ecg*report empi: xxxx mrn type: bwh mrn: xxxx report number: 0000000w

Thanks •  You

–  Stephen Chong, Jeremy Rassen, Margo Seltzer, Stuart Shieber

•  Others at BWH DoPE –  Todd Macgarvey, Sebas'an Schneeweiss, Peter Wahl

•  Partners Cardiology –  Robert Giugliano, Andrea Crompton

•  Regenstrief –  Marc Rosenman

•  Others –  Elif Yamangil, Heather Pon-‐Barry

D1.3 Burkhard Sanner, Luca Angelino 3 Edition, June 2015regeocities.eu/wp-content/uploads/2012/12/D1.3-3rd-Analysis-of... · D1.3 Burkhard Sanner, Luca Angelino 3rd Edition, June

MRN-1 – Cell dimensioning - Intranet DEIBhome.deib.polimi.it/capone/wn/MRN-EN-1-Cell dimensioning.pdf · MRN-1 – Cell dimensioning Mobile Radio Networks Prof. Antonio Capone

XXXX XXXX, BEFORE JENNIFER L. GRESOCK, STUDENT AN ... · AACPS Ex. 22 - Resume, XXXX XXXX AACPS Ex. 23 - Resume, XXXX XXXX AACPS Ex. 24 - Resume, XXXX XXXX, LCSW-C AACPS Ex. 25 -

mrn biz profiles july

CREDIT CARD BASICS · 2021. 1. 19. · XXXX XXXX XXXX XXXX Statement Period From : 02/12/2020 To : 01/01/2021 Customer Relationship No. XXXX XXXX XXXX XXXX All your cards, financial

BLM–DNA2–RPA–MRN and EXO1–BLM–RPA–MRN constitute …genesdev.cshlp.org/content/25/4/350.full.pdf · 2011-02-15 · BLM–DNA2–RPA–MRN and EXO1–BLM–RPA–MRN constitute

HardwareIDXXXX-XXXX-XXXX-XXXX-XXXX-XXXX-XXXX-XXXX …

DNA2 RPA MRN and EXO1 BLM RPAmicrobiology.ucdavis.edu/kowalczykowski/PDF_files... · BLM–DNA2–RPA–MRN and EXO1–BLM–RPA–MRN constitute two DNA end resection machineries

2015-02233-00 M.P. ANGELINO LIZCANO RIVERA.pdf

Rhcv dr. angelino (rehab cardiovas)

XXXX XXXXXXXX XXXX XXX XXXX XXXXXXX XXXX · XXXXXXXX XXX XXXX XXXX XXXX XXXX XXXXXXX XXXX. DA Muustero dell'Feonomia €16, go edelle 046090 613 5 CITTA' DI ASSISI ( Provincia di

Mrn general resource guide

Send to us this Hardware ID XXXX-XXXX-XXXX-XXXX-XXXX-XXXX

pmkbs.com · 03-xxxx-xxxx 03-xxxx-xxxx 2010 090- 090 9- xxxx- 9 g1970 3 6 a Email [email protected] 1970 Jessie Global Cath Global Steve Global -XXXX-XXXX (fi) xxx-xxxx-xxxx xxx-xxxx-xxxx

Current trends in food retailing – small shops · Web viewGrocery retailing in selected countries from Europe. Back to home page. XXXX XXXX XXXX XXXX XXXX XXXX XXXX XXXX XXXX

MRN – 10 – LTE - Politecnico di Milanohome.deib.polimi.it/capone/wn/MRN-EN-10-LTE.pdf · MRN – 10 – LTE Mobile Radio ... BTS/NodeB Rel 6 GSM/UMTS Serving/ PDN Gateway MME

Send to us this Hardware ID XXXX-XXXX-XXXX-XXXX … ID XXXX-XXXX-XXXX-XXXX-XXXX-XXXX-XXXX-XXXX LAN-ID XXXXXXXXXXXX ... DAS' SupÉCrtToCI XENTRY StattKeýCEh.Q„ DVD 03 | …

Winners of S$100 worth of free fuel ABDUL LATIFF … GEK LING XXXX XXXX XXXX 4031 CHOI KHIN MUN XXXX XXXX XXXX 5491 CHONG CHOON JENG XXXX XXXX XXXX 6246 CHONG GIM WAH XXXX XXXX XXXX

Privacy and Confidentiality Protection Overview...Single XXXX XXXX XXXX Married 4 51 54 Black F 3 36 36.7 Black M XXXX XXXX XXXX White M XXXX XXXX XXXX White F XXXX XXXX XXXX Solution

BONUS PRIZE : RM 100 CASH BACK - · PDF file20 xxxx xxxx xxxx 8640 yoe yong jian 100 ... 48 xxxx xxxx xxxx 9391 allan khoh hon mun 100 ... 110 xxxx xxxx xxxx 9169 lee chun teng 100

xxxx - xxxx (xxx)

Overview Application Features and Benefits lab disc disposable filters 6 entegris, inc. dimensions svlp40 xxxx aa xxxx svlp40 xxxx bb xxxx svlp40 xxxx cc xxxx svlp40 xxxx dd xxxx

MRN -3 – Mobility - Politecnico di Milanohome.deib.polimi.it/capone/wn/MRN-EN-3-Mobility.pdf · MRN -3 – Mobility Mobile Radio Networks ... o In the cell selection and reselection

· E—Mail E-Mail *LØNo_ E—Mail E-Mail 090—xxxx—xxxx xxxxxxx123 090—xxxx—xxxx xxxxxxxl 23 090—xxxx—xxxx xxxxxxx 1 23 090—xxxx xxxx

Adobe Photoshop PDFcdn.lesliespool.com/wpdf/Leslies-Custom-Cover1.pdfxxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx xxxx IF YOU HAVE CUTOUTS, PLEASE DRAW POOL IN BOX WITH MEASUREMENTS OF

Michael I. Jordan with Elaine Angelino, Maxim Rabinovich ... · Michael I. Jordan with Elaine Angelino, Maxim Rabinovich, Martin Wainwright and Yun Yang. Previous Work: Information

MRN – Exercises – part B - Intranet DEIBhome.deib.polimi.it/capone/wn/MRN-EN-Exercises-B.pdf · MRN – Exercises – part B Mobile Radio Networks Prof. Antonio Capone . ... A

U.S. Forces Okinawa, Japan Telephone Directory.pdf630-xxxx 632-xxxx 634-xxxx 622-xxxx 623-xxxx 645-xxxx 646-xxxx 637-xxxx 636-xxxx 625-xxxx 643-xxxx off base (include cellular) 098-970-5555

Extracting structure from human-readable semistructured textelaine/pubs/angelino-structure.pdf · Extracting structure from human-readable semistructured text Elaine Angelino ABSTRACT

XXXX XXXX SCHOOL

SECTION 7.2 INTRODUCTION TO ANGELINO HEIGHTSâ€™ ARCHITECTURAL STYLES

MRN – 7 – GPRS

PHASE I - YEAR END NEW RIDE CAMPAIGN (Y3) CASH … · 35 xxxx xxxx xxxx 0169 cheong si swan€ 100 ... 117 xxxx xxxx xxxx 9240 sia chay seng€ 100 ... 189 xxxx xxxx xxxx 4938 teo