effects of grayscale window/level on breast lesion detectability jeffrey johnson, phd a john...

28
Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD b b a Supported by U. S. Army Medical Research and Materiel Command, grant DAMD-17-01-1-0621

Upload: silvester-jefferson

Post on 19-Jan-2018

217 views

Category:

Documents


0 download

DESCRIPTION

3 Overview  This study evaluated the use of a visual discrimination model (VDM) for predicting effects of one type of image enhancement - grayscale window width and level (W/L) - on the detectability of breast lesions  Compared model and observer performance in two experiments: –2AFC detection thresholds with simulated mammograms and nonmedical observers –ROC observer performance study with radiologists and digitized mammograms

TRANSCRIPT

Page 1: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

Effects of Grayscale Window/Level on Breast

Lesion DetectabilityJeffrey Johnson, PhD a

John Nafziger, PhD a Elizabeth Krupinski, PhD b

Hans Roehrig, PhD b

baSupported by U. S. Army Medical Research and Materiel Command, grant DAMD-17-01-1-0621

Page 2: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

2

Rationale

Nearly 50% of breast lesions missed at initial screening are visible retrospectively

Digital mammography could reduce perceptual errors by enhancing lesion conspicuity with image processing

Perceptual models could be useful tools for automating and optimizing techniques for image enhancement

Page 3: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

3

Overview This study evaluated the use of a visual

discrimination model (VDM) for predicting effects of one type of image enhancement - grayscale window width and level (W/L) - on the detectability of breast lesions

Compared model and observer performance in two experiments:– 2AFC detection thresholds with simulated

mammograms and nonmedical observers– ROC observer performance study with radiologists

and digitized mammograms

Page 4: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

4

Methods: Simulated Mammograms

Backgrounds– Filtered noise, 1/f3

noise power spectrum– Two groups: Bright and

Dark central regions

Lesion signals– Mass: 2D Gaussian (d=50

arcmin)– Microcalcification cluster:

six blurred disks or “specks” (disk d=8 arcmin, cluster d=40 arcmin)

Page 5: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

5

Methods: W/L Conditions P-value transformations:

– Fully stretched– Understretched (-25%)– Overstretched (±25%)– Bright shifted (+25%)– Dark shifted (-25%)

Applied to full 512x512 pixel image or 170x170 pixel central region of interest containing lesion

0

256

512

768

1024

0 1024 2048 3072 4096p

p'

Fully stretchedUnderstretched (25%)Overstretched (25%)Bright shifted (25%)Dark shifted (25%)

Page 6: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

6

Example Test Images

Fullystretched

(FS)

Understretched

(US)

Overstretched

(OS)

GaussianFull W/L

Bright Center

GaussianCentral W/L

Bright Center

SpecksFull W/L

Dark Center

SpecksCentral W/LDark Center

Page 7: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

7

Example Test Images

Bright shifted(BS)

Dark shifted(DS)

GaussianFull W/L

Bright Center

GaussianCentral W/L

Bright Center

SpecksFull W/L

Dark Center

SpecksCentral W/LDark Center

Page 8: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

8

2AFC Threshold Detection

Side-by-side presentation of same background with/without signal

Signal amplitude varied in 1-up/3-down staircase procedure; detection threshold at ~80% correct

Five W/L conditions interleaved in same session

Separate sessions for two signal and two background types

Page 9: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

9

Test Conditions Siemens 5M-pixel CRT monitor (P45) Luminance range = 0.3 to 290 cd/m2

Barco 10-bit display controller DICOM-14 grayscale display function Three nonmedical observers Viewing distance = 52 cm; chin rest Ambient lights off

Page 10: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

10

Results: Detection Thresholds for Gaussian Signals

0.00

0.03

0.06

0.09

0.12

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Method

Det

ectio

n Th

resh

old

Full W/LCentral W/L

Gaussian on Bright Filtered Noise

0.00

0.03

0.06

0.09

0.12

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Method

Det

ectio

n Th

resh

old

Full W/LCentral W/L

Gaussian on Dark Filtered Noise

Error bars show 95% confidence intervals

Bright Backgrounds Dark Backgrounds

Page 11: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

11

Results: Detection Thresholds for Speck Clusters

Error bars show 95% confidence intervals

0.00

0.03

0.06

0.09

0.12

0.15

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Method

Det

ectio

n Th

resh

old

Full W/LCentral W/L

Specks on Bright Filtered Noise

0.00

0.03

0.06

0.09

0.12

0.15

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Method

Det

ectio

n Th

resh

old

Full W/LCentral W/L

Specks on Dark Filtered Noise

Bright Backgrounds Dark Backgrounds

Page 12: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

12

Experimental Detection Thresholds

Significant variations across W/L conditions Generally lower for central vs. full W/L

– due to local contrast enhancement- fully stretched not always optimal

Full W/L: Lowest thresholds for …– fully stretched, understretched (specks only)– dark shifted on bright, bright shifted on dark

Central W/L: Lowest thresholds for …– overstretched for Gaussians and specks on dark– dark shifted on bright, bright shifted on dark

Page 13: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

13

Visual Discrimination Modeling

Simulates physiological response of human visual system to visual stimuli: luminance patterns from images & video

Output is a deterministic prediction of feature or image discriminability as function of spatial location, spatial frequency, and time

Discriminability measured in units of Just Noticeable Differences (JND)

Page 14: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

14

VDM Architecture

JND scalar

Spatial frequencybands

Spatial orientation responses

Display& Ocular

Processing

Optics

Crossband Masking

JND map

JND Distance

Combin. Rule

Display luminance

Pair of input images

Probability

Contrast Pyramid (visual cortex)

Within-band Masking

Contrast Pyramid

Page 15: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

15

VDM vs. Experimental Thresholds for Gaussians on Bright

Backgrounds

0.00

0.03

0.06

0.09

0.12

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Gaussian on BrightFiltered Noise: Full W/L

Error bars show 95% confidence intervals

0.00

0.03

0.06

0.09

0.12

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Gaussian on BrightFiltered Noise: Central W/L

Full W/L Central W/L

Page 16: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

16

VDM vs. Experimental Thresholds for Gaussians on Dark Backgrounds

Error bars show 95% confidence intervals

0.00

0.03

0.06

0.09

0.12

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Gaussian on DarkFiltered Noise: Full W/L

0.00

0.03

0.06

0.09

0.12

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Gaussian on DarkFiltered Noise: Central W/L

Full W/L Central W/L

Page 17: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

17

VDM vs. Experimental Thresholds for Specks on Bright Backgrounds

Error bars show 95% confidence intervals

0.00

0.04

0.08

0.12

0.16

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Specks on BrightFiltered Noise: Full W/L

0.00

0.04

0.08

0.12

0.16

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Specks on BrightFiltered Noise: Central W/L

Full W/L Central W/L

Page 18: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

18

VDM vs. Experimental Thresholds for Specks on Dark Backgrounds

Error bars show 95% confidence intervals

0.00

0.04

0.08

0.12

0.16

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Specks on DarkFiltered Noise:

Full W/L

0.00

0.04

0.08

0.12

0.16

FullyStretched

UnderStretched

OverStretched

BrightShifted

DarkShifted

W/L Condition

Con

tras

t Thr

esho

ld

ExperimentalModel

Specks on DarkFiltered Noise: Central W/L

Central W/LFull W/L

Page 19: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

19

VDM vs. Experimental Thresholds: Simulated Lesions & Backgrounds

Generally good agreement between model and experimental detection thresholds and variations across W/L conditions

Consistently reduced thresholds with central (local ROI) vs. full-image W/L

Largest modeling discrepancies for specks, especially on dark backgrounds

Page 20: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

20

ROC Observer Study

Determine effects of W/L functions and size on detection of microcalcification clusters by mammographers

Evaluate utility of localized ROI contrast enhancement (central vs. full W/L)

Page 21: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

21

ROC Observer Study: Image Preparation

Digitized mammograms (n=15) from Digital Database for Screening Mammography

Extracted 512x512-pixel sections with single, centered microcalcification cluster

Removed calcifications by median filtering Generated five lesion-contrast levels: 0, 25, 50, 75,

and 100% Applied three W/L functions: Fully stretched, under

and over stretched by 15% Full and Central W/L sizes

Page 22: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

22

ROC Observer Study: Test Conditions

6 radiologists at Univ. of Arizona 225 images/session 2 reading sessions ~2 weeks apart Decision confidence on 6-point scale No image processing, no time limits,

ambient lights off; viewed at ~25 cm Siemens 5M-pixel CRT monitor (P45) Luminance = 0.8 to 500 cd/m2

DICOM-14 grayscale display function

Page 23: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

23

Examples of Test ImagesUnderstretched

(US, 15%) Overstretched

(OS, 15%) Fully stretched(FS, 0-4095)

FullW/L

CentralW/L

Page 24: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

24

ROC Observer Study: Results Compared central vs. full W/L across all W/L functions, all lesion

contrasts Observer performance statistically better (p<0.05) for FULL W/L size

Az Values

Page 25: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

25

ROC Observer Study: Results No statistically significant variations:

– between central and full W/L sizes for a single W/L function (all lesion contrasts)– between central and full W/L sizes for a single combination of W/L function and lesion contrast (except FS,

50%)– across W/L functions in central and full W/L sizes considered separately (all lesion contrasts)

Page 26: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

26

ROC Observer Study: Analysis Central W/L enhanced lesion contrast but changed appearance of parenchymal tissue relative to surrounding

areas Decision confidence lowered by nonuniform appearance of background tissue characteristics Conclusion: Calcifications may be easier to perceive (due to higher contrast) but more difficult to interpret (due

to cognitive factors, past experience)

Page 27: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

27

Conclusions For simulated lesions and backgrounds, VDM was generally a reliable predictor of W/L conditions for optimal detectability

Results with simulated images suggested benefits of localized contrast enhancement

Decision confidence and performance of mammographers actually lower with localized W/L, probably due to nonuniform tissue appearance

Page 28: Effects of Grayscale Window/Level on Breast Lesion Detectability Jeffrey Johnson, PhD a John Nafziger, PhD a Elizabeth Krupinski, PhD b Hans Roehrig, PhD

28

Future Directions Allow toggling between full and local W/L modes (combine uniform contextual data with local contrast enhancement) Evaluate effects of W/L on detection of very subtle lesions (low contrast, near threshold) Model refinements:

– improved crossband masking for higher frequency signals: specks/calcifications– include effects of background noise via statistical observer model