diagnosis of ovarian cancer based on mass spectrum of blood samples

24
Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples Committee: Eugene Fink Lihua Li Dmitry B. Goldgof Hong Tang

Upload: diane

Post on 25-Feb-2016

25 views

Category:

Documents


0 download

DESCRIPTION

Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples. Hong Tang. Committee: Eugene Fink Lihua Li Dmitry B. Goldgof. Outline. Introduction Previous work Feature selection Experiments. Motivation. Early cancer detection is critical for successful treatment. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Committee:Eugene Fink

Lihua LiDmitry B. Goldgof

Hong Tang

Page 2: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Outline

• Introduction

• Previous work

• Feature selection

• Experiments

Page 3: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Motivation

Early cancer detection is criticalfor successful treatment.

Five year survival for ovarian cancer:• Early stage: 90%• Late stage: 35%

80% are diagnosed at a late stage.

Page 4: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Motivation

Desired features ofcancer detection:

• Early detection

• High accuracy

• Low cost

Page 5: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Mass spectrum

We can detect some early-stage cancersby analyzing the blood mass spectrum.

ratio of molecular weight to electrical charge

inte

nsity

20,0000 5,000 10,000 15,000

10–4

10–2

100

102

Page 6: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Mass spectrumMass spectrum

Data miningResults

Blood

Page 7: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Outline

• Introduction

• Previous work

• Feature selection

• Experiments

Page 8: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Initial work

• Vlahou et al. (2001): Manual diagnosis

of bladder cancer based on mass spectra

• Petricoin et al. (2002): Application of

clustering to mass spectra for the ovarian-

cancer diagnosis

Page 9: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Decision treesAdam et al. (2002): 96% accuracy for prostate cancerQu et al. (2002): 98% accuracy for prostate cancer

Later work

Neural networksPoon et al. (2003): 91% accuracy for liver cancer

ClusteringPetricoin et al. (2002): 80% accuracy for prostate cancer

Page 10: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Outline

• Introduction

• Previous work

• Feature selection

• Experiments

Page 11: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Feature selection

ratio of molecular weight to electrical charge

inte

nsity

200 400 600

CancerHealthy

2 21 2 1 2/ Statistical difference:

Page 12: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Feature selection

ratio of molecular weight to electrical charge

inte

nsity

200 400 600

Window size: minimal distance between selected points

CancerHealthy

Page 13: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Outline

• Introduction

• Previous work

• Feature selection

• Experiments

Page 14: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Data sets

Dataset

Number of cases Cancer Healthy

123

100100162

116116 91

Page 15: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Learning algorithms

• Decision trees (C4.5)

• Support vector machines (SVMFu)

• Neural networks (Cascor 1.2)

Page 16: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Control variables

• Number of features, 1–64

• Window size, 1–1024

Page 17: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Best control valuesDecision trees

Data set

Number of features

Window size

Accuracy

1 4 1 82%2 8 4 94% 3 8 64 99%

Page 18: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Best control valuesSupport vector machines

Data set

Number of features

Window size

Accuracy

1 32 16 83%2 4 2 94% 3 16 8 99%

Page 19: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Best control valuesNeural networks

Data set

Number of features

Window size

Accuracy

1 32 256 82%2 32 1 96% 3 16 2 99%

Page 20: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Learning curveData set 1

accu

racy

(%)

training size

90

80

60

100

70

Decision trees, SVM, Neural networks

50 100 150 200 250

Page 21: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

accu

racy

(%)

Learning curveData set 2

training size

90

80

60

100

70

Decision trees, SVM, Neural networks

0 50 100 150 200 250

Page 22: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Learning curveData set 3

accu

racy

(%)

training size

50 100 150 20060

70

90

80

100

0

Decision trees, SVM, Neural networks

250

Page 23: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Main results

Automated detection of ovarian cancer byanalyzing the mass spectrum of the blood

• Experimental comparison of decision

trees, SVM and neural networks

• Identification of the most informative

points of the mass-spectrum curves

Page 24: Diagnosis of Ovarian Cancer Based on Mass Spectrum of Blood Samples

Future work

• Experiments with other data sets

• Other methods for feature selection

• Combining with genetic algorithm