proteomic identification and systematic verification of ... · proteomic identification and...

Proteomic Identification and Systematic Verification of

Biomarkers for Aggressive Prostate Cancer

by

Yunee Kim

A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy

Medical Biophysics University of Toronto

© Copyright by Yunee Kim 2015

ii

Proteomic Identification and Systematic Verification of

Biomarkers for Aggressive Prostate Cancer

Yunee Kim

Doctor of Philosophy

Medical Biophysics

University of Toronto

2015

Abstract

Despite prostate cancer’s status as the most common cancer to affect men in the Western world,

only an estimated 20% of men will die from it. Due to the lack of effective means of

prognostication, many men are unnecessarily biopsied and treated. The early and non-invasive

identification of lethal disease would have a profound impact on the current practice of prostate

cancer management. To this end, the work presented in this thesis describes efforts to

characterize proteins in a prostatic fluid known as expressed prostatic secretions (EPS), and to

delineate factors in this fluid that demonstrate a high potential as prognostic biomarkers. Using

different mass spectrometry-based platforms, I present both a global view of the EPS proteome,

as well as targeted, quantitative measures of promising biomarker candidates in this fluid.

In order to generate an archive of soluble and secreted factors present in the proteome of EPS, I

used shotgun proteomics to analyze direct-EPS and EPS-urine. This led to the identification of

over 2000 proteins that comprise the EPS proteome. The data was integrated with other publicly

accessible information, such as gene expression profiles and immunohistochemistry images, to

characterize this proteome. Furthermore, a collection of prostate-enriched proteins was identified

iii

by comparing the proteomes of EPS-urine and urine, demonstrating the unique protein profile of

EPS.

Next, comparative proteomic profiling of direct-EPS from organ confined and extracapsular

tumors produced a long list of potential biomarker candidates. A targeted assay was

systematically developed in order to quantitatively assess the performance of the most promising

candidates in cohorts of EPS-urines from a heterogeneous population of prostate cancer patients

and controls. Utilizing the assay, a number of candidates were verified to be putative biomarkers

for prostate cancer diagnosis and prognosis. This information needs to be further validated and

holds promise for improved prostate cancer management.

iv

Acknowledgments

Let me begin with my deepest gratitude for the support and encouragement I received from my

supervisor, Dr. Thomas Kislinger. His integrity, courage, and commitment to contribute high

caliber research to advancing medical science have been great driving forces in my own pursuit

towards a productive Ph.D.

The members of my advisory committee, Drs. Theodorus Van der Kwast and Rama Khokha,

have been instrumental in my journey. I thank them for taking the time to engage in stimulating

discussions and offering their valuable insights.

This thesis is a culmination of the dedication of numerous individuals. I thank Drs. Richard

Drake, John Semmes, and Raymond Lance for organizing and providing the samples. Thanks to

Dr. Anthony Gramolini for permitting the unlimited use of the mass spectrometer. Thank you to

Drs. Paul Boutros, Clare Jeon, and Irina Kalatskaya, and Miss Cindy Yao for performing the

bioinformatics analyses and patiently answering my often naïve questions. Lastly, I thank the

patients and donors. None of this would have been possible without their generosity.

My time in grad school was made enjoyable in large part due to the many friends and groups that

became a part of my life. I have been fortunate to work alongside the past and present members

of the Kislinger lab: Ankit Sinha, Dr. Salvador Mejia, Alexandr Ignatchenko, Vladimir

Ignatchenko, Javier Alfaro, Dr. Erwin Schoof, Dr. Lusia Sepiashvili, Dr. Sarah Elschenbroich,

Lyudmila Nedyalkova, Dr. Simona Fontana and Dr. Simona Principe. I admire each and every

one of them for their bright minds and am grateful for the lasting friendships. A special shout out

to Sarah for patiently teaching me the way around a mass spectrometer and guiding me through

my first attempts at analyzing the data. The memories of the happy hours we spent together at

Café Volo after a hard day’s work will be forever cherished. I’m grateful to my lab mate and

friend Simona, with whom I shared countless lunches, washroom breaks, and random

adventures. I admire her tremendous abilities as a scientist and enormous warm heart. Thank you

to Lusia, for always having my back and taking the time to share your brilliant ideas with me. In

addition to my lab mates, I thank the Rottapel, Raught, and Gramolini labs for making the work

environment so pleasurable. The countless hours spent in the windowless mass spec room were

made enjoyable by the many friends that kept me company.

v

I am grateful to my best friends, Serge and Laura, for being my supporters and therapists, even

from half way around the world. I’m thankful to my #yologirls for being those with whom I can

be downright silly. Thanks to my MPC friends for their openness, kindness, generosity, and

deep, thought-provoking insights. They have inspired an exploration of myself to a depth and

truth I have never experienced before.

I thank my family for giving me the best foundation to grow from and showering me with their

warm love every day. With their strong support, I am fearless about whatever life throws my

way.

Last but not least, I thank my partner, Albert, for his quiet patience and unwavering love.

vi

Summary of Accomplishments

Publications (10)

Original research articles

1. Principe S, Jones E, Kim Y, et al. In-depth proteomic analyses of exosomes isolated from

expressed prostatic secretions in urine. 2013. Proteomics 13(10-11):1667. Underlined

authors contributed equally to this work.

2. Kim Y, et al. Identification of differentially expressed proteins in direct expressed prostatic

secretions of men with organ-confined versus extracapsular prostate cancer. 2012.

Molecular Cellular Proteomics 11(12):1870.

3. Principe S, Kim Y, Fontana S, et al. In-depth proteomic analyses of expressed prostatic

secretions in urine: identification of prostate-enriched proteins as potential biomarkers.

2012. Journal of Proteome Research 11(4): 2386. Underlined authors contributed equally

to this work.

4. Drake RR, Elschenbroich S, Lopez-Perez O, Kim Y, et al. In-depth proteomic analyses of

direct expressed prostatic secretions. 2010. Journal of Proteome Research 9:2109.

Reviews, book chapter, commentary

5. Kim Y, Kislinger T. Isoform-specific targeted proteomics sheds new light on synaptic

ligands. 2015. eLife (submitted)

6. Kim Y, Kislinger T. Novel approaches for the identification of biomarkers of aggressive

prostate cancer. 2013. Genome Medicine 5(6):56

7. Kim Y, Lance R, Drake R, Semmes O, Kislinger T. Proximal tissue fluids and targeted

proteomics: a novel approach to identify aggressive prostate cancers. Commentary in

UroToday, published February 26th, 2013 (http://urotoday.com)

vii

8. Cooper S, Tun H, Roper S, Kim Y. Current status of biomarker discovery in human clear

cell renal cell carcinoma. 2012. Journal of Molecular Biomarkers & Diagnosis S2:005.

doi:10.4172/2155-9929.S2-005.

9. Kim Y, et al. Use of Colloidal Silica-Beads for the Isolation of Cell Surface Proteins for

Mass Spectrometry-Based Proteomics. 2011. Methods in Molecular Biology 748:227.

10. Elschenbroich S, Kim Y, et al. Isolation of cell surface proteins for mass spectrometry-

based proteomics. 2010. Expert Review of Proteomics 7:141 Underlined authors

contributed equally to this work.

Presentations to Scientific Audiences (12)

Oral presentations

1. Canadian National Proteomics Network, 2014. Montreal, QC, Canada

2. International Conference on Urine-Omics, 2013. Costa da Caparica, Portugal

3. Canadian National Proteomics Network, 2012. Toronto, ON, Canada

4. Lake Louise Tandem Mass Spectrometry Workshop, 2011. Lake Louise, AB, Canada

5. James Lepock Memorial Symposium Ph.D. Talk, 2011. Toronto, ON, Canada

6. Ontario Cancer Institute Cancer Proteomics Group Meeting, 2010. Toronto, ON, Canada

7. Australian-Canadian Prostate Cancer Research Alliance Conference, 2010. Gold Coast,

QLD, Australia

Poster presentations

8. Canadian National Proteomics Network, 2014. Montreal, QC, Canada

9. Australian-Canadian Prostate Cancer Research Alliance Conference, 2010. Gold Coast,

QLD, Australia

10. Institute of Medical Science Day, 2010. Toronto, ON, Canada

viii

11. 22nd Annual Canadian Society for Immunology Conference, 2009. Whistler, BC, Canada

12. Centre for Genetic Improvement of Livestock Workshop, 2009. Guelph, ON, Canada

Abstracts (7)

1. Kim Y, et al., Biomarkers for the identification of biomarkers in expressed prostatic

secretions. International Conference in Urine-Omics, 2013. Costa da Caparica, Portugal.

2. Kim Y, et al., Identification of differentially expressed proteins in direct expressed

prostatic secretions of men with organ-confined versus extracapsular prostate cancer, 2012.

Canadian National Proteomics Network Conference, Toronto, ON, Canada.

3. Kim Y, et al., Identification of prognostic biomarkers of prostate cancer in prostatic fluids.

Lake Louise Tandem Mass Spectrometry Workshop, 2011. Lake Louise, AB, Canada.

4. Kim Y, et al., Proteomics of prostate cancer: from discovery to validation of putative

prognostic markers. Australian-Canadian Prostate Cancer Research Alliance Conference,

2010. Gold Coast, QLD, Australia.

5. Kim Y, et al., Proteomics of prostate cancer: from discovery to validation of putative

prognostic markers. Institute of Medical Science Day, 2010. Toronto, ON, Canada.

6. Fontana S, Drake R, Kim Y, et al., Proteome profiling of EPS-urine for the identification of

putative prostate cancer biomarkers. 10th Advanced Meeting on Cancer Omics, 2010.

Erice, Sicily, Italy.

7. Drake R, Elschenbroich S, Lopez-Perez L, Kim Y, et al., Comprehensive proteomic

profiling of direct expressed prostatic secretions: A novel strategy for the discovery of

prostate cancer biomarkers, 2009. Clinical Proteomics 5:275.

Academic Awards, Scholarships, Fellowships, and Honours (20)

1. 2014 Paul Starita Graduate Student Fellowships

2. 2014 Peterborough K.M. Hunter Graduate Studentships

ix

3. 2014 Frank Fletcher Memorial Fund

4. 2014 Canadian National Proteomics Network Travel Award

5. 2014 Canadian National Proteomics Network Best Oral Presentation by a Trainee

6. 2013 Ontario Graduate Scholarship

7. 2013 Scace Graduate Fellowship in Prostate Cancer Research

8. 2013 Paul Starita Graduate Student Fellowship

9. 2013 Medical Biophysics Travel Grant


11. 2012 Medical Biophysics Excellence Fellowship

12. 2011 James Lepock Memorial Symposium Best PhD Talk Award


14. 2011 Canadian Society for Mass Spectrometry Travel Award



17. 2010 Australian-Canadian Prostate Cancer Research Alliance Travel Award

18. 2010 Prostate Cancer Foundation of Australia Poster Contest


20. 2009 Institute of Medical Science Entry Award

x

Table of Contents

Acknowledgments .......................................................................................................................... iv

Summary of Accomplishments ...................................................................................................... vi

Table of Contents ............................................................................................................................ x

List of Tables ................................................................................................................................ xv

List of Figures .............................................................................................................................. xvi

List of Appendices ....................................................................................................................... xix

List of Abbreviations ................................................................................................................... xxi

Chapter 1 Introduction and thesis overview ................................................................................... 1

1 Introduction and thesis overview ............................................................................................... 1

1.1 The prostate gland ............................................................................................................... 1

1.1.1 Anatomy and histology of the prostate gland ......................................................... 1

1.2 Prostate cancer .................................................................................................................... 3

1.2.1 Prostate cancer epidemiology and statistics ............................................................ 3

1.2.2 Histopathology and clinical progression of prostate cancer ................................... 5

1.2.3 Current concepts of insignificant versus significant prostate cancer ...................... 6

1.2.4 Genetic alterations and molecular subtypes of prostate cancer .............................. 7

1.2.5 Current methods of prostate cancer and diagnosis ................................................. 8

1.3 Prostate cancer biomarkers ............................................................................................... 11

1.3.1 Introduction to cancer biomarkers ........................................................................ 11

1.3.2 Clinical and analytical considerations for cancer biomarker assay development . 12

1.3.3 Biological sources of biomarkers .......................................................................... 15

1.3.3.1 Tissue proximal fluids ............................................................................ 16

1.3.3.2 Biomarkers in biological fluids .............................................................. 17

1.3.4 Adjunct prostate cancer biomarkers ...................................................................... 18

xi

1.3.4.1 Cellular markers ..................................................................................... 19

1.3.4.2 Genetic aberrations as prognostic biomarkers ........................................ 20

1.3.4.3 MicroRNAs ............................................................................................ 21

1.3.4.4 Circulating biomarkers ........................................................................... 21

1.3.4.5 Urinary biomarkers ................................................................................. 22

1.4 Clinical proteomics for biomarker discovery ................................................................... 23

1.4.1 Discovery proteomics using shotgun, bottom-up proteomics ............................... 23

1.4.2 Protein quantification using mass spectrometry ................................................... 27

1.4.3 Targeted proteomics .............................................................................................. 29

1.4.3.1 Selected reaction monitoring mass spectrometry ................................... 30

1.4.3.2 Application of selected reaction monitoring mass spectrometry to

biomarker research ................................................................................. 31

1.5 Rationale and objectives ................................................................................................... 33

1.5.1 Rationale ............................................................................................................... 33

1.5.2 Hypotheses and aims ............................................................................................. 33

Chapter 2 In-depth proteomic analyses of prostate proximal tissue fluids ................................... 36

2 In-depth proteomic analyses of prostate proximal fluids ......................................................... 36

2.1 Abstract ............................................................................................................................. 36

2.2 Introduction ....................................................................................................................... 37

2.3 Results ............................................................................................................................... 39

2.3.1 In-depth proteomic analyses of direct expressed prostatic secretions .................. 39

2.3.1.1 MudPIT profiling of direct expressed prostatic secretions from

prostate cancer patients ........................................................................... 39

2.3.2 Integrative data mining and comparison to other public databases ...................... 43

2.3.2.1 Comparison of the direct-EPS proteome to other published proteomic

datasets ................................................................................................... 45

2.3.2.2 Multi-dataset integration......................................................................... 46

xii

2.3.3 Identification of prostate-enriched proteins by in-depth proteomic analyses of

expressed prostatic secretions in urine .................................................................. 50

2.3.3.1 MudPIT profiling of expressed prostatic secretions in urine ................. 50

2.3.3.2 Identification of prostate-enriched proteins ............................................ 57

2.3.3.3 Characterization of prostate-enriched proteins ....................................... 62

2.3.3.4 Generation of candidate shortlist ............................................................ 64

2.3.3.5 Verification of proteomic data ................................................................ 64

2.4 Discussion ......................................................................................................................... 66

2.5 Materials and methods ...................................................................................................... 67

2.5.1 Materials ............................................................................................................... 67

2.5.2 Sample collection and properties .......................................................................... 67

2.5.3 Protein digestion and sample preparation ............................................................. 69

2.5.4 MudPIT analyses .................................................................................................. 69

2.5.5 Protein identification and data analyses ................................................................ 71

2.5.5.1 Gene Ontology annotation and data comparisons .................................. 72

2.5.6 Prostate-enriched proteins characterization .......................................................... 73

2.5.6.1 SDS-PAGE and Western blot analysis on urine and EPS-urine pools ... 74

Chapter 3 Identification of candidate biomarkers of aggressive prostate cancer ......................... 75

3 Identification of candidate biomarkers of aggressive prostate cancer ..................................... 75

3.1 Abstract ............................................................................................................................. 75

3.2 Introduction ....................................................................................................................... 76

3.3 Results ............................................................................................................................... 78

3.3.1 Proteomic characterization of expressed prostatic secretions from organ-

confined and extracapsular prostate cancers ......................................................... 78

3.3.2 Semi-quantitative comparison of expressed prostatic secretions from

extracapsular and organ-confined prostate cancers .............................................. 84

3.3.3 Generation of candidate shortlist .......................................................................... 87

xiii

3.3.4 Verification of differentially expressed candidates in direct-EPS ........................ 90

3.3.5 Verification of differentially expressed candidates in EPS-urines ....................... 93

3.4 Discussion ......................................................................................................................... 99

3.5 Materials and methods .................................................................................................... 103

3.5.1 EPS Sample Properties ....................................................................................... 103

3.5.2 Protein digestion and sample preparation for mass spectrometry ...................... 104

3.5.3 Protein identification and data analysis .............................................................. 104

3.5.4 Gene Ontology annotation, pathway enrichment and network-based analysis

of EPS proteins ................................................................................................... 105

3.5.5 Integrative data mining ....................................................................................... 106

3.5.6 Immunoassays in prostatic fluids ........................................................................ 106

3.5.7 Statistical analysis ............................................................................................... 107

3.5.8 SRM-MS development ....................................................................................... 107

3.5.9 Relative quantification by SRM-MS in EPS-urines ........................................... 109

Chapter 4 Systematic development of a quantitative assay for putative prostate cancer

biomarkers in expressed prostatic secretions ......................................................................... 110

4 Systematic large-scale development of targeted proteomics assays for biomarker

verification in prostate-proximal fluids .................................................................................. 110

4.1 Abstract ........................................................................................................................... 110

4.2 Introduction ..................................................................................................................... 111

4.3 Results ............................................................................................................................. 112

4.3.1 Candidate selection for targeted quantification by selected reaction monitoring

mass spectrometry ............................................................................................... 112

4.3.2 Targeted relative quantification of candidates in a cohort of EPS-urines .......... 114

4.3.3 Selection of peptide candidates for verification .................................................. 116

4.3.4 Absolute quantification for the verification of candidates in EPS-urines ........... 119

4.3.5 Comparative analysis between subgroups of patients ........................................ 122

4.4 Discussion and future direction ...................................................................................... 127

xiv

4.5 Materials and Methods .................................................................................................... 130

4.5.1 Sample collection and properties ........................................................................ 130

4.5.2 Sample preparation for mass spectrometry ......................................................... 131

4.5.3 Target acquisition by selected reaction monitoring mass spectrometry ............. 131

4.5.4 Phase 2 peptide selection .................................................................................... 132

4.5.5 Phase 2 peptide quantification and Phase 3 peptide selection ............................ 132

4.5.6 Generation of response curves and determination of figures of merit ................ 133

4.5.7 Multiplexed, scheduled selected reaction monitoring mass spectrometry

method ................................................................................................................. 133

4.5.8 Verification of Phase 3 peptides in EPS-urines .................................................. 133

4.5.9 Quantitative and statistical analyses ................................................................... 134

Chapter 5 Summary and future direction .................................................................................... 135

5 Summary and future direction ................................................................................................ 135

5.1 Summary of key findings ................................................................................................ 135

5.1.1 In-depth proteomic analyses characterizing the direct and urinary expressed

prostatic secretion proteomes .............................................................................. 135

5.1.2 Discovery phase: identification of candidate biomarkers of extracapsular

prostate cancer .................................................................................................... 136

5.1.3 Verification phase: targeted quantification of selected prostate cancer

biomarker candidates .......................................................................................... 136

5.1.4 Additional proteomics analysis of prostate-proximal fluids ............................... 137

5.2 Future directions ............................................................................................................. 138

5.3 Concluding remarks ........................................................................................................ 142

References ................................................................................................................................... 143

xv

List of Tables

Table 1.1 Descriptors of disease biomarkers. ............................................................................... 14

Table 1.2 Common sources of prostate cancer biomarkers. ......................................................... 15

Table 1.3 Summary of common labeled and label-free protein quantification techniques using

MS. ................................................................................................................................................ 29

Table 2.1 Clinical information of the direct-EPS fluids analyzed by MudPIT proteomics. ......... 39

Table 2.2 Clinical information for the prostate cancer (5 samples) and benign prostatic

hyperplasia (6 samples) EPS-urines analyzed by MudPIT. .......................................................... 51

Table 2.3 Clinical information for the pooled urine and EPS-urine samples analyzed by

MudPIT.. ....................................................................................................................................... 58

Table 3.1 Clinical information of patients used for discovery proteomics. .................................. 80

Table 3.2 Fourteen candidates selected by data mining scheme. ................................................. 89

Table 3.3 Clinical information of patient samples used for Western blotting verification. .......... 91

Table 3.4 Clinical information of patient samples used to generate EPS-urine pools. ................. 95

Table 4.1 Overview of clinical information from patients enrolled in Cohort A of EPS-urines. 114

Table 4.2. Limits of detection and quantification for 34 peptides determined by response curves

generated using AQUA peptides spiked into urine. .................................................................... 118

Table 4.3 Overview of clinical information from patients enrolled in Cohort B of EPS-urines. 119

Table 4.4 Relative expression of Phase 3 proteins and peptides in EPS fluids using various

quantitative methods. .................................................................................................................. 126

xvi

List of Figures

Figure 1.1 The prostate gland. ........................................................................................................ 2

Figure 1.2 Cell types of the prostate gland.. ................................................................................... 3

Figure 1.3 Progression of human prostate cancer. .......................................................................... 6

Figure 1.4 Receiver operating characteristic curve. ...................................................................... 13

Figure 1.5 Prostate specific antigen access to circulation. ............................................................ 17

Figure 1.6 Various strategies for novel molecular biomarker identification. ............................... 19

Figure 1.7 Schematic of multidimensional protein identification technology (MudPIT). ........... 25

Figure 1.8 Dynamic concentration range of human plasma proteins. .......................................... 27

Figure 1.9 Selected reaction monitoring mass spectrometry. ....................................................... 30

Figure 2.1 Proteomics analyses of direct expressed prostatic secretions. ..................................... 41

Figure 2.2 Functional enrichment of proteins detected in EPS. ................................................... 42

Figure 2.3 Integrative data mining and comparison to other databases. ....................................... 44

Figure 2.4 Comparison to other published proteomes. ................................................................. 46

Figure 2.5 Integration of multiple data resources with the EPS proteome. .................................. 48

Figure 2.6 Immunohistochemistry images as obtained from the Human Protein Atlas. .............. 50

Figure 2.7 Study workflow. Proteomic analysis of EPS-urine and urine samples. ...................... 53

Figure 2.8 Characterization of the EPS-urine proteome.. ............................................................. 54

Figure 2.9 Functional enrichment of proteins detected in EPS-urine and comparison of the EPS-

urine proteome to other related body fluids. ................................................................................. 56

xvii

Figure 2.10 Comparison between the two proteomics data of the current study. ......................... 59

Figure 2.11 Data mining to identify prostate-enriched proteins. .................................................. 62

Figure 2.12 Additional annotations for prostate-enriched proteins. ............................................. 63

Figure 2.13 Validation of mass spectrometry data for seven selected proteins. ........................... 65

Figure 3.1 Workflow of study design from the discovery of differentially expressed proteins to

verification of candidates in prostatic fluids. ................................................................................ 79

Figure 3.2 Average number of unique proteins identified in the duplicate MudPIT runs for

discovery direct-EPS samples. ...................................................................................................... 81

Figure 3.3 Total number of peptides (A) and total number of spectra (B) recorded for each

individual direct-EPS analyzed in the discovery phase. ............................................................... 81

Figure 3.4 Venn diagram depicting the total number of proteins exclusively expressed in the EC

or OC groups. ................................................................................................................................ 82

Figure 3.5 The current proteomics dataset compared to prostate-related proteomics data from

various reports. .............................................................................................................................. 82

Figure 3.6 Proportion of proteins with predicted SPs and TMDs. ................................................ 83

Figure 3.7 Gene Ontology enrichment of the EC and OC direct-EPS. ......................................... 83

Figure 3.8 Distribution of all 624 proteins identified in the current direct-EPS data set. ............ 84

Figure 3.9 Two significantly enriched pathways within the 133 differentially expressed proteins

are involved in hemostatic and immunological processes. ........................................................... 86

Figure 3.10 Additional modules enriched by pathway analysis. .................................................. 87

Figure 3.11 Protein prioritization scheme for the selection of candidates for verification. .......... 88

Figure 3.12 Measurement of PSA and PAP in direct-EPS. .......................................................... 90

Figure 3.13 Differential expression of verified candidates. .......................................................... 92

xviii

Figure 3.14 Differential expression of candidates measured by immunoassays. ......................... 93

Figure 3.15 Standard curves generated for each peptide used for SRM-MS measurements. ....... 97

Figure 3.16 Verification of differential expression of candidates in EPS-urines pooled from

patients with EC or OC prostate tumors. ...................................................................................... 98

Figure 3.17 PSA measurements in individual EPS-urines from patients with EC and OC prostate

tumors by ELISA .......................................................................................................................... 99

Figure 4.1 Study workflow outlining the three major phases of the current project. ................. 113

Figure 4.2 Peptides in EPS-urine.. .............................................................................................. 115

Figure 4.3 Phase 3 peptides. ....................................................................................................... 117

Figure 4.4 Cohort B SRM-MS data quality. ............................................................................... 120

Figure 4.5 Absolute quantification of Phase 3 peptides in EPS-urine. ....................................... 122

Figure 4.6 Comparative quantification of candidates in different subgroups of EPS. ............... 125

Figure 5.1 Overview of all proteomics datasets generated for various prostate-proximal fluids..

..................................................................................................................................................... 138

xix

List of Appendices

Appendix Table 1.1 Definitions of the American Joint Committee on Cancer for prostate

cancer tumor staging

Appendix Table 2.1 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS

samples


samples that could be mapped to the Human Gene Atlas mRNA

microarray


samples that could be mapped to an antibody in the Human Protein Atlas


samples and 22Rv1 prostate cancer culture medium


samples and PC3 prostate cancer cell culture medium


samples and LNCaP prostate cancer cell culture medium


samples and seminal plasma


samples and normal urine


samples and ≥ 10-fold above median in prostate tissue on the Human Gene

Atlas microarray

Appendix Table 2.10 Proteins identified in either the individual EPS-urines or the pooled

samples pre- and post-DRE

xx

Appendix Table 2.11 All 49 prostate-enriched proteins identified in the current study including

additional annotations

Appendix Table 3.1 Protein report

Appendix Table 3.2 Peptide report

Appendix Table 3.3 Pathway enrichment analysis for major network modules identified by

clustering algorithm

Appendix Table 3.4 Peptide sequences used for SRM-MS

Appendix Table 3.5 SRM-MS relative quantification of peptides

Appendix Figure 3.1 Transitions and integrated quantifier ions

Appendix Table 4.1 Information for the Phase 1 (232) peptides selected for SRM-MS

evaluation

Appendix Table 4.2 Fold changes of diagnostic and prognostic peptides in Cohort A measured

by SRM-MS

Appendix Table 4.3 Figures of merit for 34 peptides selected for verification

Appendix Table 4.4 Cohort A patient information

Appendix Table 4.5 Cohort B patient information

Appendix Table 4.6 Functions of Phase 3 protein candidates

Appendix Figure 4.1 Response curves for each of the 34 peptides analyzed in the verification

phase

Appendix Figure 4.2 Correlation between the 34 Phase 3 peptides from EPS-urine of Cohorts A

and B

xxi

List of Abbreviations

AMACR alpha-methylacyl-coenzyme A racemase

AR androgen receptor

AUC area under the curve

BCR biochemical recurrence

BPH benign prostatic hyperplasia

CA125 cancer antigen 125

CID collision induced dissociation

CNA copy number alteration

CV coefficient of variation

DIA data independent acquisition

DRE digital rectal examination

DTT dithiotreitol

EC extracapsular

ELISA enzyme linked immunosorbent assay

EPS expressed prostatic secretions

ESI electrospray ionization

FDR false discovery rate

FFPE formalin fixed paraffin embedded

FI functional interaction

FT-ICR Fourier transform ion cyclotron resonance

GO Gene Ontology

HGPIN high grade intraepithelial neoplasia

HPA Human Protein Atlas

ICAT isotope coded affinity tags

IL-6sR interleukin 6 receptor

iTRAQ isobaric tags for relative and absolute quantification

KLK3 kallikrein 3

LC liquid chromatography

LOD limit of detection

LOQ limit of quantification

m/z mass-to-charge ratio

MALDI matrix assisted laser desorption ionization

miRNA microRNA

MME membrane metalloendopeptidase

mRNA messenger RNA

MS mass spectrometry

MS/MS tandem mass spectra

MSMB microseminoprotein-β

MudPIT multidimensional protein identification technology

NC noncancer

xxii

NPV negative predictive value

NR nonrecurrent

NSAF normalized spectral abunance factor

OC organ-confined

PAP prostatic acid phosphatase

PARK7 Parkinson protein 7

PCA3 prostate cancer antigen 3

PPV positive predictive value

PRM parallel reaction monitoring

PSA prostate specific antigen

PSMA prostate specific membrane antigen

QconCAT quantification concatemer

QqQ tripe quandrupole

ROC receiver operating characteristic

RP radical prostatectomy

RP reverse phase

SCX strong cationic exchange

SD standard deviation

SD signal peptide

SFN stratifin

SID stable isotope dilution

SILAC stable isotope labeling with amino acids in cell culture

SRM selected reaction monitoring

TGF-β1 transforming growth factor beta 1

TGM4 transglutaminase 4

TIMP1 tissue inhibitor of metalloproteinase 1

TMD transmembrane domain

TMPRSS2:ERG transmembrane protease serine 2 gene fusion with E twenty-six transcription factors

TMT tandem mass tags

TOF time-of-flight

uPA urokinase plasminogen activator

1

Chapter 1

Introduction and thesis overview

Portions of this chapter have been reformatted from the original publication: Kim, Y., Kislinger

T. (2013). Novel approaches for the identification of biomarkers of aggressive prostate cancer.

Genome Medicine 5:56

1 Introduction and thesis overview

1.1 The prostate gland

1.1.1 Anatomy and histology of the prostate gland

The prostate is a secretory gland that sits at the base of the bladder in the male urogenital tract

(Figure 1.1). It surrounds the urethra and functions to produce fluid that consists of factors that

nourish sperm and aids in fertilization.

Three major regions of the prostate that differ histologically and biologically exist: the

peripheral, central, and transition zones (1). The transition zone is commonly the site of benign

prostatic hyperplasia, whereas carcinoma occurs primarily in the peripheral zone of the gland (1,

2). Surrounding the prostate is an integral fibromuscular band known as the prostatic capsule.

Within the epithelium of the prostate, three distinct cell types exist (Figure 1.2): secretory

luminal cells, basal cells, and neuroendorcrine cells (2). Secretory luminal cells predominate the

epithelium and function to produce prostatic proteins, including prostate specific antigen (PSA),

that are secreted into the lumen of the prostatic duct. They are characterized by the expression of

cytokeratin 8 and 18, CD57, and androgen receptor (2-7). The basal cells form a continuous

layer, separating luminal cells from the basement membrane and stroma (1). They do not

produce secretory proteins and are characterized by the expression of cytokeratins 5, 14, and

CD44 (3-8). Lastly, a small population of neuroendocrine cells are dispersed throughout the

basal cell layer (2). These cells are believed to support the growth of the secretory luminal cells

(9, 10), and can, in rare cases (1% of all prostate cancers), become neoplastic, producing highly

aggressive small cell carcinoma of the prostate (2).

2

Figure 1.1 The prostate gland. The secretory gland is located at the base of the bladder in the

male urogenital tract. The prostatic urethra runs through the organ, carrying urine through it.

Three major zones of the prostate exist: the central, transition, and peripheral zones.

3

Figure 1.2 Cell types of the prostate gland. Secretory luminal cells predominate the prostate

epithelium, releasing proteins into the lumen of the prostatic acini. Basal cells separate the

luminal cells from the basement membrane and stroma. A small population of neuroendocrine

cells are dispersed throughout the basal cell layer.

1.2 Prostate cancer

1.2.1 Prostate cancer epidemiology and statistics

Prostate cancer is the most common cancer among Canadian men (excluding non-melanoma skin

cancer), occurring in 1 of every 8 men and killing every 1 in 28. In 2015 alone, 24,000 new cases

and 4100 deaths are estimated in Canadian men (11). Most patients are diagnosed after age 65

and most men over 85 years of age have histological prostate cancer (12). The world-wide

incidence rate of this disease can vary considerably, with the highest rates occurring among

black American men and the lowest in India (13). These differences can be attributed to genetic

4

susceptibility, exposure to external risk factors, or the accuracy of cancer registration across

countries (14). Despite these daunting statistics, only 15-20% of men will present with

aggressive disease (15, 16), while the majority of patients remain asymptomatic for many years.

The incidence of prostate cancer is climbing in both high and low-risk populations around the

world for different reasons (17). For instance, in the US, the wide-spread use of PSA screening

has led to the identification of asymptomatic men who harbor the disease (18), while in Sweden,

where screening is much less common, the frequent use of transurethral resection of the prostate

and random biopsies has led to a staggering 100% increase in prostate cancer detection over the

last three decades (18). These practices cannot explain, however, the rising incidence of prostate

cancer in developing countries, where unidentified external risk factors likely play a role (19).

Epidemiological studies have demonstrated a stronger familial clustering of prostate cancer

amongst patients when compared to age and race-matched controls (20). For instance, brothers

of young prostate cancer patients (age at diagnosis <63 years) have a four-fold increased risk of

the disease compared to their brothers-in-law and the general population (21). Brothers and

fathers of prostate cancer patients have a 76% higher risk of prostate cancer than the male first-

degree relatives of control subjects (22). Interestingly, the risk of prostate cancer increases for

first-degree relatives of prostate cancer patients with decreasing age at diagnosis in the patients;

thus, suggesting that screening in men with a family history of prostate cancer should begin at an

earlier age (23, 24). Polymorphisms in genes regulating androgen metabolism and metabolites of

oxidative stress related to apoptosis (25-32) have been linked to prostate cancer. Androgens play

a vital role in the development of the prostate, and medical castration (withdrawal of

testosterone) is a method of treatment for patients with metastatic prostate cancer. Animal studies

have demonstrated that testosterone and dihydrotestosterone can induce the formation of prostate

tumors; however, a definitive study in humans correlating androgen concentrations and prostate

cancer has yet to surface (14). Dietary factors such as high intakes of alpha-linolenic acid and

calcium, as well as red meat cooked at high temperatures, may be linked to increased risk of

disease (14). Other factors such as smoking and alcohol consumption have also been

investigated with varying results.

5

1.2.2 Histopathology and clinical progression of prostate cancer

Prostate tumors predominantly arise from the malignant transformation of luminal epithelial cells

(adenocarcinomas), accounting for >95% of all cancers of the prostate. Other, much less

prevalent categories of prostate cancer are ductal adenocarcinoma, mucinous carcinoma, signet

ring carcinoma, and neuroendocrine. Primary prostate cancer consists of multiple independent

and genetically distinct foci (33-37), while different metastases within an individual may be

clonally related (38, 39). This suggests that metastatic cancer may arise from the selective

advantage of individual clones during cancer progression (40). Age is the greatest risk factor for

prostate cancer; even still, prostates from healthy men between the ages of 20 and 40 frequently

bear histologic foci of prostate cancer (41-43). This suggests that, although multiple neoplastic

transformation events may occur in the prostate, most of them only give rise to latent disease and

do not progress to clinical presentation. Furthermore, latent prostate cancer may arise from a

different pathogenic program from clinical prostate cancer, whereby activating events are

mitigated or there is sufficient tumor suppression to maintain the foci in a subclinical state (40).

High grade prostatic intraepithelial neoplasia (HGPIN), which occurs in 0.7-7.1% of needle

biopsies (44), is thought to be an immediate precursor of early invasive prostate cancer (2, 45,

46), although not a prerequisite for carcinomas (Figure 1.3). This is supported by their existence

in proximity to carcinomas within the peripheral zone and their resemblance to early invasive

carcinoma including multifocality, similarity in chromosomal abnormalities and reduction in the

expression of the differentiation markers E-cadherin and vimentin (2, 47-53). In contrast,

neoplasms can be distinguished from early invasive cancer by the disruption of the basal layer,

which is completely lost in cancer (54). Prostatic intraepithelial neoplasia is histologically

characterized by the presence of luminal epithelial hyperplasia, reduction in basal cells,

enlargement of nuclei and nucleoli, cytoplasmic hyperchromasia, nuclear atypia, and a marked

elevation of cellular proliferation markers (45).

6

Figure 1.3 Progression of human prostate cancer. Adapted Shen and Abate-Shen, Genes

Dev. 2010 © Cold Spring Harbor Laboratory Press. Prostate cancer arises as a result of

genetic events that culminate in the transformation of the secretory epithelial cells

(adenocarcinoma). Prostatic intraepithelial neoplasia is thought to be a non-obligatory precursor

for prostate cancer.

Secondary metastasis of prostate cancer most commonly occurs in bone, where it forms

characteristic osteoblastic lesions (55, 56). Although the mechanisms by which prostate cancer

cells form metastases in bone are poorly understood, tumor cells from patients with localized and

advanced disease can be detected in circulation as well as in bone marrow (57-59).

1.2.3 Current concepts of insignificant versus significant prostate cancer

Insignificant and indolent are terms used interchangeably to prostate describe cancer that is low-

grade, small volume, organ-confined (OC), and thus not likely to progress to clinical significance

in the absence of treatment (60). The Epstein preoperative criteria was developed in an attempt to

clinically and pathologically define subgroups of patients with insignificant disease who may be

spared treatment (61). Insignificant disease is defined as: PSA <10 ng/ml, organ-confinement

without extraprostatic extension, no Gleason pattern 4/5, and <33% positive biopsies (60-64).

Extraprostatic or extracapsular (EC) extension refers to the presence of tumor beyond the

confines of the prostate, and is defined as carcinoma mixed with periprostatic adipose tissue, or

bulging out beyond the contours of the gland. The presence of extraprostatic extension is

associated with an increased risk of biochemical recurrence (BCR) (65). The reported incidence

of clinically insignificant tumors is approximately 40% (62); however, about 30% of patients are

misclassified based on the criteria and present significant disease upon inspection of their radical

7

prostatectomy (RP) specimens (60). Furthermore, even insignificant tumor patients have a 10%

5-year risk of BCR after treatment (66). Therefore, there is a clear and urgent need for

identifying truly biologically significant prostate cancer.

Between 27% and 53% of patients who receive definitive treatment for prostate cancer develop

BCR within 10 years of treatment, as indicated by two consecutive PSA values of >0.2 ng/ml

(67, 68). However, only about 30% of patients with PSA relapse develop clinical recurrence and

6% of them die from it (69, 70). Interestingly, the rate of recurrence appears to be correlated with

the risk of disease. Kapadia et al., demonstrated that in patients treated with radiotherapy, BCR

occurred in 8%, 15%, and 36% of low-risk, intermediate-risk, and high-risk patients, respectively

(71). Time to post-RP BCR, PSA velocity and doubling time, pathological stage and Gleason

score are important clinical parameters for stratifying patients into risk groups for prostate

cancer-specific mortality (72). Approximately half of the patients develop local disease, while

the other half develops distant metastatic disease with or without local disease. Patients may

undergo diagnostic imaging and biopsies to evaluate clinical recurrence and may take a course of

salvage therapy (radiotherapy, hormone therapy, salvage RP), with varying outcomes depending

on factors such as age, pre-treatment PSA, and the presence of distant metastatic disease.

1.2.4 Genetic alterations and molecular subtypes of prostate cancer

Genetic mutations found in prostate cancers occur in abrupt periodic bursts, resulting in large

scale reshuffling of DNA leading to cancer (73). This phenomenon, known as “chromoplexy”, is

a process by which distant genomic regions are disrupted in a coordinated fashion that eliminates

cancer-suppressor genes, leaving genes that may drive cancer progression. Common mutations in

prostate cancers are somatic copy number alterations (CNA), point mutations, structural

rearrangements, and aneuploidy (74). Copy number alterations are present in 90% of prostate

cancers, and are more common in prostate cancer than other types of cancers (75).

Highly altered primary tumors are associated with increase risk of relapse (76). Deletions can be

seen on chromosomes 6q, 8p, 10q, and 13q and include tumor suppressing genes such as NKX3-

1, pTEN, BRCA2, and RB1. Meanwhile, frequent amplification of chromosomes X, 7, 8q, and

9q can be observed, which include the androgen receptor (AR) and MYC oncogenes (74).

Structural rearrangements are more commonly observed in high-risk prostate tumors, of which

8

the TMPRSS2-ERG and ETS fusions are present in 50% of cases (77). Interestingly, recurrent

mutations in SPOP (involved in JNK and Hedgehog signaling pathways) may define a new

molecular subtype of prostate cancer since this mutation is only found in ETS fusion-negative

tumors and exhibit a pattern of distinct alterations in 5q21 and 6q21 (78).

Like breast, renal, and ovarian cancers, primary prostate cancer has a somatic mutation rate of 1

x 10-6 to 2 x 10-6 (79-81), leading to thousands of point mutations that can exist in the prostate

tumor genome; however, only about 20 per genome may have influence on protein stability and

function (74). In prostate cancer, point mutations have been observed in TP53, PTEN, RB1, and

PIK3CA tumor suppressors (78, 81-83), and activating mutations have been reported in the

oncogenes KRAS and BRAF.

1.2.5 Current methods of prostate cancer and diagnosis

In the absence of symptoms, early screening of prostate cancer begins with a digital rectal

examination (DRE) and a PSA blood test. The DRE is a physical examination in which the

physician examines the prostate for nodules through the rectum. Prostate specific antigen, also

known as kallikrein 3 (KLK3), is a glycoprotein and serine protease that is produced by the

secretory epithelial cells of the prostate. Elevated levels of PSA in the bloodstream (the most

widely accepted cutoff for PSA is 4.0 ng/ml) is associated with prostate cancer. However, PSA

increases with benign prostatic conditions such as benign prostatic hyperplasia (BPH) and

prostatitis, as well as transient causes such as acute urinary retention and ejaculation within 48

hours of testing (84).

The estimated sensitivity and specificity of the PSA test at a cutoff of 4.0 ng/ml was ~21% and

91%, respectively, for detecting prostate cancer (85). The positive predictive value (PPV; i.e., the

proportion of men with PSA >4.0 ng/ml who have prostate cancer) is ~30% (86-88), thus about 1

man in 3 with an elevated PSA will have prostate cancer upon biopsy; with PSA >10 ng/ml, the

PPV increases to ~40-60% (87, 89). The negative predictive value (NPV; i.e., the proportion of

men with a PSA ≤4ng/ml who do not have prostate cancer) is 85% (90). Prostate specific antigen

screening has been shown to reduce prostate cancer-specific mortality by 20% but the number of

men needed to be treated in order to prevent one death is estimated to be 18-29 (91).

9

The confirmation of a prostate cancer diagnosis can only be established by examining biopsy

samples. The biopsy is performed with transrectal ultrasound (TRUS) imaging to help guide the

biopsy needles in the prostate. Typically, 6 or more cores are taken to sample different areas of

the organ. Prostate cancer from biopsy specimens is marked by the absence of p63 and

cytokeratin 5 and 14, markers for basal cells (92, 93), while α-methylacyl-CoA racemase is

upregulated in cancerous lesions (93-95). If cancer is identified, the pathologist will determine

the grade and stage of the cancer.

Disease prognosis is defined as the prediction of malignancy in the absence of or post therapy

(96). Prognostic factors are those that, independent of treatment, predict relapse or progression

(96). Currently, pathologic Gleason grading is considered to be the best predictor of outcome

(97). Devised by Dr. Donald Gleason, this procedure involves the histologic evaluation of the

extent of glandular differentiation and pattern of arrangement of carcinoma cells in hematoxylin

and eosin (H&E) stained tissue sections (98). Five grades, from 1 (well differentiated) to 5 (most

poorly differentiated), are assigned to the most predominant growth pattern (primary grade) and

the second most common pattern (secondary grade), which are summed to generate a Gleason

score ranging from 2 to 10.

Patients with Gleason scores 7 or higher are at increased risk of extraprostatic extension and

recurrence after therapy (99, 100). Furthermore, individuals with Gleason 4+3 tumors may be at

greater risk of prostate cancer-specific mortality than Gleason 3+4 patients (101). Poorer

prognosis is associated with Gleason 4 or 5 cancers, while patients with tumors composed of

Gleason 3 pattern are unlikely to progress and die of the disease (101, 102). The defining

difference between Gleason 3 and 4 is a focal loss of integrity of discrete glandular units of the

tumor. Gleason 3 cancers consist of acini that have a smooth and rounded edge with an intact

basement membrane, while Gleason 4 tumors are composed of poorly differentiated, fused

glands, frequently with irregular borders that invade the stroma (103). Gleason 5 pattern is

represented by sheets of cancerous cells due to the complete loss of any rounded glandular shape

(54).

Gleason grading has also been linked to various clinical end points such as clinical stage,

response to therapy, biochemical failure, progression to metastatic disease, and cancer-specific

and overall survival (104-106). Needle biopsy Gleason grade is typically combined with serum

10

PSA and clinical stage in Partin tables, which predict risk for extraprostatic extension, seminal

vesicle invasion, lymph node metastasis, and aid in treatment planning (107). Needle biopsy

Gleason grading provides some idea of the pathologic stage, but is not absolutely accurate;

patients with low-grade prostate cancer may still progress and develop tumors that spread outside

of the prostate, while some high grade tumors may never extend outside of the gland. Although

Gleason grading allows for the assignment of two separate patterns in an individual sample, thus

controlling for some degree of tumor heterogeneity within the gland, an average of 2.7 Gleason

grade patterns are observed in whole prostates (33), with some even having four different

patterns (108). Even tertiary patterns of high grade Gleason pattern 4 or 5 that make up <5% of

the tumor have been found to impact stage and progression (109), so high grade tertiary patterns

are now recommended to be included in reports (107). In addition, it has been recommended that

the most predominant pattern and the highest grade should be summed as the Gleason score, in

cases where the highest grade is not the most predominant nor the second most common pattern

observed (110).

Staging is performed in order to assess the extent of spread of the cancer within the prostate or to

other parts of the body and to facilitate treatment planning. Various tests are performed in order

to gather sufficient information for an accurate assessment of the disease, including imaging and

biopsies. The American Joint Committee on Cancer TNM cancer staging system (Appendix

Table 1.1) is most widely used and is based on five factors: 1) the extent of the primary tumor (T

category); 2) the involvement of lymph nodes (N category); 3) the presence of metastasis (M

category); 4) serum PSA at diagnosis; and 5) the Gleason score (obtained from biopsy or

prostatectomy). Nomograms are multivariate prognostic algorithms that combine different

prognostic factors and facilitate treatment decision-making. Preoperative nomograms include

information such as ethnicity, prostate weight, serum PSA, clinical stage, and Gleason score.

Postoperative nomograms incorporate additional information such as surgical margins, seminal

vesicle invasion, and regional lymph node involvement.

11

1.3 Prostate cancer biomarkers

1.3.1 Introduction to cancer biomarkers

Accurate and timely assessment of prostate cancer prognosis remains one of the most significant

clinical challenges in prostate cancer management. Rapid advances in molecular technologies are

likely to lead to significant progress in the foreseeable future. Despite these technological strides,

prostate cancer is still over-diagnosed and many patients are unnecessarily treated. Possible

reasons are the multifocal and heterogeneous nature of this disease leading to frequent

misclassification of patients, intra-institutional variability, and patient variability; all of which

contribute to the lack of well-defined and validated prognostic biomarkers.

A biomarker is a measureable biological indicator that can provide information about the

presence or progression of a disease, or the effects of a given treatment. A clinically useful

biomarker should have high sensitivity and specificity, as well as high PPV and NPV, and

facilitate clinical decisions to administer optimal care (111) (Table 1.1).

According to the Early Detection Research Network (112), a biomarker should undergo five

major phases of development before it can be confidently utilized under clinical settings and

benefit the population. These phases are: i) preclinical exploratory studies, where

tumor/aggressive disease-associated samples are compared to non-tumor/indolent disease

specimens in order to identify molecular characteristics that distinguish both cohorts and can be

further explored; ii) clinical assay development and validation, whereby an assay that can

accurately measure the biomarker and reliably segregate tumor from non-tumor specimens is

developed; iii) retrospective longitudinal studies that utilize specimens from individuals who

were monitored over time for the development or progression of disease (e.g., patients who

progress from indolent to aggressive prostate cancer) are compared to individuals who do not

develop disease or do not progress; iv) prospective screening studies that are performed using the

assay in order to evaluate the extent of disease at the time of detection; and v) randomized

control studies that are performed in order to determine the reduction of disease burden in the

population, as a result of performing the assay.

12

1.3.2 Clinical and analytical considerations for cancer biomarker assay

development

Once a putative cancer biomarker has been determined, a clinically useful assay must be

developed. Clinical validation involves determining the sensitivity and specificity, which

measure the diagnostic accuracy of a test (113) (Table 1.1). Clinical sensitivity refers to the

ability of a test to correctly classify individuals affected by the disease. Specificity is the ability

of the test to correctly identify individuals without the disease. These parameters depend on the

decision threshold used for a test; that is, a predetermined threshold that, when exceeded,

indicates disease or state (113). Receiver operating characteristic (ROC) analysis can be used to

evaluate the performance of a clinical assay and is used for determining the threshold (Figure

1.4). Various methods of calculating the optimal decision threshold exist. The most common

method is to find the point on the curve closest x = 0, y = 1 coordinates (114, 115). The area

under the curve (AUC) of the ROC curve is a measure of the sensitivity and specificity of a test

over all threshold values. An ideal assay would have an AUC of 1.0 (100% sensitivity and

specificity), while an assay with no discriminatory power would have an AUC of 0.5.

13

Figure 1.4 Receiver operating characteristic curve. A biomarker with good discriminatory

power has an AUC closer to 1.0 (e.g., marker A). A biomarker with no discriminatory power has

an AUC of 0.5 (e.g., marker B).

The sensitivity and specificity of a biomarker tell us about the accuracy of the test, but do not

provide information about the probability of disease. The clinical prevalence of the disease in

question also plays an important role in the performance of an assay. If the prevalence of the

disease is high in the population, it is more likely that an individual who tests positive for a

disease will truly have the disease than if the prevalence of the disease is low. The parameters

that need to be evaluated are the PPV and the NPV. The PPV indicates the likelihood that, given

a positive test result, the patient actually has the disease; the NPV indicates the likelihood that,

given a negative test result, the patient is free of the disease in question. Although an assay may

have high accuracy, low prevalence of the disease may limit its adoption into clinical use (113).

For instance, the ovarian cancer biomarker, cancer antigen 125 (CA-125), has a PPV of 4%;

hence, only 4 out of 100 positive test will indicate actual disease (113, 116).

14

Table 1.1 Descriptors of disease biomarkers.

Assays targeting a single or a multitude of biomarker candidates must also meet rigorous

analytical standards for reliable quantification and reproducibility. Parameters that should be

accounted for include precision, limit of detection (LOD), limit of quantification (LOQ),

linearity and working range, and specificity (113). These so called Figures of Merit (117) may be

affected by differences in sample matrix and external factors that affect the assay’s sensitivity

(117). Therefore, a calibration curve, represented by the response versus the concentration of a

given analyte (117), should be generated for each analyte for which an assay is being developed.

Calibration is defined as the procedure for determining the relationship between the assay’s

generated signal for a given analyte and the analyte’s concentration (113). Preferably in the

sample matrix that the assay will be used in, a series of concentrations of the analyte (spanning at

least two orders of concentration magnitude) is introduced into the matrix. The data is plotted as

the measured concentration versus the spiked-in concentration. The slope of the curve represents

the analytical sensitivity of the assay for the analyte. A slope of 1 and an intercept of 0 is ideal,

signifying a perfect correlation between the theoretical and measured concentrations of the

analyte.

The calibration curve is also used to define the linear range of detection, the LOD and LOQ, and

the reproducibility of the assay. The lowest concentration at which an analyte can be reliably

distinguished from noise is represented by the LOD (113). The lower LOQ is the lowest analyte

concentration at which a quantitative measurement can be reliably made, while the upper LOQ is

described as the highest concentration of the analyte that falls within the linear range of the assay

(117). The calibration curve can also provide information about possible interference or

endogenous signals from the sample matrix, determined by measurements of the blank sample.

15

Precision, usually expressed as standard deviation (SD), variance, or the coefficient of variation

(CV), is determined by the closeness of agreement between a series of measurements for an

analyte in a sample under specific conditions. Within-run precision (repeatability) refers to

repeated measurements performed under the same conditions. Within-laboratory precision

(reproducibility) is determined by measurements made across laboratories, under different

conditions such as analysis by different technicians (113). Low analyte concentrations may

hamper precision as the detection limit of the assay is reached. Biological variation – i.e., the

fluctuation of an analyte concentration around a homeostatic set point – can greatly affect

precision and thus should be taken into consideration.

1.3.3 Biological sources of biomarkers

When faced with the task of identifying disease biomarkers, “gold standard” samples would be

those in which cases and controls differ absolutely and exclusively for the disease of concern

(118). This would require that cases and controls be identical in all respects except for the

condition for which the biomarker is being identified. Model systems, including cell lines and

genetically homogeneous animals that reflect human conditions may be useful in this respect

(118). These systems allow for ample longitudinal sample collection, have uniform genetic

background, and are typically subjected to highly controlled environments that greatly abrogate

the vast variation seen amongst human subjects. However, models can only provide insight into

potential biomarkers candidates, and should not be wholly relied upon for subsequent analyses.

Table 1.2 Common sources of prostate cancer biomarkers.

16

Human tissues represent an essential class of biomarker sources (Table 1.2). Indeed, tissue can

provide histopathological features, molecular factors, and genetic aberrations or patterns that

may be indicative of disease, patient outcome, and facilitate treatment design. Allowing for

direct evaluation of protein or gene expression, tissue samples can provide important insights

into the biological mechanisms of a given condition (119). However, tissues are obtained

through invasive means that are associated with risks for the patients and limit repeat

measurements. Blood (i.e., serum or plasma) is an attractive alternative to tissue specimens that

is already in routine clinical use. Factors such as circulating tumor cells and DNA, microRNA,

metabolites, and soluble proteins can be non-invasively and repeatedly collected with ease.

Unfortunately, the immense complexity, wide dynamic range, and dilution of specific disease

markers make it extremely difficult to discover protein biomarkers in this fluid. To facilitate this

goal, extensive fractionation strategies and affinity approaches that remove some of the most

abundant blood constituents can be utilized, but introduces additional sample preparation steps.

1.3.3.1 Tissue proximal fluids

Tissue proximal fluids are a class of biofluids that are in direct contact with the disease site. It is

rationalized that local bathing of the site results in enrichment of protein biomarkers compared to

distal circulating fluids (120, 121). In the context of prostate cancer, prostate proximal fluids

include urine, seminal fluid, semen, and expressed prostatic secretions (EPS). Expressed

prostatic secretions can be collected directly from the prostate just prior to RP. Although this

fluid is presumably abundant in prostatic proteins, it is not possible to collect more than once.

Urine has been shown to be a source of microRNAs (122), RNAs (123), soluble proteins (124),

and exosomes (125), and can be considered as a prostate proximal fluid since the prostatic

urethra carries it through the prostate. In the context of a DRE, urine containing EPS (EPS-urine)

can be routinely collected in voided urine. Indeed, EPS-urine has previously been interrogated

for prostate cancer-related proteins and prostasomes (126, 127). A major advantage of urine over

serum or plasma, with regards to protein biomarker detection, is that its contents remain

relatively stable and do not undergo massive proteolytic degradation (128). Nevertheless,

variations in collection volume may result in varying protein concentrations, underscoring the

need for standardized collection protocols.

17

1.3.3.2 Biomarkers in biological fluids

Fluctuations in the levels of molecules in biological fluids, such as blood and tissue proximal

fluids, can serve as cancer biomarkers. As cancer progresses, tumor angiogenesis and tissue

invasion causes the destruction of the tissue architecture. This results in the release of factors into

the interstitial fluid and delivery into the circulation by way of the lymphatics. For instance,

PSA, which is normally abundantly expressed by the secretory epithelial cells of the prostate

diffuses through the basement membrane, stromal layer, and the walls of blood and lymphatic

capillaries into the circulation, and can be seen in the serum of healthy men in the range of 0.5-2

ng/ml (129) (Figure 1.5). In prostatic carcinoma, the destruction of tissue architecture through

the loss of basal cells and degradation of the basement membrane, angiogenesis, and disruption

of cell attachment, causes PSA to leak out of the glandular lumen and into circulation, reaching

serum PSA levels beyond 4 ng/ml (129).

Figure 1.5 Prostate specific antigen access to circulation. Adapted from Kulasingam and

Diamandis Nature Clinical Practice. 2008 © Macmillan Publishers Limited. Normally, small

amounts of PSA enter the circulation by way of the lymphatics. Under diseased states, the

architectural integrity of the prostate is compromised. This leads to PSA leakage out of the gland

and subsequent elevation in circulating PSA levels.

18

Changes in protein abundances may also be explained by aberrant secretion or shedding of

secreted or membrane-bound proteins. Indeed, mutations in the signal peptide (SP) domains of

proteins causes impairment of the SP function and cellular localization of human proteins (130).

Furthermore, a loss of epithelial cell polarity in cancer may result in irregular trafficking of

plasma membrane proteins to apical and basolateral domains causing misdirectional release of

proteins into the extracellular domain (131). Quantitative changes in molecules may also be

explained by fluctuations in gene expression. Alterations in gene or chromosome copy numbers

and changes in transcriptional activity have been described for prostate cancer and are

highlighted in the following sections.

1.3.4 Adjunct prostate cancer biomarkers

In broad terms, current and proposed alternative or adjunct markers for prostate cancer can be

divided into clinical-pathological features and molecular factors. These include the classic

Gleason grading and more recently discovered molecular features that might offer insight into

disease progression and prognosis. Various methodologies can be utilized to identify these

factors (Figure 1.6), and some of them are highlighted in the following sections.

19

Figure 1.6 Various strategies for novel molecular biomarker identification. Adapted from

from Kulasingam and Diamandis, Nature Clinical Practice. 2008 © Macmillan Publishers

Limited. Advancing technologies have paved the way for disease biomarker discovery. The

integration of multiple discovery modalities holds promise for novel molecular biomarkers.

1.3.4.1 Cellular markers

Ki-67 is a nuclear protein that is associated with cellular proliferation (132). Its

immunohistochemical staining index has been correlated with outcome in treated patients (133-

136). Heterogeneous immunohistochemical staining for α-methylacyl-coenzyme A racemase

(AMACR) has been correlated with Gleason score (137), and low AMACR gene expression in

localized prostate cancer has been linked to recurrence and metastasis (138). Prostate specific

membrane antigen (PSMA) is a transmembrane protein expressed in all types of prostatic tissue

that is used in the diagnosis of prostate cancer (139). Its over-expression is associated with

higher tumor grade, stage, PSA recurrence, and metastatic disease (140, 141).

20

1.3.4.2 Genetic aberrations as prognostic biomarkers

Focusing on a specific pathway or a group of interrelated genes involved in fundamental tumor

biology has also proven useful. Cuzick et al. (142) focused on genes involved in cell cycle

progression and measured mRNA expression of 126 genes in formalin-fixed paraffin-embedded

prostate cancer tissues. A 31-gene signature was generated on the basis of their correlation with

the mean expression of the entire panel of 126 genes. When used to retrospectively score patients

who underwent prostatectomy and patients with localized disease, this signature was shown to

predict recurrence after surgery and risk of death in conservatively managed patients,

independently of Gleason score and other clinical factors. Using comparative transcriptomic

analyses, Ding et al. (143) identified the robust activation of the Tgfβ/Bmp-Smad4 signaling

pathway in indolent Pten-null mouse prostate tumors. Deletion of Smad4 in the Pten-null mouse

prostate led to highly proliferative, invasive, metastatic and lethal tumors. Combined with key

molecular players, cyclin D1 and osteopontin, their four-gene signature (PTEN, SMAD4, cyclin

D1 and osteopontin) could predict BCR and supplement the Gleason score in predicting lethal

metastasis of prostate cancer in patients.

Genomic instability, caused by variations such as CNAs, may also be useful prognosticators of

prostate cancer. Recently, Lalonde et al., identified a 100-loci DNA signature that could predict

5-year biochemical relapse in patients treated with radiotherapy or RP; its effect augmented by

tumor hypoxia (144). In a comprehensive genomic analysis of prostate cancer, Taylor and

colleagues analyzed CNAs in primary prostate tumors and found distinct patient clusters with

varying degrees of relapse, with no association with Gleason score (76). Penney and colleagues

(145) constructed a 157-gene signature based on the comparison of Gleason ≤6 and Gleason ≥8

patients. When applied to patients with Gleason 7 scores, their signature improved the prediction

of lethality, compared to Gleason score alone. DNA methylation patterns in prostate cancer may

also provide insight into prostate cancer outcome. Cottrell et al. (146) performed a genome-wide

scan in patients with early recurrence, high Gleason score or advanced stage and identified 25

methylation markers that were significantly different between low and high Gleason score

patients. Furthermore, the methylations of 3 markers (GPR7, ABHD9, Chr3-EST) were found to

be significantly increased in patients with recurrence, as measured by elevated post-

prostatectomy PSA levels.

21

1.3.4.3 MicroRNAs

MicroRNAs (miRNAs) are a class of small, non-coding RNA molecules, involved in the

negative regulation of gene expression. Porkka and colleagues (147) demonstrated distinct

miRNA expression profiles of BPH, untreated prostate cancers, and hormone-refractory prostate

cancers, suggesting a potential prognostic role for miRNAs. Mitchell et al. (148) demonstrated

that tumor-derived miRNAs are present in plasma and could show that miR-141 was

significantly elevated in the sera of prostate cancer patients, demonstrating the utility of miRNAs

as blood-based cancer biomarkers. Khan et al. (149) analyzed localized prostate tumors and

adjacent normal tissues, as well as samples from advanced cases using isobaric tags for relative

and absolute quantification (iTRAQ) followed by mass spectrometry (MS). Integrating their

findings with a cancer microarray database, the authors identified differentially expressed

proteins that are targets of miR-128, which was further supported by in vitro experiments

demonstrating a role for miR-128 in prostate cancer invasion.

1.3.4.4 Circulating biomarkers

Urokinase plasminogen activator (uPA) and its inhibitor, PAI-1, have been associated with

aggressive prostate cancer exhibiting extraprostatic extension and seminal vesicle invasion, and

post-prostatectomy recurrence in patients with aggressive disease (150). Preoperative plasma

levels of transforming growth factor beta 1 (TGF-β1) has been shown to be a predictor of BCR

(151) and, in conjunction with preoperative plasma levels of interleukin 6 receptor (IL-6sR), has

been associated with metastasis and progression (152).

Disseminated tumor cells in the bone marrow, a common site of prostate cancer metastasis, have

been shown to have an association with metastatic disease and high Gleason score (153, 154).

Although disseminated tumor cells may be a prognostic marker of unfavorable outcome in

patients with localized disease at diagnosis, attention has shifted to tumor cells that have entered

the peripheral blood as these are more easily accessible. The number of circulating tumor cells

can be determined at the time of diagnosis and elevated numbers, as indicated by reverse

transcriptase polymerase chain reaction for PSA, have been associated with advanced stage and

increased Gleason score (155). Goodman et al. (156) determined that prior to treatment, a cut-off

22

value of 4 circulating tumor cells per 7.5 ml of blood or more was negatively correlated with

survival and could predict metastasis.

1.3.4.5 Urinary biomarkers

Prostate cancer antigen 3 (PCA3) is a prostate specific non-coding RNA that was first identified

in a comparative transcriptomics study in tumor and adjacent normal tissues (157). Subsequently,

a RT-PCR based test was developed for its detection in urinary EPS (158). A ratio of the

PCA3:PSA RNA, known as the PCA3 score, is used in combination with other clinical

information to guide decisions for repeat biopsy in men who are 50 years of age or older and

who previously have had at least one negative prostate biopsies. Interestingly, Nakanishi et al.

(159) reported the mean PCA3 score to be significantly lower in patients with low volume and

low-grade prostate tumors, compared to patients with advanced tumors. However, the ability of

the PCA3 test to predict aggressive prostate cancers is under debate (159-161). Tomlins et al.

(77) first reported the occurrence of a recurrent TMPRSS2:ERG (transmembrane protease serine

2 gene fusion with E twenty-six (ETS) transcription factors) fusion transcript. These fusions

were detectable in 42% of urinary EPS samples from men with prostate cancer (162), although

their presence in urinary sediment was not correlated with biopsy Gleason scores (163).

Telomerase is a ribonucleoprotein involved in telomere synthesis and repair (164). Its activity,

which can be measured in urinary EPS using the telomeric repeat amplification protocol assay

(165, 166), was found to be increased in prostate cancer and has been shown to be associated

with Gleason score (165). Urinary annexin A3 and various matrix metalloproteinases have also

been shown to have diagnostic/prognostic potential for prostate cancer (167-170).

Approximately 3% of the total urinary protein content is comprised of exosomal proteins (171),

thus representing a sub-fraction for discovery of prostate cancer biomarkers (172, 173).

Exosomes are small vesicles (40-100 nm) that are secreted by various normal and tumor cells

(173), containing protein, RNA and lipids (174). Wang et al. (175) used shotgun proteomics to

generate the largest catalogue of urinary exosome proteins to date. In their study, over 3000

unique proteins were identified from samples derived from nine healthy individuals. Exosome

secretion is elevated in biofluids of cancer patients, including prostate cancer (176) and

exosomes have been shown to be enriched in tumor cell-specific transcripts (177, 178). MiRNA

and mRNA are transferable between cells via exosomes and have been shown to be functional in

23

their new location (179). Nilsson et al. (173), in a proof-of-concept study, showed that urinary

exosomes derived from prostate cancer patients contain two known biomarkers (PCA3 and

TMPRSS2:ERG) and thus may be used as disease biomarkers.

1.4 Clinical proteomics for biomarker discovery

Proteins are the effector molecules of the cell and are likely to be the most affected in disease

(118). Proteomes are dynamic and dictated by cellular mechanisms that regulate gene expression,

post-transcriptional events, and post-translational modifications. Furthermore, significant

changes occur in proteomes in response to physiological and environmental stimuli. Hence,

cataloguing the proteome – the entirety of proteins expressed by a particular cell, tissue, or

organism under specific conditions (180) – may provide important indicators of disease.

Clinical proteomics is defined as the analysis of the identity, quality and the quantity of proteins

in patient samples (181). Mass spectrometry-based proteomics is the most widely applied tool in

clinical proteomics and holds promise to identify new disease biomarkers, drug targets, and

aberrant molecular pathways in disease. A typical cancer biomarker discovery workflow consists

of a discovery phase, whereby a comprehensive comparative catalogue of proteins under

different disease states is generated. This is followed by verification of putative biomarkers using

targeted methods of quantification, and finally, validation and clinical assay development (118).

1.4.1 Discovery proteomics using shotgun, bottom-up proteomics

Proteomic analysis of intact proteins (known as top-down proteomics) largely remains limited by

sensitivity and throughput (182). Therefore, bottom-up proteomics – based on the analysis of

proteolysed peptides emerging from proteins (183) – has been widely adopted in proteomic

investigations. Shotgun proteomics refers to the application of bottom-up proteomics to protein

mixtures (184-186).

A mass spectrometer consists of an ion source that ionizes analytes, a mass analyzer that

measures the mass-to-charge ratio (m/z) of the ionized analytes, and a detector that measures the

number of ions of each m/z value (187). Most commonly, proteins and peptides are volatized and

ionized using the soft ionization approaches: electrospray ionization (ESI) (188) or Matrix-

Assisted Laser Desorption Ionization (MALDI) (189). Because ESI ionizes analytes out of

24

solution, it is amenable to liquid-based separation such as liquid chromatography (LC), and is the

preferred method for the analysis of complex samples (187).

As ionized peptides enter the mass spectrometer, they are guided and manipulated by electric or

magnetic fields in a vacuum system. The basic types of mass analyzers used for proteomics are:

time-of-flight (TOF), quadrupole, ion trap, Fourier transform ion cyclotron resonance (FT-ICR),

and more recently, Orbitrap. These can also exist as hybrids, such as triple quadrupoles, TOF-

TOF, and quadrupole TOF. Time-of-flight instruments determine the m/z of an ion based on the

amount of time it takes to travel through a flight tube and hit a detector (190-194); lighter ions

fly faster than heavier ions, and thus reach the detector sooner. Quadrupoles are essentially mass

filters, allowing the selective passage of ions of a defined m/z by applying a particular potential

that stabilizes their trajectory down the quadrupole, while filtering out ions of other m/z values

(195, 196). A mass spectrum is generated by the step-wise application of different potentials and

detecting the ions that pass through at each m/z value over the m/z range (196). Ion traps capture

and accumulate incoming ions in a dynamic electric field; subsequently ejecting them to the

detector according to their m/z. Orbitrap mass analyzers trap ions around a central spindle

electrode (197-199). The oscillation frequency of trapped ions is mass dependent and can be

converted from the recorded time-domain signal into a m/z spectrum using the Fourier

transformation algorithm (200). Similarly, FT-ICR mass analyzers resolve the masses of ions by

recording the rotation frequency of the ions in a magnetic field (201). Detectors (e.g. electron

multiplier) detect ions that pass through the mass analyzer and generate an electric current that is

proportional to their abundance.

The workflow of a typical shotgun experiment begins with the enzymatic digestion (usually with

the protease trypsin) of a mixture of proteins in a gel or in solution into peptides, followed by LC

separation. The generation of high-quality spectra heavily depends on the sufficient fractionation

of immensely complex samples (for instance, a typical serum digest contains 400,000 – 600,000

peptides (202)). First reported by Washburn and colleagues (184, 203), multidimensional protein

identification technology (MudPIT) is a technique of fractionation, yielding comprehensive

proteome coverage (204) (Figure 1.7). Here, peptides are loaded on a biphasic LC column

consisting of a strong cationic exchange (SCX) and a C18 reverse phase (RP). The protein

mixture is separated by means of SCX using iterative steps of increasing salt concentration (“salt

25

bumps”) followed by RP chromatography. Peptides are thus separated in two orthogonal

dimensions prior to MS analysis.

Figure 1.7 Schematic of multidimensional protein identification technology (MudPIT). In

this setup, a peptide mixture sample (S) is automatically loaded onto a biphasic pre-column

consisting of a strong cation exchange (SCX) and a reverse phase (RP) resin, coupled to an

analytical RP column. The mobile phase is water with increasing concentrations of an organic

solvent (e.g., acetonitrile). The eluting peptides are subsequently introduced to the mass

spectrometer by way of ESI. A series of increasing concentrations of ammonium acetate steps

(pink to red circles) further separate increasingly basic (charged) peptides.

In shotgun proteomics, intact peptide ions (known as precursor/parent ions) travel through the

mass spectrometer, where their m/z values are recorded to generate a mass spectrum referred to

as a MS or survey scan (196, 206). Following each survey scan, precursor ions are isolated based

on abundance in a data-dependent manner and subsequently fragmented. This is known as a

MS/MS or product ion scan and generates tandem mass spectra. The MS spectrum provides

information about the exact mass and charge of a peptide and the MS/MS spectrum provides

information about the amino acid sequence of that peptide. This is compared to theoretical, in

26

silico tandem mass spectra in a protein database and peptides are assigned to protein sequences

in a process known as protein inference.

The process of data dependent acquisition collects MS/MS data for only 3-10 of the most

abundant precusor ions that exceed a predetermined intensity threshold (207). In highly complex

samples, the number of co-eluting ions can significantly exceed the number of MS/MS spectra

the mass spectrometer can record (202). Therefore, high abundance peptides are more likely to

be selected for fragmentation, while low abundance species will be sampled by chance. This

“random sampling” of peptides results in the masking of low abundance peptides by co-eluting

peptides of higher abundance, and is affected by sample complexity, chromatographic resolution,

ionization efficiency, ion suppression, and MS cycling time (202).

The dynamic concentration range of proteins, especially in clinical samples, remains a significant

challenge in proteome analyses by current LC-MS systems. For instance, the plasma proteome

concentration range spans up to 12 orders of magnitude (Figure 1.8). The issue is further

amplified by the presence of highly abundant proteins such as serum albumin (35-30 mg/ml) and

immunoglobulins (5-18 mg/ml), which mask lower abundance proteins below one microgram

per milliliter (208, 209). To circumvent this, fractionation of the proteome can be performed in

order to reduce sample complexity and increase sensitivity of detection. However, this approach

is significantly more time consuming, labour intensive, and introduces another level of

variability due to the introduction of more steps in sample processing (208). Alternatively, high

abundance proteins, such as albumin in plasma, can be selectively removed by immunodepletion,

thereby enriching for less abundant proteins (210). Yet another approach exploits the principles

of Michaelis-Menten kinetics of protein digestion in order to remove high-abundance proteins

(211). In this method, a limited amount of protease, trypsin, is introduced to a lysate, whereby

high-abundance proteins are digested before lower-abundance proteins, following Michaelis-

Menten kinetics. The resulting peptides are then removed via a molecular weight cut-off spin

filter and the remaining proteome is digested under standard digestion conditions and analyzed

by LC-MS/MS. This method has been shown to improve the identification and sequence

coverage of low-abundance proteins by 3-fold. Other avenues of tackling the wide dynamic

range and the immense heterogeneity of complex proteomes is to enrich for sub-proteomes (for

example, N-glycoproteins account for the majority of the current clinically used biomarkers and

27

drug targets, as they are often secreted by cells or found at the cell surface (212)) and tissue-

proximal fluids that may be more abundant in relevant biomarker candidates.

Figure 1.8 Dynamic concentration range of human plasma proteins. Adapted from Schiess

et al., Molecular Oncology. 2008 © Federation of European Biochemical Societies. The

dotted line represents the concentration range of various plasma proteins. Classic plasma

proteins, such as albumin, are in excess of 5 orders of magnitude higher in abundance than tissue

leakage products and other potentially disease-related proteins.

1.4.2 Protein quantification using mass spectrometry

Mass spectrometry-based quantification strategies can be categorized as label-free or labeled

(Table 1.3). Label-free quantification includes spectral counting and extracted ion

chromatograms. Spectral counting (202, 203, 213) is an estimate of the protein abundance based

on the frequency of MS/MS spectra (spectral counts) generated from a given protein divided by

the number of theoretically observable peptides for that protein (known as the protein abundance

index) (214). It is based on the assumption that higher abundance proteins produce more tryptic

peptides, resulting in higher spectral counts. Indeed, a linear relationship between spectral counts

and abundance has been demonstrated over at least two orders of magnitude (202, 215);

however, this method biases against low abundance species.

28

The intensity-based approach uses the measured peptide chromatographic peak areas of peptide

precursor ions that belong to a given protein (202, 213, 216, 217). The key to this approach is the

alignment of chromatographic peaks in order to achieve accuracy in comparative quantitative

data. High quality peaks have aligned retention times, matching precursor ions, charge state, and

fragment ions (218). The AUC of the aligned peptides from each sample are derived and

compared for estimating relative protein abundances. The AUC has been shown to directly

correlate with the concentration of peptides in the range of 10 fmol – 100 pmol (216, 219). One

obvious factor that must be carefully taken into consideration when using this approach is the

presence of co-eluting peptides and shifts in retention time. It has been demonstrated that both

spectral counting and extracted ion chromatograms correlate with protein abundances in complex

samples (217, 220-223).

Isotope labels can be resolved either at the MS or MS/MS level, depending on the type of

labeling method adopted. These include stable isotope labeling with amino acids in cell culture

(SILAC) (224), using mass tags such as isotope-coded affinity tags (ICAT) (225), isobaric tags

for relative and absolute quantification (iTRAQ) (226) and tandem mass tags (TMT) (227).

Although labeling methods allow for more accurate quantification, additional preparation steps

and costs should be considered.

Absolute quantification of proteins by MS is based on stable isotope dilution (SID) (228), which

relies on the introduction of predetermined amounts of stable heavy isotope labeled standards

into the sample. Isotope dilution has been used in the quantification of small molecules for

decades (229), and has been adopted for the absolute quantification of peptides in biological

material in recent years. Peptide labeling methods include chemical synthesis (AQUA) (230),

biologically synthesized artificial quantification concatemer (QconCAT) peptides (231), and

intact isotope labeled protein standard absolute quantification (232-234).

Absolute quantification (AQUA) peptides are highly purified, chemically synthesized peptides

with 13C and 15N isotope labels at lysine and arginine residues. These standards have identical

sequences and thus the same physicochemical properties and chromatographic elution times to

endogenous peptides, but they have a known mass difference that is resolvable by MS.

Quantification using this approach is very straightforward – known amounts of AQUA internal

standards are spiked into the sample prior to the LC-MS step and the resulting recorded

29

abundance of the standard is compared to the endogenous peptide. Although a highly attractive

method, some caveats exist: due to limitations of chemical synthesis, certain peptides such as

those above 15 amino acids in length and those containing chemically reactive residues (e.g.,

oxidation of tryptophan, methionine, cysteine, etc.) or sequences (such as aspartate-glycine, N-

terminal glutamine, etc.) may not be compatible. AQUA peptides are very expensive, due to the

labour intensive process of production, purification, and quantification. Furthermore, since

AQUA peptides are usually fully tryptic peptides, complete tryptic digestion of the target

peptides must be ensured in order to prevent quantification of only the digested fraction (235).

Table 1.3 Summary of common labeled and label-free protein quantification techniques

using MS.

1.4.3 Targeted proteomics

Discovery proteomics does not require a priori knowledge about the contents of the sample being

analyzed, but the stochastic nature of shotgun proteomics limits the reproducible detection and

quantification of proteins between different experiments. Targeted proteomics approaches

selectively analyze only a subset of predefined peptides within a proteome, and hence, offer

30

much higher sensitivity, selectivity, and accuracy as compared to quantification by discovery

methods.

1.4.3.1 Selected reaction monitoring mass spectrometry

Selected reaction monitoring mass spectrometry (SRM-MS) is the most widely used targeted MS

technique and has been used for reliably quantifying small molecules (236). In more recent

years, it has been adopted as a robust, sensitive, reproducible and specific assay for protein

quantification (237-239) and validation of cancer biomarkers from various sources (240-244).

Selected reaction monitoring mass spectrometry allows for the detection and quantification of

predefined sets of peptide targets by employing the use of triple quadrupole (QqQ) mass

spectrometers that allow for two-stage mass filtering. In SRM-MS mode, a precursor ion of a

specific m/z is selected in a front-end mass analyzer (Q1) (Figure 1.9). The ion is then

fragmented by collision-induced dissociation (CID) in a collision cell (Q2), and its resulting co-

eluting fragment ions (transitions) (206) are selectively measured as a function of time in a

second mass analyzer (Q3).

Figure 1.9 Selected reaction monitoring mass spectrometry. By using two stages of mass

filtering in a triple quadrupole mass spectrometer, a specific peptide can be isolated, fragmented,

and subsequently analyzed. This method offers a high level of sensitivity and specificity and can

be coupled to stable isotope dilution for accurate quantification of proteins in complex mixtures.

In order to develop a SRM-MS assay, a priori selection of proteotypic peptides (peptides that are

unique to the proteins of interest and are reliably detected by MS) and the determination of their

MS coordinates (peptide elution time, most intense fragments, and relative intensities of

fragments) is required. Only a small fraction of the millions of peptides resulting from the

enzymatic digestion of a cellular proteome are proteotypic (245), and thus, the ability to target

31

these greatly facilitate peptide detection and alleviate redundant peptide sampling and analysis

time (206). Computational tools to guide the selection of proteotypic peptides have been

developed based on physicochemical properties of frequently observed peptides from large-scale

shotgun experiments (245). Tools such as SRM-MS Collider (246) can identify redundant

transitions of a peptide within a proteome background of choice that can help determine

potentially interfering transitions. Spectral libraries from previous experiments (either available

through online repositories such as the Global Proteome Machine, SRMAtlas, or through own

experimental data) can also be used to help select frequently observed proteotypic peptides.

Alternatively, recombinant proteins of candidates can be digested and analyzed by MS to

generate a spectral library. The acquired spectra can provide information for the peptide SRM-

MS assay: m/z of the precursor and its fragments, the preferred ionization charge state, the

relative intensity of the fragment ions signals, and the chromatographic retention time (206).

To achieve sufficient sensitivity and precise quantification, the SRM-MS cycle time and dwell

time should also be considered. The dwell time refers to the amount of time the instrument

spends monitoring a transition. A higher dwell time, that is, the longer the instrument spends

measuring a given transition, the higher the sensitivity of the assay for that transition. The cycle

time is the sum of all the dwell times of each transition to be monitored in the assay. A longer

cycle time results in fewer data points being measured over the chromatographic elution time of

each peptide, which can affect both the detection and quantification of peptides. Under

conditions where too many transitions are monitored in a single run, the dwell time may be

reduced to sufficiently compromise sensitivity and quantitative accuracy. In such cases a

scheduled approach, whereby the instrument scans for transitions only during a small window of

time centered around a peptide’s expected elution time, can significantly augment the number of

monitored transitions in a single experiment. For less abundant peptides, the dwell time can also

be increased during the elution window in order to increase sensitivity.

1.4.3.2 Application of selected reaction monitoring mass spectrometry to

biomarker research

Validation of proposed biomarker candidates in a major bottleneck in protein biomarker

development. The gold standard for clinical protein biomarker quantification is enzyme linked

immunosorbent assay (ELISA), which relies on the use of specific antibodies. Although ELISAs

32

have excellent specificity and sensitivity, the limited number of antibodies and the cost of

developing a clinically validated ELISA assay (which runs up to $1 million (247, 248)) severely

limit their use for biomarker validation. Protein quantification by SRM-MS has been shown to

have good correlation with ELISAs (249, 250), although its sensitivity has been shown to range

from 34-60% lower depending on the analyte (251). Considering the vast numbers of proposed

biomarkers, the capability of SRM-MS to simultaneously quantify a multitude of analytes in a

matter of hours compared to the usually single analyte analysis of ELISA, significantly facilitates

the timely appraisal of putative candidates. Selected reaction monitoring mass spectrometry

assays can be used to rapidly evaluate putative biomarker candidates, of which the most

promising candidates would be carried forward for ELISA assay development. Therefore, SRM-

MS can serve as an attractive complementary technique to ELISA in the biomarker validation

pipeline.

Selected reaction monitoring mass spectrometry application to the quantification of proteins in

complex biological systems has made considerable headway in recent years. Picotti et. al (252)

demonstrated the application of SRM-MS to target proteins in a complex biological system (total

S. cerevisiae extract) and its ability to cover a dynamic range of over 4-5 orders of magnitude. In

their work, low-abundance proteins that were previously unidentifiable by MS were sucessfully

detected and quantified. Hüttenhain et al. (253) developed a high-throughput workflow for the

quantification of cancer-associated proteins in human urine and plasma. Their study, which

utilized SRM-MS, tracked 408 urinary proteins detectable by SRM-MS. Interestingly, 169 of

these were previously undetected in the datasets from Human Protein Atlas and a urinary

proteome dataset from Adachi et al. (124). Furthermore, using SRM-MS assays in ovarian cancer

and benign ovarian tumor patient plasma, the authors demonstrate the reproducible differential

expression of a number of candidates. In another study, Cima and colleagues (240) focused their

analyses on the N-glycoproteome of Pten-null mouse serum and prostate. Label-free comparative

analysis of the Pten-null animals and age-matched wild-type mice revealed 111 and 12

candidates that were significantly differentially expressed in tissue and sera, respectively. Next,

the authors utilized SRM-MS assays to reliably quantify the 39 protein orthologs (selected based

on consistent quantification) in sera of prostate cancer patients and controls, and used the

resulting profiles to build predictive regression models for diagnosis and grading of prostate

cancer. In our own study (described in Chapter 3), I identified a number of putative prognostic

33

markers of prostate cancer and verified the differential expression of these in prostatic fluids

using SRM-MS (244).

1.5 Rationale and objectives

1.5.1 Rationale

There exists a wide discrepancy between the number of men with prostate cancer and those

dying from it. Although treatment efficacy has improved, the discrepancy is, at least in part,

attributable to the detection of cancers that are non-life threatening within the patient’s lifetime

(254). Considering the potentially serious hazards associated with prostate biopsies, and even

more so with treatment procedures, the current modalities of diagnosis and prognosis are failing

to accurately classify patients based on the significance of their disease and subjecting them to

unnecessary risks. Establishing non-invasive molecular features capable of distinguishing

insignificant prostate tumors from significant ones would greatly alleviate such burdens. In the

current thesis, I have utilized proteomics methodologies to investigate the molecular factors

present in prostatic fluids in order to delineate differential protein signatures of prostate cancer

subtypes.

1.5.2 Hypotheses and aims

(1) Discovery proteomic exploration of EPS and identification of prostate-enriched proteins

I hypothesized that the EPS proteome is a rich source of prostatic proteins. The following aims

were carried out to test the hypothesis:

a) Catalogue the direct-EPS and EPS-urine proteomes using discovery proteomics

techniques;

b) Integrate the proteomics data with other available resources to characterize the EPS

proteomes;

c) Comparatively analyze the proteomes of pre- and post-DRE urines for the identification

of prostate-enriched proteins;

34

d) Verify the expression of a number of enriched proteins.

(2) Comparative proteomic analyses of organ-confined and extracapsular EPS fluids

I hypothesized that comparative analyses of the proteomes of EPS from patients with EC and OC

prostate cancers would reveal putative prognostic biomarkers for prostate cancer. The following

aims were carried out to test the hypothesis:

a) Characterize the proteomes of EPS derived from prostate cancer patients with EC and OC

tumors;

b) Implement a data mining strategy to prioritize proteins based on their biomarker potential

utilizing pathway analysis, publicly available datasets, and literature review;

c) Verify the differential expression of candidates using immunoassays;

d) Evaluate the application of targeted proteomics using SRM-MS in EPS-urine;

e) Develop SRM-MS assays and quantify candidates in EPS-urine.

(3) Systematic large-scale development of targeted proteomics assays for biomarker verification

in prostate-proximal fluids

I hypothesized that protein signatures of prostate cancer could be generated and accurately

quantified in EPS-urine using a multiplexed proteomics assay. The following aims were carried

out to test the hypothesis:

a) Pinpoint putative protein biomarkers and their proteotypic peptides based on discovery

proteomics data from Aim 2;

b) Design SRM-MS assays for peptides using synthetic peptide standards in a urine background;

c) Apply SRM-MS assays to a cohort of EPS-urines from patients with EC, OC prostate cancer,

as well as noncancer individuals for relative quantification of candidates;

35

d) Develop a single SRM-MS assay capable of quantifying multiple biomarker candidates;

e) Apply SRM-MS assay to an independent cohort of EPS-urines for absolute quantification and

verification;

f) Utilize bioinformatics and statistics to identify a protein signature to distinguish different

patient groups.

36

Chapter 2

In-depth proteomic analyses of prostate proximal tissue fluids

This chapter has been reformatted from the following original publications:

Drake, R.R., Elschenbroich, S., Lopez-Perez, O., Kim, Y., Ignatchenko, V., Ignatchenko, A.,

Nyalwidhe, J.O., Basu, G., Wilkins, C.E., Gjurich, B., Lance, R.S., Semmes, O.J., Medin, J.A.,

and Kisilnger, T. In-depth proteomic analyses of direct expressed prostatic secretions. Journal of

Proteome Research 2010; 9:2109. © 2010 American Chemical Society

Principe, S.*, Kim, Y.*, Fontana, S.*, Ignatchenko, V., Nyalwidhe, J.O., Lance, R.S., Troyer,

D.A., Alessandro, R., Semmes, O.J., Kislinger, T., Drake, R.R., and Medin, J.A. Identification of

prostate-enriched proteins by in-depth proteomic analyses of expressed prostatic secretions in

urine. Journal of Proteome Research 2012; 11:2386. © 2012 American Chemical Society *These

authors contributed equally to this work.

2 In-depth proteomic analyses of prostate proximal

fluids

2.1 Abstract

It is expected that clinically obtainable fluids that are proximal to organs contain a repertoire of

secreted proteins and shed cells reflective of the physiological state of that tissue, and thus

represent potential sources for biomarker discovery, investigation of tissue-specific biology, and

assay development. The prostate gland secretes many proteins into a prostatic fluid that

combines with seminal vesicle fluids to promote sperm activation and function. Proximal fluids

of the prostate that can be collected clinically are seminal plasma and expressed prostatic

secretions (EPS). In this chapter, I present two separate proteomic investigations into direct-EPS

and EPS-urine. In the first study, multidimensional protein identification technology (MudPIT)-

based proteomics was applied to direct-EPS obtained from nine men with prostate cancer,

resulting in the confident identification of 916 unique proteins. Bioinformatics analyses using

publicly available microarray data of 21 human tissues (Human Gene Atlas), the Human Protein

37

Atlas database, and other published proteomics data of shed/secreted proteins were performed to

systematically characterize this comprehensive proteome.

In a second study, 1022 unique proteins in a heterogeneous cohort of 11 EPS-urines derived from

biopsy-negative noncancer diagnoses, benign prostatic hyperplasia (BPH), and low grade

prostate cancer were identified. Furthermore, MudPIT was used to generate and compare the

differential proteome from a subset of pooled urines and EPS-urines (pre- and post-digital rectal

examination urines, DRE, respectively) from noncancer and prostate cancer patients. The direct

proteomic comparison of these highly controlled patient sample pools enabled the generation of

a list of prostate-enriched proteins detectable in EPS-urine that is distinguishable from the

complex urine protein background. A combinatorial analysis of both proteomics datasets and the

systematic integration with publicly available proteomics data of related body fluids, human

tissue transcriptomic information, and immunohistochemistry images from the Human Protein

Atlas database allowed the demarcation a robust panel of 49 prostate-derived proteins in EPS-

urine. Finally, the expression of seven of these proteins was validated using Western blotting,

supporting the likelihood that they originate from the prostate.

2.2 Introduction

Proximal fluids are found adjacent to a given tissue or organ and contain a repertoire of secreted

proteins and shed cells reflective of the physiological state of that tissue (255). Hence, tissue-

proximal fluids are rapidly emerging as discovery sources of proteins, metabolites, and genetic

biomarkers for cancers. Seminal plasma and EPS are prostate-proximal fluids that can be

collected in the clinic. Seminal plasma consists of fluids originating not only from the prostate,

but also from several other male accessory glands such as the seminal vesicles, epididymis, and

Cowper’s gland. In aging men with prostatic disease, obtaining seminal fluids for diagnostic

assays and screening purposes is highly dependent on patient age and disease status, and thus not

an attractive assay medium.

The exocrine compartment of the prostate is composed of differentiated epithelial cells, that

actively secrete proteins such as prostate specific antigen (PSA), prostatic acid phosphatase

(PAP), and prostaglandins into the glandular lumen (256), and hence, these proteins can be

accessed by the expulsion of prostatic fluids. Expressed prostatic secretions can be readily

38

obtained in the clinic by forced (or expressed) ejection of prostatic fluids into the urethra

following prostate massage. For evaluation of some types of prostatitis, a vigorous massage is

done to force fluid through the urethra to collect a few drops from the tip of the penis for

subsequent use in microorganism cultures. If a more gentle massage is performed in the context

of a DRE of the prostate, the EPS is then immediately collected in voided urine post-exam (EPS-

urine). Currently, a commercial genetic assay for prostate cancer detection, based on the

presence of a non-coding RNA (PCA3), uses post-DRE EPS-urines as a source of prostate-

derived genetic material (257-259).

In the context of prostate cancer, undiluted EPS (direct-EPS) can be collected while the patient is

under anesthesia, just prior to prostatectomy. In comparison to the clinical DRE massages, a

more vigorous digital rectal massage can be done to force prostatic fluids into the urethra with

subsequent collection from the penis. For prostate cancer biomarker discovery purposes, direct-

EPS has several ideal clinical features beyond its nature as a proximal fluid. Since direct-EPS is

collected just prior to prostatectomy, clinical information that led to the decision to perform a

prostatectomy is readily available (255). Post-prostatectomy pathology staging and grading can

also be obtained for comparison with the pre-surgical assessments. When linked with clinical

outcome data, these direct-EPS fluids thus have the potential to be used for discovery of both

diagnostic and prognostic biomarkers for management of prostate cancer.

A MudPIT-based proteomics approach was employed to obtain a detailed inventory of proteins

present in direct-EPS and EPS-urine samples. By using a high mass accuracy Orbitrap mass

spectrometer and rigorous identification criteria, a high quality account these fluids was obtained

to a degree not previously reported. To select for proteins likely secreted from the prostate in the

complex EPS-urine background, a highly controlled quantitative comparison of pooled samples

of urine collected prior to DRE and EPS-urine was performed. Importantly, these pre- and post-

samples were collected from the same patients, enabling direct quantitative intra-patient

comparisons to identify proteins that are upregulated in the post-DRE samples. The combination

of these data and their integration with publicly available proteomic (124, 260), microarray data

from the BioGPS portal (http://biogps.org/) (261), and immunohistochemistry images from the

Human Protein Atlas (HPA; http://www.proteinatlas.org/) (262, 263) allowed the generation of a

panel of 49 proteins likely secreted by the prostate and detectable in EPS-urine. Additionally, the

39

entire proteomics data was deposited to the Tranche server (www.proteomecommons.com) for

others to utilize.

2.3 Results

2.3.1 In-depth proteomic analyses of direct expressed prostatic

secretions

2.3.1.1 MudPIT profiling of direct expressed prostatic secretions from

prostate cancer patients

Nine individual direct-EPS samples from men with prostate cancer (Gleason 6-7, stages pT1c-

pT2b) were selected for MudPIT-based proteomic analysis (Table 2.1). To minimize variability

introduced by sample handling, direct-EPS samples were directly digested in-solution and

analyzed by a 9-step MudPIT on a LTQ-Orbitrap XL mass spectrometer.

Table 2.1 Clinical information of the direct-EPS fluids analyzed by MudPIT proteomics. a

Serum PSA values, ng/ml. b Gleason score determined by post-prostatectomy pathology. c

Determined by post-prostatectomy pathology. d Determined pre-prostatectomy by transurethral

ultrasound measurements.

Rigorous protein identification criteria resulted in the identification of 916 unique proteins with

high confidence (Figure 2.1A and B), with 230 to 550 unique proteins identified per individual

cancer patient sample (Figure 2.1B). The list of 916 proteins (Appendix Table 2.1) obtained from

40

the direct-EPS cancer samples were compared to those reported in a recent publication by Li and

colleagues (264) who collected several EPS fluids using a more vigorous prostate massage

procedure (Figure 2.1C). Only twenty proteins were unique to the study of Li et al. (264),

consisting primarily of blood-derived proteins. The vast majority of proteins of the 916 proteins

were uniquely identified in the current study with 816 unique protein clusters and 94 protein

clusters that were shared with Li et al. Based on the very low levels of hemoglobin and

seminogelin proteins detected in the analysis, it can be concluded that direct-EPS samples have

minimal contamination with blood and seminal vesicle derived proteins. In relation to known

prostate derived proteins, the next most abundant proteins accounting for an additional 25% total

abundance were lactoferrin, PSA, PAP, zinc-α2 glycoprotein and aminopeptidase N. Functional

annotation of the direct-EPS proteome using Gene Ontology, predicted transmembrane domains

(TMD; TMAP algorithm) and signaling peptides (SP; PrediSi algorithm) revealed a wide variety

of functional categories (Figure 2.2A-C). A significant number of detected proteins were

membrane proteins and members of the extracellular region, including 34 CD molecules

(Appendix Table 2.1). This was further supported by the fact that ~ 40% of the detected proteins

had at least one predicted TMD and contained a predicted SP, indicative of targeting to the

secretory pathway (Figure 2.2D and E). A smaller number of proteins could be assigned to other

intracellular organelles.

41

Figure 2.1 Proteomics analyses of direct expressed prostatic secretions. (A) EPS from nine

men with confirmed prostate cancer were in solution digested and individually analyzed by

online MudPIT on an LTQ-Orbitrap XL mass spectrometer. Rigorous peptide and protein

identification criteria resulted in the identification of 916 unique proteins. (B) Number of unique

proteins detected in each of the analyzed EPS samples. (C) Comparison of our EPS proteome to

a previously published EPS dataset.

42

Figure 2.2 Functional enrichment of proteins detected in EPS. (A-C) Shown are selected

Gene Ontology (GO-Slim) categories of the EPS proteome. (D) Distribution of predicted

transmembrane domain containing proteins (TMAP algorithm). (E) Distribution of predicted

signal peptides (PrediSi algorithm).

43

2.3.2 Integrative data mining and comparison to other public databases

One goal of the current study was to provide the research community with a high quality, well-

annotated resource of proteins expressed in EPS. This could improve our understanding of

general prostate biology and guide the discovery of novel prostate cancer biomarkers. To

accomplish this goal, the data was mapped to two publicly available resources: the Human Gene

Atlas (265), which is based on the mRNA microarray expression profile of >50 human tissues,

and the Human Protein Atlas (263, 266), a large resource of immunohistochemistry images and

available human antibodies (Figure 2.3). The rationale for these comparisons was to identify

proteins detected in EPS fluids that are also enriched in prostate tissue relative to other human

tissues included in the microarray database. Also, integration of EPS proteins with the Human

Protein Atlas database can catalogue the availability of validated antibodies, which could provide

researchers with an important tool to further study the biological function of selected proteins.

Using only mRNAs with ≥3-fold above median expression in prostate tissue compared to 20

other human tissue types in the Human Gene Atlas, 279 direct-EPS gene products were

identified, implying a potential function specifically related to prostate biology. In the Human

Protein Atlas, a total of 513 proteins detected in the direct-EPS proteome currently have

antibodies with varying staining intensities in human prostate tissue. The identities of the EPS

proteins mapped to the two public databases are available in Appendix Tables 2.2 and 2.3.

44

Figure 2.3 Integrative data mining and comparison to other databases. (A) The EPS

proteome was mapped against the Human Gene Atlas (mRNA microarray) and Human Protein

Atlas (antibodies and immunohistochemistry images). (B) 279 unique gene products detected in

our EPS proteome showed ≥3-fold above median expression in normal prostate tissue on the the

Human Gene Atlas microarray (detected by 317 probe IDs), as compared to 20 other tissues.

45

Mapping against available Human Protein Atlas (HPA) antibodies and CD molecules is

provided.

2.3.2.1 Comparison of the direct-EPS proteome to other published

proteomic datasets

Next, the direct-EPS proteome was integrated with three other published proteomic datasets

(124, 267, 268). This included a recent publication of proteins secreted into the cell culture

media by three established prostate cancer cell lines (PC3, LNCaP, and 22Rv1) (268) and two

human body fluid datasets (urine and seminal plasma) directly related to EPS (267, 268). The

rationale for the comparison to the cell secretions was to distinguish between proteins readily

secreted by prostate cancer cells from proteins being secreted by other cell types of the prostate.

This is not an ideal comparison, however, as the cell lines are more reflective of metastatic

prostate cancers and the direct-EPS samples were from lower grade Gleason 6 and 7 cancers. An

additional caveat is that these cells have numerous genomic alterations and have been grown

under culture conditions for many passages distant from the primary isolates, and might not truly

reflect an in vivo setting. Approximately 300 proteins (Figure 2.4) were shared between the EPS

proteome and each of the three prostate cancer cell lines (see Appendix Tables 2.4-2.6 for

detailed mappings). Likewise, approximately 500 proteins were shared between the EPS

proteome and normal human urine and seminal plasma (see Appendix Tables 2.7 and 2.8).

Several well-known prostate markers (i.e. PSA, PAP) were shared among the direct-EPS

proteome and the described prostate cancer cell lines. The data provided in this study could

therefore be used to mechanistically investigate proteins identified in vivo (EPS) using well-

established prostate cancer cell lines. Interestingly, several well known prostate markers (i.e.

MSMB, TMPRSS2) were only identified in the EPS proteome and the other two available body

fluids (see below), strengthening the rationale of using in vivo obtainable fluids to study

prostate/prostate cancer biology. It is likely that proteins exclusively detected in the EPS

proteome, as compared to proteins detected in healthy human urine and seminal plasma, are

specifically enriched in secretions of the prostate under pathological conditions, and were

therefore absent or at a very low concentration, in the other body fluids.

46

Figure 2.4 Comparison to other published proteomes. The EPS proteome was compared to

proteins detected in two other published, related body fluid proteomes Pilch et al. seminal plasma

and Adachi et al. urine) and proteins secreted into the condition medium by three established

prostate cancer cell lines (Sardana et al.).

2.3.2.2 Multi-dataset integration

To fully utilize the power of multi dataset comparison, all publicly available data resources were

integrated with the current direct-EPS proteome (Figure 2.5), as an example of a potential data

mining strategy. Gene products strongly enriched in normal prostate tissue using the described

microarray dataset (10-fold above median expression) that were also detected in the direct-EPS

proteome and had antibodies through the Human Protein Atlas database were focused on. This

integration resulted in 54 unique gene products satisfying all described integration criteria

47

(Figure 2.5 and Appendix Table 2.9). A heat map display of this integration is presented in

Figure 2.5, including available mappings to other public proteome datasets. Although all gene

products are strongly expressed in prostate tissue, most showed additional expression in several

other tissues. Known prostate markers (vertical line), such as PSA, PAP, and

microseminoprotein-β (MSMB) were readily identified. Comparison to other published

proteomics datasets is also indicated in the presented heat map, and this data could be used to

further guide potential follow up analyses. As all 54 proteins displayed in the heat map have

available immunohistochemistry images at the Human Protein Atlas, the available images were

systematically screened for both normal prostate and prostate cancer tissues. Several proteins

with intriguing staining patters are discussed below, demonstrating an example of how the

current data could be used by others.

48

Figure 2.5 Integration of multiple data resources with the EPS proteome. Heat map of 54

unique gene products detected in the EPS proteome with available antibodies in the Human

49

Proteome Atlas database. Selected gene products were enriched in normal prostate tissue as

compared to 20 other human tissues using the Human Gene Atlas microarray database (≥10-fold

above median expression in prostate). Mappings to other available proteome datasets are

indicated. Well known prostate markers are highlighted by a vertical line.

Several proteins were identified which displayed a trend towards differential expression in

normal prostate compared to prostate cancer tissue using available IHC images, although patient

variability and image quality differences were also observed (Figure 2.6). Neprilysin (MME,

CD10) showed a trend towards decreased expression in prostate cancer tissue. Alternatively,

GDF15 and FASN showed a trend towards increased expression in prostate cancer tissues

(Figure 2.6). As the entire dataset has been linked to available mRNA microarray expression,

Human Protein Atlas antibody staining, and other published, related proteomics datasets, this

data will likely provide an important resource for the prostate biology/cancer community to

guide the investigation of other proteins in hypothesis driven experiments.

50

Figure 2.6 Immunohistochemistry images as obtained from the Human Protein Atlas.

Selected examples of proteins found in the Human Protein Atlas database that showed trends

towards differential expression in normal compared to cancer prostate tissues.

2.3.3 Identification of prostate-enriched proteins by in-depth proteomic

analyses of expressed prostatic secretions in urine

2.3.3.1 MudPIT profiling of expressed prostatic secretions in urine

The first aim of this study was to provide the first in-depth proteome catalogue of this clinically

useful proximal tissue fluid (Figure 2.7, left panels). In particular, individual EPS-urine samples

from men diagnosed with low grade prostate cancer (n = 5; Gleason score total 6-7) and biopsy

negative benign conditions (n = 6, BPH) were selected for MudPIT-based proteomic analyses

(Table 2.2).

51

Table 2.2 Clinical information for the prostate cancer (5 samples) and benign prostatic

hyperplasia (6 samples) EPS-urines analyzed by MudPIT. Serum PSA values (reported in

ng/ml) are from the time of initial diagnosis. The treatment column indicates the clinical course

followed for each cancer patient (DVP – DaVinci prostatectomy; Cryo, cryoablation therapy;

AS, active surveillance.

The proteomic characterization of the EPS-urine was performed on this heterogeneous group of

11 samples as a representation of the different types of patient origin, reflective of the most

common benign and prostate cancer conditions presenting in urology clinics. For each sample,

150 µg of total protein was directly digested in solution and analyzed in triplicate by a 9-step

MudPIT on a LTQ-Ion Trap mass spectrometer, as previously described (269, 270). A total of

1022 unique proteins were identified in the EPS-urines (Appendix Table 2.10) by at least two

unique peptides (Figure 2.7, left panels), ranging from 178 to 667 unique proteins determined per

individual MudPIT run (Figure 2.8A). Although the same amount of total protein was digested

and each sample was analyzed in triplicate with a relatively low overall standard deviation

(average standard deviation of 27), there was still large variation in total proteins detected for

each sample. This high variability is reflective of the biological inter-sample variations among

human specimens, which complicates data analyses. Furthermore, dynamic metabolic changes

within each individual are manifested in urine protein content (271), and highlight some of the

52

general problems of proteomic analyses of proximal body fluids (272). These results could also

reflect variation in sample collection, as the DRE procedure required to “express” the prostatic

fluids will be different for each individual (255). Other known limitations are related to the DRE

collection procedure and can be attributed to the physician (e.g. size of hand, ability to reach to

the prostate, etc.) or to some patient physical parameters (e.g. orientation on table during

examination, overweight status, etc.). Therefore, standard collection protocols as well as internal

standards are required to ensure proper collection and to circumvent the introduction of sample

variability resulting from the collection procedure (255).

53

Figure 2.7 Study workflow. Proteomic analysis of EPS-urine and urine samples. The EPS-urine

proteome was defined by MudPIT analyses of 11 individual heterogeneous samples (prostate

cancer, PCa and benign prostatic hyperplasia, BPH) of EPS-urine. A similar analysis was

performed on pooled urine and EPS-urine samples (PCa and noncancer, NC). The comparison of

the two datasets using bioinformatics data mining lead to the identification of some putative

prostate-enriched candidates within the complex EPS-urine proteome.

54

Figure 2.8 Characterization of the EPS-urine proteome. Number of unique proteins detected

per MudPIT analyses in the (A) individual EPS-urine samples and in the (B) pooled urine and

EPS-urines. The black dots in (A) (from EPS-U1 to EPS-U5) represent the EPS-urines from

individual prostate cancer patients and the white dots in (from EPS-U6 to EPS-U11) represent

the EPS-urines from individual benign prostatic hyperplasia patients. Every sample was analyzed

in triplicate. (EPS-U: EPS-urine; U: urine; PCa: prostate cancer; NC: noncancer).

To obtain a systematic overview of the functional categories of proteins expressed in EPS-urine,

Gene Ontology (GO) (273) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways

analyses (274) on the 1022 identified proteins (Figure 2.9) were performed. The comparison was

limited to several representative GO terms in order to obtain a high-level functional overview. In

the biological processes category, a large proportion of identified proteins had functional roles

55

involved in proteolytic activity, cellular adhesion and motion, and immune responses (Figure

2.9A). In the cellular component category (Figure 2.9B), GO terms with an annotation to the

extracellular region were overrepresented; this was further supported by the evidence that ~ 50%

of the detected proteins were classified in the SwissProt database as proteins with signal peptide

sequences, and more than 30% were classified as secreted proteins (Figure 2.9E). Notably, 12%

of the EPS-urine proteins contained the annotation of vesicle localization by GO (Figure 2.9B);

this could explain why the lysosomal pathway is the most highly over-represented among the

KEGG pathways in the EPS-urine dataset (Figure 2.9D). In the molecular function category, a

large proportion of detected proteins had calcium ion binding activity and were involved in the

regulation of peptidase activity (Figure 2.9C). Based on the global GO term analyses, EPS-urine

proteins belong to a large variety of functional categories and, as expected from a secreted fluid

fraction, a large number of these proteins are extracellular and contain a signal peptide sequence.

To further characterize the EPS-urine proteome in the context of other prostate-related fluids, the

current dataset was integrated with normal human urine (124) and direct-EPS (260), which

showed that 455 proteins are shared between these fluids by proteomics analyses (Figure 2.9F).

Present among these shared proteins were several previously proposed prostate cancer

biomarkers (i.e. PSA, PAP, MSMB, PSMA, TMPRSS2), strengthening the rationale of using in

vivo obtainable fluids to study the prostate and prostate cancer biology. On the other hand, these

candidates were detected in all three fluids, which demonstrates the dilemma in detecting

prostate-enriched proteins in a complex protein background of general urine proteins.

56

Figure 2.9 Functional enrichment of proteins detected in EPS-urine and comparison of the

EPS-urine proteome to other related body fluids. (A-E) The list of the 1022 unique proteins

identified in the EPS-urine samples was compared to the human UniProt Database. The reported

graphs show significantly over-represented (p < 0.001) Gene Ontology (GO) terms (A-C),

KEGG pathways (D) and SwissProt entries (E) in the EPS-urine dataset (several representative

annotation terms are shown). (F) The current EPS-urine dataset was compared to previously

published urine (124) and direct-EPS (from Section 2.3.1) (260) proteomic datasets. Proteins are

clustered in homogeneous groups based on 98% similarity (i.e. cluster anchors).

57

Focusing on prostatic secretions, 181 proteins were shared between EPS-urine and direct-EPS

(Figure 2.9F), suggesting that these proteins are specifically enriched in prostatic secretions and

absent or at a very low concentration in urine. This could represent a useful protein dataset for

selecting new prostate specific biomarkers with diagnostic and prognostic capacities for prostatic

diseases. Interestingly, 113 proteins were unique to EPS-urine (absent from direct-EPS and

urine). This could be explained by different mass spectrometry (MS) conditions used between

our group and the study by Adachi et al (124), as well the phenomenon of random sampling

(202) and biological variability, since neither study was likely successful in detecting the entire

body fluid proteome.

This study provides a detailed description of EPS-urine and extensively expands our knowledge

of the EPS-urine proteome. The dataset of 1022 proteins (Appendix Table 2.10) can be used to

implement diagnostic test platforms and improve current screening procedures for prostatic

diseases.

2.3.3.2 Identification of prostate-enriched proteins

A second aim of this study was to provide the first direct comparison of urine samples pre- and

post-DRE using pooled samples from both normal and prostate cancer patients (Figure 2.7, right

panels). These valuable samples enabled the identification of proteins likely released by the

prostate as a result of the DRE. For this purpose, five patients with prostate cancer and five

noncancer individuals were screened once prior to DRE and once after DRE, in order to obtain

internally controlled urines and EPS-urines, respectively (Table 2.3). Each sample pool was

analyzed by a 5-step MudPIT on a LTQ-Orbitrap XL, leading to the identification of 444 unique

proteins (Figure 2.7, right panels and Appendix Table 2.10), with a range of 141 to 258 unique

proteins determined per individual MudPIT run (Figure 2.8B).

58

Table 2.3 Clinical information for the pooled urine and EPS-urine samples analyzed by

MudPIT. Urine and EPS-urine sample pairs were collected from the same patient and pooled as

described in Materials and Methods (serum PSA reported in ng/ml).

In order to highlight signatures of proteins enriched in the EPS-urine, a semi-quantitative

comparative analysis of the EPS-urine and urine data was performed based on the QSpec

algorithm (275). Proteins were considered to be prostate-enriched if they had a ≥2-fold change in

spectral abundance factors (202, 275) with a false discovery rate ≤0.05 by QSpec analysis.

Applying these criteria to the entire list of 444 proteins, a panel of 49 significantly enriched

proteins was generated (Appendix Table 2.11). Interestingly, direct comparison of both

proteomic datasets (1,022 versus 444) showed an overlap of 406 proteins that included the 49

protein signature of “prostate-enriched proteins” (Figure 2.10).

59

Figure 2.10 Comparison between the two proteomics data of the current study. The Venn

diagram shows a comparison between the EPS-urine dataset (1022 proteins) and the dataset

containing proteins identified in both pooled urine and EPS-urine samples (444 proteins). The

overlapping area of 406 proteins includes a short list of proteins enriched in the EPS-urine

samples compared to the urine samples (termed: prostate-enriched proteins).

The prostate tissue selectivity of the 49 identified protein signature was also evaluated at the

transcriptome level (Figure 2.11A). The proteins were mapped in the BioGPS database, and the

gene expression profiles for normal human prostate tissue were compared to 24 other major

normal human tissues, under the assumption that transcript levels measured by microarrays can

be correlated with protein abundance (270). Although it is well known that expression levels of

both biomolecules are not always concordant (269, 276, 277), our own experience suggests

reasonable correlation if the data is locked on the protein level (i.e. using only transcripts with

available proteins evidence) (269). This type of comparison can provide additional evidence for

prostate-enriched expression levels, similar to previous investigations (260). Genes with prostate

expression levels 2-fold above the median across all 25 tissues were considered to be prostate-

enriched. The distribution of this analysis is shown in Figure 2.11A and represents the expression

level of each gene coding for the 49 selected proteins in prostate tissue. Among the 49 selected

gene products identified several well-known prostate biomarkers, such as PSA (KLK3), MSMB

60

and PAP (ACPP) (highlighted in green), supporting the hypothesis that direct comparison of

urine and EPS-urine by semi-quantitative proteomics results in the identification of prostate-

enriched proteins suitable for further investigation in biomarker studies. According to this

analysis, 32 of the 49 informative genes are enriched in the prostatic tissue. The genes indicated

with red bars in the graph correspond to the seven proteins for which antibodies were obtained

and used to verify the MS-based proteomics data.

Next, a random sampling of the dataset was examined to address the question of chance selection

in arriving at the 49-protein candidate list. Proteins were randomly selected 10,000 times from

the entire list of 1022 proteins detected in EPS-urine in order to evaluate if a random selection

from a larger dataset of EPS-urine proteins yielded the same probability of selecting prostate-

enriched proteins. The boxplot shown in Figure 2.11B reports the comparison between the two

datasets. As expected, the shortlist of 49 selected proteins is enriched in prostate-specific

proteins when compared with a random assortment of 10,000 selections from the same EPS-

urine database. The median values of the two distributions tend to be lower in the random

sampling analysis (median = 0.6) compared to the selection (median = 2.2) with a highly

significant P-value ≤0.001.

62

Figure 2.11 Data mining to identify prostate-enriched proteins. (A) Comparison of the 49

prostate-enriched proteins to published human tissue transcriptomic data from the BioGPS gene

expression portal (261). On the X-axis gene names are shown and the Y-axis represents a scale

indicating the gene expression level in prostate as compared to 24 additional human tissues. The

Y-values were obtained as log2 [fold change] ratio of prostate gene expression level versus the

median value calculated for 25 major organs (listed in Materials and Methods). Green bars

highlight protein biomarkers already suggested as candidates for prostate cancer prognosis and

diagnosis (used as internal positive controls). Red bars highlight the 7 proteins that have been

validated in this study. The red line demarcates the 2-fold increased expression level in normal

prostate as compared to all other tissues. (B) Box plot shows prostate-enriched proteins in the 49

protein candidate list versus a random sampling of the entire 1022 EPS-urine protein dataset. The

Y-axis has the same log2 scale reported in (A), rectangles are bounded by the lower and upper

quartiles, the solid lines in the rectangles are the medians, the box whiskers extend to the

minimum to the maximum data point of the rectangle, and the circles represent outliers beyond

this range. *** p-value < 0.001.

2.3.3.3 Characterization of prostate-enriched proteins

To better characterize the list of prostate-enriched proteins, information available at the Human

Protein Atlas (262, 263) and UniProt databases (278) were used. As shown in Figure 2.12A, 38

proteins (78%) of all 49 have at least one antibody ID available at the Human Protein Atlas (262,

263) (Appendix Table 2.11), which was validated and used to generate tissue expression profiles.

In Figure 2.12A, the protein expression patterns based on the immunohistochemistry images

available in the database are indicated. The evaluation of these images, based on staining

intensity and protein distribution in the human normal prostate tissue, suggested that 45% of the

proteins positivity was found in the glandular epithelium, only 13% in the stromal cells and 29%

showed equal staining between the two compartments.

63

Figure 2.12 Additional annotations for prostate-enriched proteins. (A) Availability of

antibodies reported in the Human Protein Atlas database, for the 49 selected proteins. The

immunohistochemistry images were manually screened and protein distribution was annotated

(glandular cells or stroma). (B) Subcellular location of the 49 prostate-enriched proteins

according to the UniProt database.

Together, protein tissue distribution and cellular localization are parameters that can provide

important insights into the function of a protein. As expected, the respective subcellular patterns

reported from the UniProt database (278) were correlated (Figure 2.12B) with additional

information collected from other available resources (i.e. ProteinCenter, signal peptide

annotations) (Appendix Table 2.11). The majority (33%) of the proteins in the analyzed EPS-

urine samples were secreted (without overlap with the other categories) (Figure 2.12B). This

localization is consistent with the biological expectation that urine contains, by definition, many

extracellular proteins. Another significant percentage of proteins (22.5%) were localized to the

64

membrane compartment. Interestingly, although the EPS-urine is not enriched in intracellular

proteins (18.4%), almost half (44%) of this category consist of lysosomal proteins. This probably

points out that exosome formation is the dominant excretion pathway in urine and reflects the

biological and physiological role of these proximal fluids, through the presence of specific

transport pathways for lysosomal proteins.

2.3.3.4 Generation of candidate shortlist

The analyses suggest an enrichment of prostate-specific proteins in the EPS-urines compared to

the urine samples. In particular, the list of 49 proteins (Appendix Table 2.11) encompasses a

number of prostate specific markers that are currently used or have been previously considered

as potential candidates for prostate cancer screening; among these are: MSMB (279), PSA (280,

281), PAP (reported in 1938 as the first serum biomarker for prostate cancer (282, 283),

TMPRSS2 (284), and PSMA (285).

A small verification set, aimed at validating the findings from the proteomic investigations, was

selected out of the 49 differentially expressed proteins. Selection was largely based on antibody

availability; but other unbiased considerations were tissue expression patterns of candidates

across prostate cancer and normal prostate tissues available through the Human Protein Atlas

(262, 263), tissue specificity based on mRNA microarray data (see above), subcellular

localization by GO analysis, and in-house proteomic datasets generated from urines and prostatic

fluids from various conditions. This led to a shortlist of 7 proteins that were assayed in the

verification stage.

2.3.3.5 Verification of proteomic data

In order to verify the proteomic data, Western blot analyses were carried out for each of the 7

candidates in pooled EPS-urines and urines from prostate cancer and noncancer patients. The

panel in Figure 2.13A shows that each of the shortlisted candidates are more abundant in EPS-

urine samples compared to the matching urines, supporting results from the MudPIT analyses. In

particular, comparing noncancer and prostate cancer EPS-urines showed the downregulation of

TGM4, LTF, ANPEP, MME and TIMP1, and upregulation of PARK7 and 14-3-3σ (Figure

2.13A), which correlated with the proteomic data analyses. These preliminary results also follow

a similar trend when looking at immunohistochemical staining patterns in normal and neoplastic

65

prostatic tissues via Human Protein Atlas (262, 263), but will require further verification in large

unrelated EPS-urine cohorts in the future (Figure 2.13B).

Figure 2.13 Validation of mass spectrometry data for seven selected proteins. (A) Western

blot analyses of the seven candidate proteins confirmed the proteomic data (based on SpC)

showing an enrichment of each selected protein in the EPS-urine compared to the urine

samples. (B) Immunohistochemistry images obtained from Human Protein Atlas database

showing the differential expression in normal and cancerous prostate glandular tissue of the

seven selected proteins. The same trend was obtained by MS and Western blot analysis. (U_NC:

66

urine non-cancer; EPS-U_NC: EPS-urine non-cancer; U_Ca: urine cancer; EPS-U_Ca: EPS-

urine cancer).

2.4 Discussion

The cumulative list of identified proteins in the EPS samples currently represents the largest

compilation resource of proteins present in prostatic fluid secretions from men with confirmed

prostate cancers. In comparison to other proteomics studies aimed at discovering prostate

function or prostate cancer biomarkers using cell secretions (268), plasma (250, 286), or tissue

(287, 288), the studies presented in this chapter used a relevant organ-proximal body fluid for the

detection of secreted proteins in vivo. One goal of the studies presented in this chapter was to

provide a high-quality, well-annotated resource of proteins present in EPS. This could improve

our understanding of general prostate biology and guide the discovery of novel prostate cancer

biomarkers. While commercial genetic assays have been developed using EPS-urines as a source

of prostate-derived genetic material (257, 289, 290), an in-depth proteomic analysis of this fluid

has not been reported. In contrast to direct-EPS (260), which is likely to contain prostate-secreted

proteins at a higher concentration, EPS-urine is highly diluted by a dynamic and variable urine

background. Nevertheless, EPS-urine samples are clinically more relevant since they can be

obtained by routine DRE examination, and can be collected repeatedly for longitudinal sample

collection relevant to active surveillance monitoring of men with prostate cancer. By directly

comparing pre- and post-DRE samples, a number of proteins that are enriched in EPS-urines

compared pre-DRE urines were identified.

Current early screening for prostate cancer relies on PSA detection in serum (291). Over time,

this screening has dramatically reduced the presentation of high grade disease, yet other studies

have indicated that the majority of early detected prostate cancers are not lethal and may be

considered clinically insignificant (292-294). Thus, current methods to establish the risk of

progression and prognosis of prostate cancers remain suboptimal, and a large number of patients

are over treated with a significant negative impacts on patients and on health care (292, 293).

Thus, new diagnostics are needed that could predict disease course and discriminate indolent

tumors from aggressive tumors, allowing for more appropriate treatment options to be offered to

newly diagnosed prostate cancer patients. Targeting identification of potential new biomarkers in

67

proximal fluids of the prostate like direct-EPS and post-DRE EPS-urines could address these

clinical needs.

With the availability of modern mass spectrometers and high performance peptide separations

like MudPIT (203, 205), we are now able to routinely identify hundreds to thousands of proteins

in complex biological samples. This can result in a significant challenge for the standard

biomarker validation strategy, ELISA, which is time-consuming and costly to develop. However,

MS-based assays, such as selected reaction monitoring mass spectrometry (SRM-MS), can

facilitate the rapid verification of candidates prior to ELISA development. An additional

challenge is the selection of the most likely candidate biomarkers from the large list of identified

proteins. To guide the selection criteria of promising candidate proteins involved in prostate

function and prostate cancer biomarker discovery, the EPS proteome was integrated with a large

variety of publicly available datasets. To further streamline SRM-MS assay development, the

entire MS data has been deposited to the Tranche database for others to mine, as previously

reported (269, 295, 296). Cumulatively, this EPS proteome dataset should facilitate the selection

and confirmation of individual biomarker candidates in follow up studies.

2.5 Materials and methods

2.5.1 Materials

Ultrapure-grade urea, ammonium acetate, ammonium bicarbonate, Tris, and calcium chloride

were from BioShop Canada Inc. (Burlington, ON, Canada). Ultrapure-grade iodoacetamide,

DTT and formic acid were obtained from Sigma-Aldrich. HPLC-grade solvents (methanol,

acetonitrile and water) were obtained from Thermo Fisher Scientific (San Jose, CA).

Trifluoroacetic acid was from J.T. Baker (Phillipsburg, NJ, USA). Mass spectrometry-grade

trypsin gold was from Promega (Madison, WI, USA). Solid-phase extraction C18 MacroSpin™

Columns were from The Nest Group, Inc. (Southboro, MA, USA).

2.5.2 Sample collection and properties

All samples were collected from patients following informed consent and use of Institutional

Review Board approved protocols at Urology of Virginia, Sentara Medical Group and Eastern

Virginia Medical School between 2007 and 2009, along with the Research Ethics Board of the

68

University Health Network (Table 2.1). Tumor grades, staging, prostate volumes and tumor

volumes (297) were determined using standard pathological procedures. Patient information was

recorded, including demographics, medical history, pathology results, and risk factors, and stored

in a Caisis database system. All personal information or identifiers beyond diagnosis and lab

results were not available to the laboratory investigators.

Direct-EPS fluids were obtained by vigorous prostate massage from subjects under anaesthesia

just prior to prostatectomy as a result of a prostate cancer diagnosis. Each sample (0.2-1 ml) was

collected from the tip of the penis in a 10 ml tube, diluted with saline to 5 ml, and stored on ice

until transport to the biorepository for processing (within 1 hour of collection). For processing,

any particulate material was removed by low-speed centrifugation. Aliquots (1 ml) of the

supernatant were stored at -80oC.

EPS-urine samples were collected by performing a gentle massage of the prostate gland during

DRE prior to biopsy, as previously described (260). The massage consisted of three strokes on

each side of the median sulcus of the prostate and the expressed fluid from the glandular network

of the prostate was subsequently voided in urine.

To generate sample pools from noncancer and cancer patients, 10-20 ml of urine and EPS-urine

were collected from the same individual an hour before the DRE massage, herein denoted as

urine, and after DRE, herein denoted as EPS-urine. Samples were pooled in order to generate

sufficient material for analyses. Urine and EPS-urine from a group of 5 patients with prostate

cancer and 5 biopsy negative, noncancer individuals (Table 2.3), were pooled together to

generate a sample panel comprising 4 different conditions: U_NC: urine noncancer; EPS-U_NC:

EPS-urine noncancer; U_Ca: urine cancer; EPS-U_Ca: EPS-urine cancer. After collection,

samples were stored on ice for no longer than 1 hour. Each sample was aliquoted and stored at -

80°C until use.

Individual EPS-urines were obtained from an independent cohort of 11 different patients: 5 with

low-grade prostate cancer and 6 with biopsy negative benign conditions (BPH) (Table 2.2).

Following collection, 9 ml of EPS-urine was centrifuged at 14,000 g to remove the cell

pellet/sediment. The supernatant was recovered and concentrated using an Amicon Ultra-15

Centrifugal Filter (3 kDa cutoff; Millipore, Billerica, MA, USA) according to the manufacturer’s

69

instructions. Approximately 500 μl of each concentrated EPS-urine sample was recovered from

the filter device and stored at −80°C until use.

2.5.3 Protein digestion and sample preparation

Direct-EPS fluid corresponding to 100 µg of total protein (as measured by Bradford assay) were

concentrated to ~ 50 µl using an Amicon Ultra spin filter (3kDa cut-off; Millipore) followed by

digestion as stated below. Briefly, concentrated EPS corresponding to 100 µg of total protein

was diluted with an equal volume of 8M urea, 4mM DTT, 50mM Tris-HCl, pH 8.5 and

incubated at 37ºC for 1 hour, followed by carbamidomethylation with 10mM iodoacetamide for

1 hour at 37ºC in the dark. Samples were diluted with 50mM ammonium bicarbonate, pH 8.5 to

~ 1.5M urea. Calcium chloride was added to a final concentration of 1mM and the protein

mixture digested with a 1:30 molar ratio of recombinant, proteomics-grade trypsin at 37ºC

overnight. The resulting peptide mixtures were solid phase-extracted with C18 MacroSpin

Columns from The Nest Group, Inc. (Southboro, MA, USA). Samples were concentrated by

speedvac and stored at -80ºC until used for MudPIT analysis.

Urine and EPS-urine were quantified using a NanoDrop™ 2000 spectrophotometer (Thermo

Fisher Scientific, San Jose, CA) and volumes corresponding to 100 μg of total protein for the

pools and 150 μg of total protein for the individual EPS-urines were resuspended in 50 μl of 8 M

urea, 2mM DTT, 50 mM Tris-HCl, pH 8.5, and incubated at 37 °C with constant shaking for 30

min. Carbamidomethylation was performed by incubating samples with 8 mM of iodoacetamide

for 30 min at 37 °C in the dark. Samples were then diluted to approximately 1.5 M urea using

100 mM ammonium bicarbonate, pH 8.5. Calcium chloride was added to a final concentration of

2 mM and the protein mixture was digested with trypsin (1:40 trypsin to protein ratio) at 37 °C

overnight. The digested peptide mixture was purified with C18 MacroSpin™ columns and

concentrated by vacuum centrifugation and reconstituted to a volume of 40 μl with 0.1 % formic

acid. Samples were stored at -80 °C until used for MudPIT analysis.

2.5.4 MudPIT analyses

Direct-EPS samples

70

A fully automated 9-cycle two-dimensional chromatography sequence was set up as previously

described (205). Peptides were loaded on a 7cm pre-column (150 µm i.d.) containing a Kasil frit

packed with 3.5 cm 5µ Magic C18 100 Å reversed phase material (Michrom Bioresources)

followed by 3.5 cm Luna® 5µ SCX 100Å strong cation-exchange resin (Phenomenex, Torrance,

CA). Samples were automatically loaded from a 96-well microplate autosampler using the

EASY-nLC system (Proxeon Biosystems, Odense, Denmark) at 3 µl/minute. The pre-column

was connected to an 8cm fused silica analytical column (75 µm i.d.; 5μ Magic C18 100 Å

reversed phase material (Michrom Bioresources)) via a microsplitter tee (Proxeon Biosystems) to

which a distal 2.3 kV spray voltage was applied. The analytical column was pulled to a fine

electrospray emitter using a laser puller. For peptide separation on the analytical column, a

water/acetonitrile gradient was applied at an effective flow rate of 400 nl/minute, controlled by

the EASY-nLC system. Ammonium acetate salt bumps (8µl) were applied at 0%, 5%, 10%,

15%, 20%, 25%, 30%, 40%, and 100% using the 96-well microplate autosampler at a flow-rate

of 3 µl/minute in a vented-column set-up.

All direct-EPS samples were analyzed on a LTQ-Orbitrap XL. The instrument method consisted

of one MS full scan (400-1800 m/z) in the Orbitrap mass analyzer, an AGC target of 500,000

with a maximum ion injection of 500 ms, 1 μscan and a resolution of 60,000 and using the

preview scan option. Five data-dependent MS/MS scans were performed in the linear ion trap

using the five most intense ions at 35% normalized collision energy. The MS and MS/MS scans

were obtained in parallel. AGC targets were 10,000 with a maximum ion injection time of 100

ms. A minimum ion intensity of 1,000 was required to trigger a MS/MS spectrum. The dynamic

exclusion was applied using a maximum exclusion list of 500 with one repeat count with a repeat

duration of 30 seconds and exclusion duration of 45 seconds.

Urine and EPS-urine samples

Individual EPS-urines (5 prostate cancer and 6 BPH) were analyzed in triplicate using the same

9-cycle MudPIT procedure as described above. A quaternary HPLC pump was interfaced with a

linear ion-trap mass spectrometer (LTQ, Thermo Fisher Scientific, San Jose, CA) equipped with

a nanoelectrospray source (Proxeon Biosystems, Odense, Denmark). The pooled urine and EPS-

urine samples were analyzed in triplicate on a LTQ Orbitrap XL, using a modified 5-cycle

MudPIT, as previously described (205).

71

2.5.5 Protein identification and data analyses

Direct-EPS samples

Raw data was converted to m/z XML using ReAdW and searched by X!Tandem against a locally

installed version of the human UniProt complete human proteome protein sequence database

(version 2009_11; 20,323 sequences). The search was performed with a fragment ion mass

tolerance of 0.4 Da, a parent ion mass tolerance of ±10 ppm. Complete tryptic digest was

assumed. Carbamidomethylation of cysteine was specified as fixed, and oxidation of methionine

as variable modification. A target/decoy search was performed to experimentally estimate the

false positive rate and only proteins identified with two unique high quality peptide

identifications were considered as previously reported (297) (1 decoy protein identified; FDR

<0.5%). An in-house protein grouping algorithm was applied to satisfy the principles of

parsimony.

Urine and EPS-urine samples

The following parameters were applied according to the instrument used:

LTQ analyses of individual EPS-urines

The search was performed with a fragment ion mass tolerance of 0.4 Da and a parent ion mass

tolerance of 4 Da. Complete tryptic digest was assumed. Carbamidomethylation of cysteine was

specified as fixed and oxidation of methionine as a variable modification. Only proteins

identified with two unique high-quality peptide identifications per triplicate were considered, as

previously reported (269, 296, 298) (11 decoy proteins identified; FDR ~1%). Each sample (n =

11) was examined by 3 technical replicates (33 total MudPIT analyses).

LTQ-Orbitrap XL analyses of pooled urines and EPS-urines

The search was performed with a fragment ion mass tolerance of 0.4 Da and a parent ion mass

tolerance of ±10 ppm. Complete tryptic digest was assumed. Carbamidomethylation of cysteine

was specified as fixed and oxidation of methionine as variable modification. Only proteins

identified with two unique high quality peptide identifications per analyzed sample were

72

considered, as previously reported (205, 296) (2 decoy proteins identified; FDR ~0.5%). Each

sample pool was analyzed by ≥3 technical replicates (13 total MudPIT analyses).

Protein relative abundance was calculated using the QSpec algorithm (269, 296, 298). Proteins

were considered to be up-regulated in the pooled EPS-urine samples versus the urine samples if

they complied with the following parameters: false discovery rate (FDR) <0.05 and fold change

(FC) ≥ 2, based on the QSpec algorithm (241, 295).

2.5.5.1 Gene Ontology annotation and data comparisons

Direct-EPS

Annotation mappings (i.e. GO, predicted transmembrane domains, signal peptides etc.) were

performed using the ProteinCenter bioinformatics software (Proxeon Biosystems, Odense,

Denmark). Comparison of direct-EPS detected proteins against other proteomic datasets

(prostatic secretions (275), seminal plasma (275), urine (264), condition media (267)) was

accomplished by ProteinCenter. Proteins were sequence aligned (i.e. BLASTed) against each

other and only proteins with at least 95% sequence identity were considered to match (i.e.

protein clusters). Comparison to the Human Gene Atlas (mRNA microarray) (124) and the

Human Protein Atlas (268) was accomplished via gene mapping using an in-house relational

database. For the comparison to data in the Human Protein Atlas, only proteins with available

IHC staining in normal or prostate cancer tissues were included. The staining criteria used for

antibody selection were directly adapted from the Human Protein Atlas website and are based on

rigorous multi-step antibody quality control criteria established by this consortium. Only gene

products with a medium to high IHC scoring intensity in either normal prostate or prostate

cancer tissue were used. Detailed information regarding the quality assurance can be found at

(http://www.proteinatlas.org/qc.php). Cluster analyses (Euclidean) were performed using the

open-source tool Cluster 3.0 (http://bonsai.ims.u-

tokyo.ac.jp/mdehoon/software/cluster/software.htm) and heat maps were displayed using

TreeView (http://jtreeview.sourceforge.net).

Urine and EPS-urine

73

Functional annotations (Gene Ontology terms, KEGG pathways, and Swiss Prot entries) were

assigned using the Database for Annotation, Visualization and Integrated Discovery (DAVID,

bioinformatics resources v6.7; http://david.abcc.ncifcrf.gov/) (265). Unique proteins detected in

the EPS-urine dataset were compared to the UniProt database and the top five significantly over-

represented categories were reported (p-value <0.001). Comparisons of the present EPS-urine

dataset to urine (266) and direct-EPS (299) datasets was accomplished using ProteinCenter

(Proxeon Biosystems, Odense, Denmark). Proteins were sequence-aligned against each other and

only proteins with at least 95% sequence identity were considered a match (i.e. protein clusters).

2.5.6 Prostate-enriched proteins characterization

The BioGPS portal (http://biogps.org/) (124) was used to map identified proteins against

available mRNA microarray datasets. Twenty-five major organ systems among those available in

BioGPS were selected and linked to proteins via gene accessions. The expression level for each

gene was based on averaged probe intensities, and the significant enrichment in prostate tissue

(>2 fold change) was calculated as a log2 ratio compared to the other selected tissues. The

random sampling analysis was carried out using the unpaired one-tailed Student`s t-test. A p-

value ≤0.05 was considered statistically significant. The alphabetical order of the selected organs

is as follows: bone marrow, colon, heart, hypothalamus, kidney, liver, lung, lymph node, ovary,

pancreas, placenta, prostate, salivary gland, skeletal muscle, skin, small intestine, smooth

muscle, spinal cord, testis, thalamus, thymus, thyroid, uterus, whole blood, and whole brain.

The UniProt database (260) was used to assign subcellular localization to the 49 proteins

enriched in EPS-urine pooled samples. The annotations were manually reported and the 49

proteins were grouped into three main categories: secreted, membrane, and intracellular (which

includes cytoplasmic, nuclear, and lysosomal). Identified proteins were also screened against the

Human Protein Atlas database (260) for availability of antibodies and to examine their prostate

tissue expression patterns.

http://david.abcc.ncifcrf.gov/

http://biogps.org/

74

2.5.6.1 SDS-PAGE and Western blot analysis on urine and EPS-urine

pools

For Western blotting, 40 μg of total proteins were separated on 8 or 10% SDS-PAGE gels and

blotted on PVDF membranes (0.2 μm; Bio-Rad Laboratories, Hercules, CA). Membranes were

blocked with 5% milk in TBS-Tween (0.2%) for 1 hour at room temperature and subsequently

incubated overnight at 4°C with the following primary antibodies: anti-Lactoferrin (1:1000

#ab10110; Abcam, Cambridge, UK), anti–CD10 (MME 1:1000 #ab951; Abcam, Cambridge,

UK), anti–TIMP1 (1:2000 #RP1-TIMP1; Triple Point Biologics, Forest Grove, OR), anti–CD13

(ANPEP 1:500 #ab7417; Abcam, Cambridge, UK), anti–TGM4 (1:500 #sc55791; Santa Cruz

Biotechnology, Santa Cruz, CA), anti–14-3-3σ (1:250 #ab14123; Abcam, Cambridge, UK), and

anti–PARK7 (1:1000 #ab11251; Abcam, Cambridge, UK). After three 10-minute washes with

TBS-Tween (0.2%), membranes were incubated with anti-mouse/anti-rabbit/anti-goat IgG-HRP

secondary antibody (Invitrogen, Carlsbad, CA) at a dilution of 1:25,000 for 1 hour at room

temperature, washed and visualized with the SuperSignal West Pico chemiluminescent substrate

(Thermo Fisher Scientific, San Jose, CA).

75

Chapter 3

Identification of candidate biomarkers of aggressive prostate

cancer

Portions of this chapter have been reformatted from the original publication: Kim, Y.,

Ignatchenko, I., Yao, C.Q., Kalatskaya, I., Nyalwidhe, J.O., Lance, R.S., Gramolini, A.O.,

Troyer, D.A., Stein, L.D., Boutros, P.C., Medin, J.A., Semmes, O.J., Drake, R.R., Kislinger, T.

(2012). Identification of differentially expressed proteins in direct expressed prostatic secretions

of men with organ-confined versus extracapsular prostate cancer. Molecular Cellular Proteomics

11:1870.

3 Identification of candidate biomarkers of aggressive

prostate cancer

3.1 Abstract

Current protocols for the screening of prostate cancer cannot accurately discriminate clinically

indolent tumors from more aggressive ones. One reliable indicator of outcome has been the

determination of organ-confined (OC) versus non-OC disease but even this determination is

often only made following prostatectomy. This underscores the need to explore alternate avenues

to enhance outcome prediction of prostate cancer patients. Fluids that are proximal to the

prostate, such as expressed prostatic secretions (EPS), are attractive sources of potential prostate

cancer biomarkers as these fluids likely bathe the tumor and accumulate disease related factors.

Direct-EPS samples from 16 individuals with extracapsular (EC, n = 8) or OC (n =8) prostate

cancer were used as a discovery cohort, and analyzed in duplicate by 9-step multidimensional

protein identification technology (MudPIT) on a LTQ-Orbitrap XL mass spectrometer. A total of

624 unique proteins were identified by at least two unique peptides with a 0.2% false discovery

rate (FDR). A semi-quantitative spectral counting algorithm identified 133 significantly

differentially expressed proteins in the discovery cohort. Integrative data mining prioritized 14

candidates, including two known prostate cancer biomarkers: prostate specific antigen (PSA) and

prostatic acid phosphatase (PAP), which were significantly elevated in the direct-EPS from the

76

OC cancer group. These and five other candidates (SFN, MME, PARK7, TIMP1, and TGM4)

were verified by Western blotting in an independent set of direct-EPS from patients with

biochemical recurrence (BCR) (n = 5) versus patients with no evidence of recurrence upon

follow-up (n =10). Lastly, we performed proof-of-concept selected reaction monitoring mass

spectrometry (SRM-MS)-based relative quantification of the five candidates using unpurified

heavy isotope-labeled synthetic peptides spiked into pools of EPS-urines from men with EC and

OC prostate tumors. This study represents the first efforts to define the direct-EPS proteome

from two major subclasses of prostate cancer using shotgun proteomics and verification in EPS-

urine by SRM-MS.

3.2 Introduction

Prostate cancer is the most common malignancy to affect men in the Western world, but only

15–20% of these men will present with aggressive, lethal disease (15, 300) whereas the majority

of patients will die of other causes. Although the implementation of large-scale screening for

prostate cancer using serum PSA has dramatically improved early detection of disease,

unnecessary biopsies and patient overtreatment are becoming increasingly evident (16, 300).

Consequently, there has been a shift in emphasis away from detection of prostate cancer and

toward identification of lethal disease. Currently, Gleason grading is considered to be one of the

best outcome predictors; however, patients with Gleason 7 tumors are in the clinical “gray zone,”

whereby the predictive ability of Gleason grading is mixed (72, 145). A recent study constructed

a 157-gene signature based on the comparison of Gleason score <6 and 8 patients, and could

show that their panel could predict lethality in the cohort of Gleason 7 patients (145).

Nonetheless, the development and large-scale implementation of prognostic markers of prostate

cancer has been hampered by numerous factors owing, in part, to the heterogeneous and

multifocal nature of the disease (2). Although the widely used Gleason grading system attempts

to control for heterogeneity of the glands and multifocality of cancerous lesions by summing the

2–3 most commonly observed histological patterns via inspection of multiple (typically 8 –12)

core biopsies, cancerous foci are still often missed (2, 300) providing only partial information

leading to imprecise diagnoses and prognoses. Pathologic staging remains the gold standard for

disease staging and risk assessment (301, 302); however, this process lacks timeliness in

discriminating OC from EC disease. Indeed, one-third of individuals with non-OC disease are

77

identified only after surgery (303). Furthermore, 35% of men treated with radical prostatectomy

(RP) with curative intent subsequently develop BCR (304-307) and the mean time from surgery

to recurrence is 3.5 years (72). Significant risk factors for time to prostate specific mortality

following BCR after RP are PSA doubling time, pathological Gleason score, and time from

surgery to BCR (72). Estimates place the percent of lethal cases at 20–25% of all patients that

show BCR, suggesting that nearly 75– 80% of patients in this group may be over treated (69).

There is an emerging trend toward recruitment of men with perceived low-risk disease to an

“active surveillance” monitoring approach. This is based on the supposition that most prostate

cancers are slow growing, and that the more aggressive forms can be identified during a period

of observation with little increased risk of death. Although a consensus may not exist for

defining the disease stage where active surveillance is warranted, there is considerable agreement

that men who have a PSA level less than 10 ng/ml, impalpable disease (clinical stage T1c) and

only 1 biopsy core out of 12 or more that show Gleason 6 cancer are most likely to harbor

indolent disease (308). Even so, these candidates for active surveillance will still contain

individuals who will have disease progression and die from their cancer. Thus, despite efforts to

recruit individuals to active surveillance protocols, overtreatment of prostate cancer is fueled by

the lack of reliable means to accurately discriminate between men with clinically indolent

prostate cancer from those with more aggressive disease (309, 310). This inability to accurately

predict prostate cancer aggressiveness based solely on standard clinicopathologic features clearly

underscores the need to explore the ability of additional biomarkers to enhance outcome

prediction for men with prostate cancer. Furthermore, it is important to acknowledge that a single

biomarker alone is unlikely to have sufficient prognostic power; rather, the integration of a panel

of biomarkers hold the promise for improved prostate cancer detection and prognosis (300).

Fluids that are proximal to the prostate are attractive sources of potential prostate cancer

biomarkers (255, 300), as they house secreted proteins and sloughed cells that provide a

presumably more comprehensive assessment of the organ and extent of disease. Further, fluids

such as urine are clinically favorable for their ease of collection, the volume and frequency at

which they can be obtained, and their adaptability to routine clinical assays. Prostate-proximal

fluids include seminal fluid, semen, and EPS. Here, we focus on the analysis of EPS as our

biological specimen, using direct-EPS samples for the discovery of candidate prognostic

biomarkers and both direct-EPS and pooled EPS-urines derived from independent sets of patients

78

for candidate biomarker verification. Direct-EPS is a prostatic fluid that is collected from

patients undergoing prostatectomy by massaging the organ and expelling 0.5–1 ml of the fluid

just prior to surgical removal. It was chosen as our discovery fluid as it is expected to house

prostate-secreted proteins at a higher concentration and purity, and we have developed a

workflow for the in-depth proteomic analysis of this fluid (260). Following discovery proteomics

in 16 clinically stratified direct-EPS samples, verification studies were performed using

independent sample sets of direct-EPS. Next, we focused our attention on the verification and

quantitative analysis of candidate proteins in pooled EPS-urines. Before EPS-urine collection,

men undergo digital rectal examination (DRE), often as part of a routine procedure, which causes

direct-EPS to be expelled from the prostate and subsequently voided in urine. Because EPS-urine

can be collected with substantial ease and in greater volumes and frequencies than direct-EPS,

much attention has been paid to this fluid as a valuable resource of prostate cancer biomarkers

amenable to routine clinical analysis. Following the recent FDA approval of the EPS-urine assay

for prostate cancer gene 3 (PCA3), standardized clinical collection protocols will be widely

implemented and easier access to this fluid is expected. Moreover, we have recently identified a

number of prostate-enriched proteins in EPS-urine by comparing its proteome to a urine

background (127).

The present study used MudPIT coupled with bioinformatics to catalog and comparatively

analyze the direct-EPS proteomes from a small cohort of patients with EC versus OC prostate

cancers. A semi-quantitative algorithm based on spectral counts (QSpec) (275) and an integrative

data mining strategy led to the selection of a number of putative biomarkers that were verified by

Western blotting in direct-EPS. Lastly, to demonstrate accurate quantitative measurements of

verified candidates in EPS-urine, a pilot study utilizing SRM-MS was undertaken as a proof-of-

concept.

3.3 Results

3.3.1 Proteomic characterization of expressed prostatic secretions from

organ-confined and extracapsular prostate cancers

An overview of the study approach is illustrated in Figure 2.1. Direct-EPS derived from 16

individuals who were clinically stratified as having EC (n = 8) or OC (n = 8) prostate tumors

79

(Table 3.1) were digested in-solution and analyzed by a 9-step MudPIT on a LTQ- Orbitrap XL

mass spectrometer, in duplicate (32 MudPITs).

Figure 3.1 Workflow of study design from the discovery of differentially expressed proteins

to verification of candidates in prostatic fluids.

80

Table 3.1 Clinical information of patients used for discovery proteomics.

A total of 624 unique proteins were identified by at least two unique peptides with a 0.2% FDR

(1 total reverse proteins) (Figure 3.2, 3.3, Appendix Tables 3.1 and 3.2). Among these, 78

proteins (13%) were only identified in the EC group and 216 (34%) were only identified in the

OC group, while the majority (330 proteins, 53%) were shared by both (Figure 3.4). A

substantial overlap of our data set with other published prostate-proximal and urine proteomic

data sets was observed (Figure 3.5). Furthermore, a large proportion of proteins had at least one

predicted transmembrane domain (TMD) and signaling peptide (SP) sequences (Figure 3.6).

Proteins in this fluid spanned a wide range of functional roles, as determined via Gene Ontology

(GO) enrichment (Figure 3.7). A number of Gene Ontology terms were significantly enriched in

both tumor groups. They were implicated in proteolysis, regulation of programmed cell death,

and cellular proliferation in the biological process category, as well as peptidase activity and

cytoskeletal protein binding in the molecular function category. Notably, many proteins were

localized in the extracellular region, as indicated by their over- representation in Gene Ontology.

81

Figure 3.2 Average number of unique proteins identified in the duplicate MudPIT runs for

discovery direct-EPS samples.

Figure 3.3 Total number of peptides (A) and total number of spectra (B) recorded for each

individual direct-EPS analyzed in the discovery phase.

82

Figure 3.4 Venn diagram depicting the total number of proteins exclusively expressed in

the EC or OC groups.

Figure 3.5 The current proteomics dataset compared to prostate-related proteomics data

from various reports.

83

Figure 3.6 Proportion of proteins with predicted SPs and TMDs.

Figure 3.7 Gene Ontology enrichment of the EC and OC direct-EPS.

84

3.3.2 Semi-quantitative comparison of expressed prostatic secretions

from extracapsular and organ-confined prostate cancers

To identify differentially expressed proteins in both patient groups we employed the QSpec

algorithm (275). Only proteins with a FDR of < 0.05 and at least a two-fold change in

normalized spectral abundance factor between the EC and OC groups for each protein were

considered. This analysis resulted in 133 differentially expressed proteins for further

consideration (Figure 3.8), and 100 (77%) of these were present in a functional interaction (FI)

network.

Figure 3.8 Distribution of all 624 proteins identified in the current direct-EPS data set. Blue

and red dots represent the 133 differentially expressed proteins based on a fold change of at least

2 and a FDR of ≤0.05 by QSpec analysis (red dots are proteins up-regulated in the EC tumor

85

group, blue dots are significantly down-regulated in the EC tumor group, with the 14 shortlisted

candidates indicated by their names).

Following extraction of these proteins from the FI network and the addition of linker genes, a

proteomics-associated prostate cancer network consisting of 161 genes, 61 of which are linkers,

was obtained. To decompose this network into smaller independent pieces and run pathway

annotation for every cluster separately, we used network community analysis to automatically

identify network modules that contain genes and their co-regulators that are involved in common

biological processes with high probability. We identified five modules of at least 10 proteins and

pathway enrichment analysis was done for each cluster separately. The first module was

significantly enriched in the “Glycolysis/ gluconeogenesis” pathway; the second one was

mapped to the cell cycle-related process “G2/M transition”; the functional annotation of the third

and fifth modules showed that they were significantly enriched in hemostasis processes like

“Complement and coagulation cascades” and “Further platelet releasate” and others (Figure 3.9,

2.10, and Appendix Table 3.3). The fourth module was not significantly associated with any

known processes or pathways.

86

Figure 3.9 Two significantly enriched pathways within the 133 differentially expressed

proteins are involved in hemostatic and immunological processes. Pink circles represent

proteins up-regulated in the EC tumor group; blue circles represent proteins down-regulated in

the EC group; orange triangles represent linkers.

87

Figure 3.10 Additional modules enriched by pathway analysis.

3.3.3 Generation of candidate shortlist

To select proteins for verification in a small, independent cohort of direct-EPS samples, we

performed a systematic data mining strategy using in-house and publicly available resources to

minimize the panel of 133 proteins to a refined set (Figure 3.11).

88

Figure 3.11 Protein prioritization scheme for the selection of candidates for verification.

Proteins were ranked initially based on six features by assigning a +1 value to a given protein

with the following annotations (0 if the term could not be assigned): 1) present by MS in six out

of eight samples corresponding to the risk group that the candidate was found to be higher in by

spectral counting; 2) overexpression of candidate genes in the prostate as demonstrated by at

least a threefold above median mRNA expression in normal prostate tissue compared with 22

different normal human tissues; 3) presence of predicted TMDs; 4) presence of predicted SP

sequences; 5) cellular localization assignment to “cell surface” (GO:0009986) and

“extracellular” (GO:0005576) by Gene Ontology annotation; and 6) cancer-associated protein as

available from the Human Protein Atlas database. The rationale for each of the key features is as

follows: as individual biological specimens can have a high level of variation, we set the criteria

89

that a candidate protein should have been detected in at least six out of eight samples in its

associated risk group. This would minimize the inclusion of proteins that are over represented by

large numbers of spectra in just a small number of samples. The integration of mRNA

microarray profiles from normal prostate and other human tissues with our proteomic data set

was performed with the rationale that the most promising biomarker candidates would be

preferentially expressed in the prostate, and thus may have an important functional role in the

organ that contributes to the outcome of the patient. Microarray data was therefore solely used to

select proteins with a likely tissue-selective expression pattern. We also took into consideration

the presence of predicted TMDs and SPs, as well as Gene Ontology annotations linked to the cell

surface and extracellular region as they are characteristics of secreted and plasma membrane

proteins, found in many currently known biomarkers. Sixty proteins had at least three out of six

features; 32 of which are cancer- associated proteins (253). Because of limitations in antibody

availability, a small subset of proteins (14 candidates, Table 3.2) were chosen for verification.

Also included in this list were two well-defined prostate cancer biomarkers, PSA and PAP, their

levels were also evaluated by ELISA in a different set of 14 direct-EPS samples. As shown in

Figure 3.12, the PSA and PAP concentrations determined by ELISA and Western blotting

yielded a similar distribution as the MS analysis, with an elevated level of expression being

observed in the OC group compared with the EC group. Further verification of the expression of

a subset of the short-listed candidate proteins was carried out in new sets of EPS fluids.

Table 3.2 Fourteen candidates selected by data mining scheme.

90

Figure 3.12 Measurement of PSA and PAP in direct-EPS. Samples were obtained from

patients with EC and OC prostate tumors in the discovery cohort by MS (reported as normalized

spectral abundance factors, NSAF) and in a new set of samples by ELISA (reported as μg/ml;

**p-value ≤0.01; *p-value ≤0.05).

3.3.4 Verification of differentially expressed candidates in direct-EPS

For the verification of the shortlisted candidates, we chose a small cohort of direct-EPS samples

that could be linked with clinical outcome information related to whether the patient had BCR or

91

no evidence of recurrence within a two-year period post-prostatectomy (Table 3.3). Using these

samples, the expression levels of candidates the verification set were evaluated by Western

blotting.

Table 3.3 Clinical information of patient samples used for Western blotting verification.

Five out of the 14 the candidates were successfully verified to correlate with the discovery data

(Figure 3.13, 3.14). These are: stratifin (SFN), membrane metalloendopeptidase (MME),

Parkinson protein 7 (PARK7), tissue inhibitor of metalloproteinase 1 (TIMP1), and

transglutaminase 4 (TGM4). For SFN, QSpec analysis of the discovery data in direct-EPS

revealed that it was three-fold higher in the OC group with a FDR of <0.002. In the BCR and

nonrecurrent samples, the level of SFN expression was significantly higher in the nonrecurrent

group (p = 0.03). For MME, it was found to be two-fold higher in the OC proteome with a FDR

of <0.002. In the comparison between BCR and nonrecurrent samples, MME expression was

found to be significantly higher in expression in the nonrecurrent group compared with BCR

patients (p = 0.05). In addition, we analyzed EC (n = 23) and OC (n = 19) direct-EPS samples for

the expression of MME using a commercially available ELISA, and found a similar trend in this

new set of samples (p = 0.07) (Figure 3.14B). PARK7 had a threefold difference by QSpec with

a FDR of <0.002, with higher expression being observed in the OC group. Similarly, this protein

was found to be significantly higher in the nonrecurrent group compared with the BCR group (p

= 0.008). Both proteins TIMP1 and TGM4 were three-fold different in expression in the

92

discovery sample set, whereby their expression was higher in the OC group with a FDR of

<0.002. Upon subsequent analysis in BCR and nonrecurrent patient direct-EPS samples, a

similar trend was observed with heightened expression in the nonrecurrent group for TIMP 1 (p

= 0.099) but not for TGM4 (p = 0.80).

Figure 3.13 Differential expression of verified candidates. Analysis was completed by MS in

EC and OC direct-EPS and Western blotting in BCR (R) and nonrecurrent (NR) patient direct-

EPS (reported as densitometry values; **p-value ≤0.01; *p-value ≤0.05; x, p-value ≤0.1; NS

denotes an insignificant p-value.

93

Figure 3.14 Differential expression of candidates measured by immunoassays. (A) Western

blots and stained membranes used to probe for candidates in direct-EPS from BCR and

nonrecurrent groups of patients. (B) MME ELISA in direct-EPS (x denotes p-value ≤0.1).

3.3.5 Verification of differentially expressed candidates in EPS-urines

A major aim of the current study was directed toward the detection of our candidates in a

clinically accessible fluid that can be collected with minimal invasiveness. EPS-urine samples

obtained during a DRE meet these criteria. Therefore, EPS-urine samples were obtained from

patients about to undergo a prostate biopsy procedure who subsequently were diagnosed with

pathology-confirmed EC and OC prostate tumors. Pools of EPS-urines from both groups were

prepared (Table 3.4) and used to assess the presence of the candidate proteins in this biofluid by

Western blot and a multiplexed SRM-MS approach. All candidates were quantifiable in the EPS-

urine with at least one proteotypic peptide by SRM-MS (Figure 3.15, Appendix Figure 3.1,

Appendix Tables 3.4, 3.5). The overall expression levels of MME, PARK7, and TIMP1

coincided with the Western blot analyses in the same pools (Figure 3.16). For SFN, one of the

two peptides representing SFN (NLLSVAYK) was found to be lower in the OC tumor group,

94

whereas the other (VLSSIEQK) was not quantifiable. Similarly, both TGM4 peptides were

down-regulated in the OC tumor group, which is not in line with the Western blot analyses.

Prostate specific antigen expression was found to be similar between the OC and EC tumor

groups as measured by two peptides: peptide HSQPWQVLVASR was higher in the OC group by

an average of 1.7-fold and LSEPAELTDAVK was by an average of 1.3-fold. This trend was

consistent with Western blot analysis of these same pools. ELISA was also performed on the

individual EPS-urines that comprise the pools, whereby a notable difference between the two

groups were not apparent (Figure 3.17).

95

Table 3.4 Clinical information of patient samples used to generate EPS-urine pools.

97

Figure 3.15 Standard curves generated for each peptide used for SRM-MS measurements.

Six concentration points were evaluated by 3 technical replicates in EPS-urine digests.

98

Figure 3.16 Verification of differential expression of candidates in EPS-urines pooled from

patients with EC or OC prostate tumors. Five candidates were measured by Western blot and

SRM-MS using heavy isotope-labeled peptide standards. Relative quantitative values are shown

for each technical replicate and as the average ratio of light/heavy peptide in the OC tumor group

divided by the light/heavy peptide in the EC tumor group ± standard deviation for each peptide.

99

Figure 3.17 PSA measurements in individual EPS-urines from patients with EC and OC

prostate tumors by ELISA (NS denotes insignificant p-value). The same patient samples were

pooled and assayed by Western blot for PSA as well as SRM-MS.

3.4 Discussion

Emerging advances in proteomics and genomics has led to the identification of a vast number of

biomarker candidates for a plethora of malignancies; however, few see clinical application (209,

311). The potential of serum as a source of protein biomarkers is hampered by its immense

complexity (118); thus we and others have looked to organ-proximal fluids for the discovery and

targeted verification of putative biomarkers of various conditions (260, 264, 267, 312-315).

Here, we interrogated a prostate-proximal fluid, EPS, for the identification of candidate

biomarkers of aggressive prostate cancer by analyzing a cohort of patients with EC versus OC

disease phenotypes. We followed up with verification of these biomarkers to identify aggressive

disease through additional analyses of a cohort of BCR and nonrecurrent patient direct-EPS.

Efforts to adapt urine as a source of biomarkers for prostate cancer have been made using various

approaches. For instance, DNA markers such as GSTP1 hypermethylation (316), RNA markers

such as PCA3 (317), and protein markers such as ANXA3 (167), matrix metalloproteinases

(169), and urinary versus serum PSA (318, 319). In light of the clinical relevance of urine

samples, we accessed EPS-urine for verification and potential future assay development of our

candidate biomarkers.

100

Our 624 EPS protein data set contained a number of previously proposed urinary protein prostate

cancer biomarkers, including PSA, ANXA3, and matrix metalloproteinases (MMP7 and 9), all of

which demonstrated differential expression between the EC and OC groups. Furthermore, a large

proportion of the proteome of EPS was predicted to have TMDs and SP sequences that are

characteristic of plasma membrane and secreted proteins, including our five candidates. A cell-

surface glycoprotein, MME, has been proposed to have an inhibitory effect on prostate cancer

cell growth and migration through its association with PTEN, PI-3 kinase (320) and its loss has

been shown to be associated with PSA relapse (321). The observation in the current study was

that this protein is significantly higher in expression in the OC and nonrecurrent groups

compared with the EC and BCR groups in direct-EPS. In general, there was a trend of decreased

expression of most proteins detected in the EC groups relative to the OC groups. However, at the

protein network level, proteins in the complement and coagulation cascade were significantly

higher in expression in the EC tumor group. There are clearly a high abundance of immune

system-related proteins identified in the EPS fluids, which is consistent with the chronic

inflammation associated with development and progression of prostate cancer (322).

Lymphocytes and macrophages present in the prostate are most often associated with this

chronic inflammation, and less frequently plasma cells, eosinophils, and neutrophils (323, 324).

The presence of tumor-associated macrophages have been reported as potential biomarkers of

poor prognosis in prostate cancers (324, 325), and we have evidence that these macrophages can

be detected by flow cytometry of the low speed cell pellets of EPS urine samples in a subset of

prostate cancer patients (data not shown). A better understanding of the role of the many

detectable biomarkers of immune system activity, like the members of the complement path-

way overexpressed in non-OC EPS samples, is critical for improving diagnostic targeting of

aggressive prostate cancers.

The complexity of the EPS proteome could be reduced using depletion columns, specifically

targeted for high-abundance proteins such as albumin, before digestion. This method may

facilitate the quantification of lower-abundance proteins although sample recovery may be a

caveat if starting concentrations are low. Advances in nanoparticle technology are also showing

promise in their application in proteomics to deepen proteome coverage and enhance recovery of

low- abundance proteins. Nanoparticles, functionalized using high-affinity baits to selectively

capture desired classes of proteins, have been shown to enrich otherwise, low-abundance

101

proteins, by orders of magnitude (326-328). Yet other strategies focus on the analysis of

subproteomes by extraction of desired organelles or exploiting post-translational modifications to

selectively enrich for desired classes of proteins. For instance, glycosylation is the most common

post-translational modification and N-linked glycosylation is particularly common in secreted or

plasma membrane proteins (329). Enrichment of this class of proteins can be achieved by the

selective capture of N-linked glycosites to a solid support via hydrazide chemistry followed by

enzymatic release by peptide N-glycosidase F (330, 331), or various lectin affinity approaches

(332-334). Quantification can be achieved by stable isotope labeling and tandem MS. Zhao and

colleagues have extended the technique of stable isotope labeling by amino acids in cell culture,

by generating stable isotope labeled secretome standards for the quantification of soluble

proteins in direct-EPS (315). Alternatively, more elegant techniques such as SISCAPA® (335,

336) can selectively enrich for target peptides using anti-peptide antibodies and coupled to SRM-

MS for targeted quantification, albeit the pipeline from generation of monoclonal antibodies and

their validation may be a significant limiting factor. Analyzing urinary sediments, which

accounts for 47.7% of the total protein content by weight (171), may offer interesting markers of

prostate cancer that may be missed by solely investigating the soluble fraction. Mataija-Botelho

et al. identified 60 proteins in the sediment of urine, many of which were also identified in the

soluble fraction, suggesting that simply discarding the sediment may alter abundance ratios of

different proteins (337). EPS fluids are also an abundant source of exosomes, a sub-fraction that

is also being targeted for discovery of prostate cancer biomarkers (172, 173, 338). Initial pilot

proteomic studies of EPS-urine exosome fractions have indicated readily detectable and

abundant proteins by MS (126). This exosome fraction is a subcomponent of the proteins

analyzed in the present study, and we are currently pursuing more detailed analyses of these

EPS-derived exosomes from samples representative of the disease spectrum of prostate cancers.

Our pipeline attempts to bridge discovery proteomics in direct-EPS with validation in a more

clinically obtainable fluid, EPS-urine. As such, we observed some discrepancies that can be

attributed to differences in direct-EPS and EPS-urine samples, sample size, patient heterogeneity,

the translation of shotgun proteomics to targeted SRM-MS, and inherent challenges in SRM-MS

quantification. In the current study, discrepancies in SFN and TGM4 by Western blotting and

SRM-MS are reported. Although a moderate-to-good correlation between antibody-based assays

(namely, ELISA) and targeted proteomics approaches have been reported (250, 339, 340),

102

discordance between the two methods can be conceptualized by the differences in the targeting

approaches. Therefore, it may be possible that the epitope for the SFN and TGM4 antibodies

used in the current study may be modified thus resulting in inaccurate measurements, or the

antibodies may be also targeting other isoforms of the proteins as both are members of larger 14-

3-3 and transglutaminase protein families. Furthermore, proteoforms – that is the different

molecular forms in which the protein product of a single gene can be found, including changes

due to genetic variations, alternatively spiced RNA transcripts, and post-translational

modification (341) − may confound accurate quantification. The quantification of SFN and

TGM4 thus remains inconclusive at this point of time.

Recently, Hüttenhain and colleagues demonstrated a high-throughput workflow for the

development of SRM-MS assays for a large number of cancer-associated proteins in human

urine and plasma (253). In their study, 408 proteins were detectable by SRM-MS in urine, of

which 169 were previously undetected in the data sets from Human Protein Atlas and Adachi et

al. (124). In another study, the serum and prostate glycoproteome of Pten-null and wild-type

mice were comparatively assessed using label-free quantitative proteomics followed by SRM-

MS of 39 protein orthologs in sera of prostate cancer patients and controls (240), demonstrating a

useful platform for biomarker discovery. Similarly, the current study demonstrates the adaptation

of a discovery-driven global proteomics experiment in a prostate-proximal fluid to a targeted

quantitative assay in EPS-urine. Although a large proportion of the SRM-MS optimization

process for peptides can be achieved in a simpler matrix, such as BSA, yeast, or cell lysate

background, finer optimization in the highly complex clinical specimen is an important

requirement. General caveats associated with protein biomarker studies in urine are low protein

abundance, the presence of cellular material, high concentrations of salts and interfering

substances such as urea, urobilin and other metabolites, and intra- and inter-individual variability

(271). In our hands, initial optimization studies were performed in a BSA background, whereby

lower concentrations of heavy isotope labeled peptide standards were easily identifiable by at

least three transitions. However, once introduced to the EPS-urine background, we observed

large amounts of ion suppression and interference of the heavy isotope-labeled peptide standards

that were easily identifiable in BSA. Most peptides showed good linearity within 1–500 fmol on-

column range but equimolar concentrations of peptides showed variable responses. Therefore, in

the end we used a heavy peptide standard concentration of 500 fmol on-column, which allowed

103

all peptides to be readily detectable in EPS-urine. Using heavy isotope labeled peptide standards,

we quantified PSA, SFN, MME, PARK7, TIMP1, and TGM4 in pooled EPS-urines of EC and

organ-confined prostate cancers by 1 or 2 peptides. The SRM-MS experiments presented in this

proof-of-concept study primarily serves to demonstrate the applicability of EPS-urines to SRM-

MS assay platforms and to support its value as an important clinical biofluid. By no means do

pooled EPS- urines represent the heterogeneity of the patient population. The only way to verify

the relevance of our proposed candidates would be to quantitatively measure them in large

cohorts of individual EPS-urines and thus future work will be dedicated to extending the

verification phase of the current study.

3.5 Materials and methods

3.5.1 EPS Sample Properties

Direct-EPS samples were obtained from patients following informed consent and use of

Institutional Review Board approved protocols at Urology of Virginia and Eastern Virginia

Medical School between 2007 and 2009 and the Research Ethics Review Board at the University

Health Network. Sample collection and handling has been previously described (260). Briefly,

an aggressive prostate massage was performed in confirmed prostate cancer patients under

anesthesia prior to undergoing prostatectomy to collect 0.2–1 ml of fluid. Samples were diluted

with saline to 5 ml and stored on ice for a maximum of 1 h. Particulates were sedimented by low-

speed centrifugation and the resulting supernatants were aliquoted to 1 ml and stored at -80 °C.

The clinical data linked to the patients enrolled in the study are outlined in Tables 2.1 and 2.3.

Organ-confined disease is defined as the condition of patients with tumors not beyond stage T2c

that are margin, seminal vesicle and lymph node negative. Extracapsular disease is differentiated

from OC by evidence of margin involvement, extraprostatic extension, or positive for tumor in

seminal vesicles or lymph nodes. BCR is defined as patients with rising serum PSA values over

0.2 ng/ml at 3 months or longer post-prostatectomy. EPS-urines were collected from men

reporting to the clinic for a prostate biopsy via prostate massage with three strokes on each side

of the median sulcus of the prostate followed by collection of voided urine (10–20 ml), as

previously described (255). Pools of EPS-urines were derived from 17 patients classified as

having OC prostate cancer and 17 from non-OC, EC cancers using 1 ml per patient sample.

Tumor grades, staging, prostate volumes and percent of cancer positive needle biopsy cores were

104

determined using standard pathological procedures. Patient information was recorded, including

demographics, medical history, pathology results, and risk factors, and stored in a Caisis

database system. All personal information or identifiers beyond diagnosis and laboratory results

were not available to the laboratory investigators.

3.5.2 Protein digestion and sample preparation for mass spectrometry

Ultrapure-grade urea, ammonium acetate, and calcium chloride were from BioShop Canada, Inc.

(Burlington, ON, Canada). Ultrapure-grade iodoacetamide, and dithiotreitol (DTT) were from

Sigma-Aldrich. HPLC-grade solvents (methanol, acetonitrile, and water) and formic acid were

from Fisher Scientific. Mass spectrometry-grade trypsin was from Promega (Madison, WI).

Solid-phase extraction C18 MacrospinTM Columns were from The Nest Group, Inc. (Southboro,

MA). Amicon spin filter columns, 0.5 ml, 3 kDa MWCO, were from Millipore.

Protein concentrations of the direct-EPS samples used for discovery-based proteomics were

quantified by Bradford assay and a volume corresponding to 100 μg of total protein were

concentrated to 20 μl using a spin filter column. Concentrated direct-EPS was diluted with 50 μl

of 8 M urea, 2 mM DTT, 50 mM Tris-HCl, pH 8.5, and incubated at 37 °C for 1 h.

Carbamidomethylation was performed by incubating samples with 8 mM of iodoacetamide for 1

h at 37 °C in the dark. Samples were then diluted to 1.5 M urea using 100 mM ammonium

bicarbonate, pH 8.5. Calcium chloride was added to a final concentration of 2 mM and the

protein mixture was digested with 4 μg of trypsin at 37 °C overnight. The digested peptide

mixture was purified with C18 MacrospinTM columns and concentrated by vacuum

centrifugation and reconstituted to a volume of 40 μl with 0.1% formic acid. Samples were

stored at -80 °C until used for MudPIT analysis. A 9-step MudPIT sequence was used as

previously described (203, 204, 260).

3.5.3 Protein identification and data analysis

Raw data was converted to m/z XML using ReAdW (version 1.1) and searched by X!Tandem

(version 06.06.01) against a locally installed version of the human UniProt complete human

proteome protein sequence database (release date 2009, 20,323 sequences). The search was

performed with a fragment ion mass tolerance of 0.4 Da and a parent ion mass tolerance of 10

ppm, and two miscleavages were allowed. Carbamidomethylation of cysteine was specified as

105

fixed and oxidation of methionine as a variable modification. A target/decoy search was

performed to experimentally estimate the false positive rate and only proteins identified with two

unique high quality peptide identifications were considered as previously reported (269, 296,

298) (1 decoy protein identified; FDR ≤0.2%). An in-house protein grouping algorithm was

applied to satisfy the principles of parsimony (296). A total of 624 protein groups were identified

(Appendix Tables 3.1, 3.2). Proteins with significant differential expression levels were

determined based on a hierarchical Bayesian statistical algorithm known as QSpec (275).

Proteins were considered to be differentially expressed if normalized spectral abundance factors

(342) derived from QSpec were at least two-fold different between EC and OC groups with a

FDR ≤0.05.

3.5.4 Gene Ontology annotation, pathway enrichment and network-

based analysis of EPS proteins

Analysis of differentially expressed proteins by Gene Ontology assignment was performed using

DAVID v6.7 (299, 343). Proteins were uploaded into the DAVID functional annotation tool and

compared with the human genome background. Enriched pathways and Gene Ontology FAT

terms with a minimum of two-fold enrichment and a Fisher’s Exact test p-value ≤0.05 were

considered and the top Gene Ontology terms that are shared between the EC and OC tumor

groups are reported as the proportion of genes involved relative to the total number of genes

involved in the whole human genome background.

To understand the relationship among differentially expressed proteins further, we performed a

network-based analysis using a protein FI network (344). From the FI network, which contains

close to 50% of the human proteome and more than 200,000 FIs, we extracted the subnetworks

that involve the differentially expressed proteins. Using a minimum spanning tree algorithm, the

minimum number of additional “linker” genes from the full FI network were added in order to

form a single fully-connected subnetwork (345). The subnetwork was clustered using the edge-

betweenness algorithm (346) in order to identify functionally related network modules. Clusters

with size ≥10 were kept for further analysis. All network diagrams were drawn using Cytoscape

v. 2.7 (347). For pathway enrichment analysis of network modules, the bionomial test was used

and the FDR was calculated with 1000 permutations across all genes in the FI network

(Appendix Table 3.3).

106

3.5.5 Integrative data mining

Differentially expressed proteins were compared with transcriptomic profiles based on gene

mapping using an in-house relational database with mRNA microarrays for 22 normal tissues,

including the prostate, available through Human Gene Atlas (261). The median mRNA

expression level of each gene product was calculated across all 22 tissues and compared with the

level of expression in the prostate alone, as detected by a total of 218 probe IDs. In this way,

genes that demonstrate a propensity toward elevated expression in the prostate compared with

other tissues at the transcriptomic level were derived. To assess whether the differentially

expressed proteins had predicted TMDs and SPs, their assigned UniProt IDs were uploaded onto

the Protein Center bioinformatics software (Thermo Fisher Scientific), which uses TMAP

(http://emboss.sourceforge.net/apps/release/6.0/emboss/apps/tmap.html) and PrediSi

(http://www.predisi. de/) tools for predictions. The Human Protein Atlas (348) was used to

determine whether a given candidate was previously proposed as a cancer biomarker (Cb; from

the Plasma Proteome Institute) and to assess the immunoreactivity of antibodies directed against

candidate proteins in normal (non-neoplastic and morphologically normal) and cancerous

prostate tissue sections. Comparisons across various proteomics data sets (direct-EPS, seminal

plasma, urine, cell lines) (124, 260, 264, 267, 268, 312) were performed using the Protein Center

bioinformatics software.

3.5.6 Immunoassays in prostatic fluids

The Human Neprilysin (MME) DuoSet® ELISA Development System (R&D Systems,

Minneapolis, MN) was used to measure MME in direct-EPS. Total protein concentration

corresponding to 140 μg was obtained from each sample. Samples and standard were diluted in

1% BSA in PBS, pH 7.4 and assayed in duplicate following the protocol outlined by the

manufacturer. PSA and PAP ELISAs were performed as previously described (255, 349).

For Western blotting, anti-SFN (ab14123), anti-MME (ab951), anti- PARK7 (ab11251), anti-

PSA (ab46976) were obtained from Abcam (Cambridge, MA), anti-TGM4 (sc55791) was from

Santa Cruz Bio- technology (Santa Cruz, CA), anti-TIMP1 (RP1T1) antibody was from Triple

Point Biologics (Forest Grove, OR). Direct-EPS from patients with BCR and no evidence of

recurrence were quantified by Bradford assay and a corresponding 30 μg of total protein were

107

resolved on Criterion 10 –20% Tris-HCl SDS-PAGE gels (Bio-Rad) and transferred to PVDF

membranes (Bio-Rad). The membranes were blocked with 5% milk in TBS-T for 1 h at room

temperature. Primary antibodies were diluted in blocking solution and incubated at 4 °C

overnight, washed with TBS-T, and incubated with secondary antibody diluted in blocking

solution for 1 h at room temperature. Signals were developed using SuperSignal West Pico or

Femto chemiluminescent substrate (Thermo Fisher Scientific). Pooled EPS-urines were

concentrated from 15 ml to 500 μl by using a spin column (Amicon Ultra Spin Filter, 3 kDa

MWCO, Millipore) and quantified using Bio-Rad DC assay. Concentrations corresponding to 50

μg of total protein were resolved on 10% SDS-PAGE, transferred to PVDF membranes, and

assayed as described above.

3.5.7 Statistical analysis

Following candidate protein selection, we verified their differential expression by Western blot

densitometry and/or ELISA. To determine if the protein levels differ between OC tumor groups

and EC tumor groups, we applied the Mann-Whitney U test to assess if the difference is

statistically significant. A p-value ≤0.05 was considered significant.

3.5.8 SRM-MS development

A spectral library was built using the Skyline software tool (version 0.5 and higher) (350) from

the LTQ-Orbitrap XL data obtained from the 16 direct-EPS samples. All spectra had scores that

passed a stringent peptide score as determined by X!Tandem target decoy search criteria (i.e.

0.5% decoy spectral matches). In addition, two publicly available consensus libraries were used:

NIST_human_QTOF (v 3.0 05/24/2011) and GPM human_ipi_cmp_20. Protein sequences were

converted to FASTA format and uploaded into Skyline for the prediction of signature peptides.

Peptides were chosen based on previously reported specifications (241): predicted tryptic digests

of 7–30 amino acids in length and containing zero possible missed cleavage sites or cysteine and

methionine residues, and matching the spectral libraries. Four to 6 of the most intense y-ions

with a precursor charge of +2 and a fragment ion charge of +1 were selected from the libraries,

resulting in several possible peptides for each of the candidates. The GPM (http://www.gpm.org)

was used to narrow down the list of peptides to those that have been observed experimentally

with a high number of +2 charge states. Furthermore, no possible missed cleavage sites within 3

108

amino acids before or after the peptide, no acidic residues on cleavage sites, and uniqueness to

the gene (241) were additional requirements. Other favorable conditions included no possible

post-translational modification sites, no N-terminal Q, E, no W, and SSR hydrophobicity 5–30

(241, 351, 352). Based on these criteria, 21 signature peptides for six candidates (SFN, MME,

PARK7, TIMP1, TGM4, and PSA) were chosen for SRM-MS experimentation (Appendix Table

3.4).

Lyophilized, unpurified 13C- and 15N-labeled arginine and lysine peptides (SpikeTides_L™,

JPT Peptide Technologies, Berlin, Germany) were obtained in a 96-well format and solubilized

in 80% 0.1 M ammonium bicarbonate and 20% acetonitrile. A heavy isotope labeled peptide

standard stock containing all 21 SpikeTides_L™ at equal concentrations was made for

subsequent experimentation.

For optimization studies, the heavy peptide standard stock corresponding to 500 fmol on-column

was added to BSA digests and automatically loaded from a 96-well microplate autosampler

using the EASY-nLC system. Analysis was performed on a TSQ Vantage triple quadrupole mass

spectrometer (Thermo Fisher Scientific, San Jose, CA) interfaced with the EASY-nLC system

(Proxeon Biosystems, Odense, Denmark), as described previously (204, 260). A 40 min HPLC

gradient running at a flow rate of 400 nl/min was used. The gradient conditions were as follows:

starting with 100% buffer A (water/0.1% formic acid) followed by an increase to 60% buffer B

(acetonitrile/ 0.1% formic acid) for 36 min, followed by a steep increase to 95% buffer B for 2

min, and finally to 100% buffer B for 2 min. For compound optimization, the Thermo TSQ Tune

software (Thermo Fisher Scientific, San Jose, CA) was used. The XCalibur data system (Thermo

Fisher Scientific, San Jose, CA) was used to input the following SRM-MS method parameters:

one scan event over a 40 min method time at a fixed scan width of 0.02 m/z, scan time of 0.010

s, 0.2 fwhm Q1 and 0.7 fwhm Q3 resolution, and data collection in profile mode. For each

peptide, 3– 4 most intense transitions for each eluting peptide were selected and their accurate

retention times were recorded using Skyline (350). These transitions were grouped into two

separate methods to maintain a cycle time of 1 s.

109

3.5.9 Relative quantification by SRM-MS in EPS-urines

For quantification by SRM-MS, EPS-urine pools were prepared as follows: concentrated EPS-

urines were precipitated in 100% methanol (1:20 v/v ratio) overnight at -20 °C. Protein pellets

were recovered by centrifuging at 16,000 g for 30 min at 4 °C. Two additional washes with

methanol were performed to obtain “clean” protein pellets. The pellets were subsequently

resolubilized in 50 μl of 8 M urea, 2 mM DTT, 50 mM Tris-HCl, pH 8.5, and incubated at 37 °C

for 1 h. Carbamidomethylation was performed by incubating samples with 8 mM of

iodoacetamide for 1 h at 37 °C in the dark. Samples were then diluted to 1.5 M urea using 100

mM ammonium bicarbonate, pH 8.5. Calcium chloride was added to a final concentration of 2

mM and the protein mixture was digested with 4 μg of trypsin at 37 °C overnight. The digested

peptide mixture was purified with C18 MacrospinTM columns and concentrated by vacuum

centrifugation and reconstituted to a volume of 100 μl with 5% acetonitrile, 0.1% formic acid.

Samples were incubated at 37 °C for 30 min to allow peptides to go into solution and

subsequently centrifuged at 16,000 g for 30 min at 4 °C to remove any remaining debris.

For quantification in EPS-urine pools, a total of 1 μg of peptides on-column (as determined by

the micro BCA assay) were spiked with 500 fmol on-column of the heavy isotope-labeled

peptide standard stock and assayed following the same conditions as above. Each sample was

assayed in four technical replicates (Appendix Table 3.5). Total peak area values and area ratios

(light/heavy peptide) were obtained from Skyline (350). Relative quantitative values are shown

for each technical replicate and as the average ratio of light/heavy peptide in the OC tumor group

divided by the light/heavy peptide in the EC tumor group ± standard deviation for each peptide.

110

Chapter 4

Systematic development of a quantitative assay for putative

prostate cancer biomarkers in expressed prostatic secretions

Kim, Y., Mejia, S.*, Jeon, J.H.*, Ignatchenko, V., Yao, C.Q., Nyalwidhe, J.O., Lance, R.S.,

Gramolini, A.O., Troyer, D.A., Drake, R.R., Boutros, P.C., Semmes, O.J., Kislinger, T.

Systematic development of a multiplexed selected reaction monitoring mass spectrometry assay

for prostate cancer biomarkers in expressed prostatic secretions. Manuscript in preparation. *

Equal contribution

4 Systematic large-scale development of targeted

proteomics assays for biomarker verification in

prostate-proximal fluids

4.1 Abstract

Current prostate cancer prognostic factors stratify patients into risk groups, but are inaccurate in

predicting outcome, resulting in overtreatment and repeat biopsies of many men with indolent

disease. Therefore, a pressing need in prostate cancer management is improved prognostic

factors that enable follow-up of men with low risk disease without repeat needle biopsies. To

address this, I systematically explored non-invasively collected prostate proximal fluids, known

as expressed prostatic secretions (EPS), as a source of prostate biomarkers. Analysis of direct-

EPS from extracapsular (EC) and organ-confined (OC) prostate cancers revealed differentially

expressed proteins based on spectral counts (244). Selection criteria led to 147 proteotypic

peptides suitable for verification using selected reaction monitoring mass spectrometry (SRM-

MS). For each candidate, a corresponding crude heavy isotope labeled peptide was synthesized

and used for SRM-MS method development. Next, relative quantification of endogenous peptide

concentrations in defined subgroups of prostate cancer patients and control EPS-urines was

performed, revealing 34 candidate peptides with high potential as prostate cancer biomarkers. A

single scheduled SRM-MS assay was developed and used to verify the differential expression of

111

these candidate peptides by absolute quantification in an independent cohort of stratified EPS-

urines.

4.2 Introduction

The worldwide incidence of prostate cancer has been steadily rising but many patients exhibit

tumors of an indolent nature; that is, these tumors grow slowly and pose minimal threat to the

life of the patient, in the absence of treatment. Once cancer has been confirmed, the optimal

course of action is tailored on a patient-to-patient basis, ranging from active surveillance

programs to surgery and radiotherapy. The goal of individualized patient management is to spare

patients with indolent disease from unnecessary procedures, while identifying and treating those

who would benefit from treatment intensification. Gleason grading is a strong prognosticator of

prostate cancer, but requires surgical or biopsy specimens. The number of patients admitted to

the hospital as a result of post-biopsy complications is sizable, posing a significant burden on

health care and risking serious complications. Most men diagnosed with prostate cancer have

localized disease with excellent prognosis (localized cancers have 100% 5 year survival, SEER

Cancer Statistics http://seer.cancer.gov), but nearly 70% receive early intervention in the form of

surgery or radiotherapy (353-357). Furthermore, even active surveillance relies on repeat

prostate specific antigen (PSA) testing, digital rectal examination (DRE), and multiple prostate

biopsies.

Liquid biopsies, such as circulating tumor cells and free DNA have been proposed as non-

invasive prostate cancer biomarkers, but their detection and enrichment remains technically

challenging. Cataloguing the secreted and soluble factors released into the interstitial fluid that

bathes the organ of interest may provide a novel inventory of putative disease biomarkers.

Indeed, I have interrogated the collection of proteins comprising a prostate-proximal fluid,

extraprostatic secretions (EPS) (126, 127, 244, 260). Reproducible detection and quantification

of a protein in the matrix of interest is a requirement of any potentially valuable disease

biomarker, but verification of these candidates is a major bottleneck in the pipeline from

discovery to clinical implementation.

Traditionally, immunoaffinity based assays, namely enzyme-linked immunosorbent assays

(ELISAs), are used to validate a biomarker; but this approach is time-consuming, costly, and

112

relies on the existence of validated antibodies for a given target protein. Targeted mass

spectrometry (MS) offers an alternative approach to the rapid verification of candidate

biomarkers. Selected reaction monitoring mass spectrometry (SRM-MS) is the leading method

for the targeted quantification of proteins using MS. It offers excellent selectivity, sensitivity,

and throughput, without the need for validated antibodies. In the current chapter, I will describe

my efforts that extend the previously published investigations into the EPS proteome (126, 127,

244, 260). Here, I systematically generated a multiplexed quantitative SRM-MS assay targeting

candidates with prostate cancer prognostic and diagnostic potential using a clinically relevant

prostate proximal fluid.

4.3 Results

4.3.1 Candidate selection for targeted quantification by selected reaction

monitoring mass spectrometry

Utilizing a previously published shotgun proteomics dataset derived from direct-EPS comparing

the proteomes of EC and OC prostatic tumors (described in Chapter 3), 232 proteotypic peptides

(Phase 1 peptides, Appendix Table 4.1) were shortlisted for targeted quantification by SRM-MS

in EPS-urines (Figure 4.1). In order to abrogate potential confounding influences due to the

differing modes of data acquisition, instruments, and samples, Phase 1 peptides were first

evaluated for reproducible detection by SRM-MS. For this, crude heavy isotope labeled synthetic

versions of the target peptides (JPT Peptide Technologies GmbH) were spiked into a pool of

EPS-urine, and the light (endogenous) and heavy (synthetic) peptides were monitored. The data

was manually inspected in order to select peptides that had at least 3 fragments ions that aligned

at the expected peptide elution time, had co-eluting light and heavy peptides, had minimal

interference, and were reproducible. Out of the 232 peptides monitored, 147 (63%) peptides

(Phase 2 peptides, Appendix Table 4.1) met these criteria, and were thus taken forward to an

independent cohort of EPS-urine samples.

113

Figure 4.1 Study workflow outlining the three major phases of the current project.

Discovery data from direct-EPS derived from patients with EC or OC prostatic tumors was used

to identify candidates demonstrating differential expression (Chapter 3). Proteotypic peptides

from these candidates were carefully selected and evaluated by SRM-MS for performance in an

EPS-urine background. Next, peptides were analyzed in EPS-urines from individuals with

benign prostatic hyperplasia (BPH) or no evidence of disease and compared to EC and OC EPS-

urines. Peptide quantification by SRM-MS was performed and 34 promising candidates with

diagnostic and prognostic potential were identified based on estimated abundance changes. The

34 candidates were accurately quantified in a different cohort of EPS-urines for verification.

Finally, ongoing statistical analyses are being performed to generate a peptide signature with a

high potential for prostate cancer diagnosis and prognosis.

114

4.3.2 Targeted relative quantification of candidates in a cohort of EPS-

urines

For targeted quantification of Phase 2 peptides, a cohort of EPS-urines derived from 74 clinically

stratified prostate cancer patients and noncancer controls (BPH and normal individuals) was

utilized (Cohort A, Table 4.1, Appendix Table 4.4). The overlap of these 147 putative candidates

with a previously published dataset composed of 661 cancer-associated proteins in urine (253)

was determined. Of the 147 Phase 2 peptides, only 10 overlapped with urine (Figure 4.2A),

suggesting that post-DRE urines are enriched in prostate-derived proteins. Indeed, in a previous

study (Chapter 2), myself and colleagues have demonstrated the significant enrichment of

proteins in EPS-urine (post-DRE urine) when compared to urine (matched pre-DRE urine) (127).

Furthermore, the estimated average concentration of Phase 2 peptides measured in this cohort of

EPS-urine demonstrated a dynamic concentration range of peptides spanning ~5 orders of

magnitude, with the exception of the two KLK3 (PSA) peptides HSQPWQVLVASR and

LSEPAELTDAVK (Figure 4.2B).

Table 4.1 Overview of clinical information from patients enrolled in Cohort A of EPS-

urines.

115

Figure 4.2 Peptides in EPS-urine. (A) Venn diagram depicting the number of Phase 2 peptides

that were also detected in the urines of a previous publication by Hüttenhain et al. (253). (B)

Estimated concentrations of Phase 2 peptides based on the spike-in concentration of the crude

heavy isotope labeled peptide standards. Red dots represent Phase 3 peptides that were

subsequently quantified by using AQUA peptides.

116

4.3.3 Selection of peptide candidates for verification

To derive a small set of peptide candidates for verification, a student’s T-test was performed to

compare the ratios of peptide abundance between cancer and noncancer groups, as well as EC

and OC groups. The first criteria used to select candidates as potential diagnostic and prognostic

biomarkers were p-value cutoffs of 0.05 and 0.1, respectively. Next, refinements to the list of

candidates were made by adding peptides representing the proteins IGJ and ANXA1 because

they were the only candidates that that were upregulated in the EC tumor group, although they

did not meet the p-value cut-off (p-value = 0.25). Furthermore, KLK3 peptides were added to

monitor PSA levels in EPS-urine. Finally, each of the peptides that met these criteria were

manually inspected for quality of SRM-MS traces. Overall, 34 candidates (Phase 3 peptides)

demonstrated a potential for classifying individuals based on prostatic disease status (Figure 4.1,

Figure 4.3, Appendix Table 4.2, Appendix Table 4.6). Peptides with diagnostic potential were

considered to be those that could discriminate prostate cancer from noncancer conditions.

Twenty-four such peptides were selected, of which the majority were upregulated in cancer

(Figure 4.3A and B). These candidates are grossly involved in functions associated with response

to oxidation when characterized by Gene Ontology enrichment (Figure 4.3C). Of the 14

candidates that demonstrated a prognostic potential – that is, they could discriminate EC tumors

from OC ones – only two candidates were upregulated in the EC tumor group (Figure 4.3A and

B). Candidates that were upregulated in the OC group are largely associated with the immune

response by Gene Ontology enrichment analysis (Figure 4.3C).

In order to generate a SRM-MS assay for accurate targeted quantification of Phase 3 peptides,

each peptide was synthesized as a highly purified (>97% purity) heavy isotope labeled standard

(AQUA peptides, Thermo Fisher Scientific) and calibration curves were generated for all

candidates in a urine background (Appendix Figure 4.1). All peptides demonstrated excellent

linearity (R2 ranging from 0.90 – 0.99) and reproducibility (coefficient of variation ranging from

0.4 – 19%) (Table 4.2, Appendix Table 4.3). The limit of detection ranged from approximately

0.006 to 8 fmol/μg of total urine peptides; the lower limit of quantification ranged from

approximately 0.02 to 25 fmol/μg of total urine peptides.

117

Figure 4.3 Phase 3 peptides. (A) Venn diagram depicting the number of peptides with

diagnostic potential and the number of peptides with prognostic potential. (B) The number of

peptides that the up- or down-regulated in the cancer group compared to the noncancer group

(left graph), and the EC tumor group compared to the OC tumor group (right graph). (C) Gene

Ontology terms that are enriched among the diagnostic and prognostic peptides.

118

Table 4.2. Limits of detection and quantification for 34 peptides determined by response

curves generated using AQUA peptides spiked into urine.

119

4.3.4 Absolute quantification for the verification of candidates in EPS-

urines

Next, the 34 Phase 3 peptides were quantified for the verification of their differential expression

in an independent cohort of 207 EPS-urines. Prostate cancer EPS-urines were further stratified

into pathological stage pT3 and pT2 tumors. Noncancer specimens were derived from patients

with BPH and normal individuals (Cohort B, Table 4.3, Appendix Table 4.5).

Table 4.3 Overview of clinical information from patients enrolled in Cohort B of EPS-

urines.

Since sufficient sample was available, each EPS-urine was measured in two technical replicates,

using a single scheduled SRM-MS method. The duplicate measurements for each sample in

Cohort B demonstrated strong reproducibility (Figure 4.4A). A Pearson correlation analysis

between the duplicate SRM-MS runs revealed that the majority of peptides have Pearson

correlation coefficients of 0.8 or above (Figure 4.4B). Peptides that have a Pearson correlation

coefficient of 0.8 or lower were found to be generally low-abundance peptides. The analysis also

demonstrated good chromatographic reproducibility (coefficient of variation <1.5%) with most

peptides eluting between 20 and 30 minutes over the 40-minute chromatographic gradient

(Figures 4.4C and D).

120

Figure 4.4 Cohort B SRM-MS data quality. (A) Peptide abundance correlation between both

replicates analyzed by SRM-MS. (B) The fraction of peptides that fall into different Pearson

correlation coefficient cut-offs in the subgroups of EPS-urine samples. (C) The average retention

121

time and coefficient of variation for each of the 34 peptides. (D) The recorded retention times of

the 34 candidates; boxplots represent the interquartile range; whiskers represent the minimum

and maximum recorded retention time.

Next, the absolute concentration of each peptide across all 207 samples in Cohort B was

determined based on the amount of spike in heavy AQUA peptides (Figure 4.5). The 34 peptides

measured in this set of EPS-urines spanned approximately 5 orders of magnitude, from the low

fmol to nmol range per µg total protein. As expected, the two peptides representing KLK3

(HSQPWQVLVASR and LSEPAELTDAVK) were found to be the most abundantly expressed

peptides in EPS-urine. The remaining peptides spanned approximately 3 orders of magnitude,

with the peptide representing DOPD being the highest and one of the peptides representing

PEDF being the lowest in this set of EPS-urines. All proteins that were represented by two

peptides were found to be within approximately 1 order of magnitude in abundance, except for

the peptides representing PEDF, which were approximately 2 orders of magnitude different in

abundance.

122

Figure 4.5 Absolute quantification of Phase 3 peptides in EPS-urine. Box plots represent the

median and interquartile range. Whiskers represent the 1-99 percentile. Outliers are represented

by red dots and the mean is represented by ‘+’.

4.3.5 Comparative analysis between subgroups of patients

In order to determine candidates with diagnostic and prognostic potential, comparative analyses

between subgroups of Cohort B patients were carried out using the SRMStats software tool

(version 1.6) (358). A number of candidates were significantly differentially expressed between

cancer and normal individuals, with all candidates being significantly upregulated in the cancer

123

group (Figure 4.6A). The proteins that were represented in this comparison were IDHC,

ANXA3, ANXA1, KLK3, KCRB, SERA, and CNDP2. Interestingly, IDHC, ANXA3, and

KLK3 were represented by the significant upregulation of two peptides for each protein in the

cancer group. When comparing the expression of candidates between the cancer and BPH

groups, ANXA3, SERA, ANXA1, and KLK3 were found to be significantly upregulated in the

cancer group (Figure 4.6B). The protein ANXA3 was represented by two peptides that were

significantly upregulated in cancer. Next, a comparison of pathological stage pT3 and pT2

samples revealed 10 peptides representing HEXB, 6PGL, DOPD, KLK3, SERA, PEDF, PRDX6,

and BTD. Two proteins – KLK3 and PRDX6 were represented by two peptides each (Figure

4.6C). When comparing cancer and noncancer (BPH and normal individuals), ANXA3, IDHC,

SERA, ANXA1, KCRB, and KLK3 were significantly upregulated in cancer (Figure 4.6D).

ANXA3 and KLK3 were represented by two peptides each. Lastly, a comparison between BPH

and normal samples revealed very little alterations, with IDHC being the only protein that was

significantly upregulated in BPH (Figure 4.6E). The relative expression of Phase 3 proteins and

peptides in EPS fluids (127, 244) using different quantitative methods are summarized in Table

4.4.

125

Figure 4.6 Comparative quantification of candidates in different subgroups of EPS.

Volcano plots representing the fold-changes of Phase 3 peptides in various subgroups of EPS-

urines from Cohort B. Red dots represent peptides that are upregulated; blue dots represent

peptides that are downregulated; dotted vertical lines represent the boundaries of a 1.5 fold

change; dotted horizontal line represents a p-value of 0.05.

126

Table 4.4 Relative expression of Phase 3 proteins and peptides in EPS fluids using various

quantitative methods.

127

4.4 Discussion and future direction

The timely verification of exhaustive lists of candidate disease biomarkers is becoming ever

more relevant as high-throughput technologies enable in-depth analyses of complex biological

systems exposing a plethora of putative candidates. Despite this, the implementation of

biomarkers into clinical practice is largely lagging behind. This is, in part, attributed to the lack

of validated methods of candidate testing and further evaluation. Targeted proteomics,

specifically SRM-MS, has placed itself at the forefront of candidate protein verification, due to

its relatively low cost and high-throughput assay development workflow. Therefore, the aim of

the current study was to systematically develop a targeted assay and apply it for the quantitative

verification of prostate cancer biomarkers in a sample that is routinely collected in the clinic.

Here, I developed a single multiplexed SRM-MS assay for 34 promising prostate cancer

diagnostic and prognostic candidates in EPS-urine. This work is an extension of previously

published shotgun proteomics data demonstrating the differential expression of a large number of

protein candidates in direct-EPS from patients with EC and OC tumors (described in Chapter 3)

(244). The feasibility of obtaining coordinates (peptide elution times, most intense transitions)

from different modes of data acquisition and instrumentation was demonstrated by extracting

relevant information from discovery data on a LTQ-Orbitrap in order to lay the foundation for

targeted assay development using a triple quadrupole mass spectrometer.

By utilizing semi-quantitative data (spectral counting) and spectral libraries generated from

Chapter 3, the extracted coordinates and most abundant transitions were identified and

systematically evaluated to develop a refined SRM-MS assay for differentially expressed

proteins. By performing multiple rounds of assay refinement (Phases 2 and 3), I was able to

narrow the list of promising candidates from over 200 candidates to 34 peptides with high

diagnostic and prognostic potential. By using heavy isotope-labeled peptides standards, I was

able to estimate and later absolutely quantify the candidates in the EPS-urine background.

Interestingly, a very small number of peptides in EPS-urine overlapped with those that were

previously explored in urine by Hüttenhain and colleagues (253). The cancer associated proteins

that were evaluated in their study were gathered from reports based on protein and nucleic acid

changes in human plasma and tissue (253, 359). In the current study, the evidence was directly

128

derived from comparing EC and OC tumor groups and from prostate-proximal fluids. I have

demonstrated in Chapter 2 that in pre- and post-DRE urines from the same patients, post-DRE

urines (EPS-urines) indeed contain a unique subset of proteins that are exclusive to this proteome

(127).

Nonetheless, there were some potential caveats that were faced in the design of the current study.

The discovery and subsequent Cohort A of samples were obtained from patients with either EC

or OC tumors based on pathology. This is in contrast to the samples used in the verification

phase (Cohort B) where patients were stratified based solely on their pathological stage (pT3

versus pT2). There are slight differences in their designations – the major difference being that

the EC and OC designations were primarily determined on the basis of pathological stage and

margin status (for instance, a pT2c tumor with positive margins and a pT3 tumor without

positive margins would be considered as EC cases). In order to evaluate the effects of these

differences, the expression profiles of candidate peptides in Cohorts A and B of samples were

evaluated (Appendix Figure 4.2). For the majority of peptides, Cohorts A and B were highly

concordant in terms of differential expression between pT3 versus pT2, and EC versus OC, with

the exception of a few proteins (IDHC, ANXA1, K2C8, PEDF, FIBG, and CFAB). When Cohort

A samples were converted to pT3 and pT2 designations, only GELS was affected. It is unclear

whether this effect is due to the difference in patient stratification, patient-to-patient variability,

or technical issues. However, the current evidence suggests that there is no major confounding

effect of patient stratification based on EC versus OC or pT3 versus pT2.

The overall relative expression profiles of the Phase 3 candidates across different EPS datasets

generated in our lab is summarized in Table 4.4. The first dataset was described in Chapter 3 and

in Kim et al., (244), and was the basis for candidate selection in the current study (Figure 4.1,

Discovery stage). Here, the semi-quantitative method of spectral counting was used to estimate

the relative abundance of proteins in direct-EPS from EC and OC tumors. In the same study,

immunoassays (i.e. Western blots) targeting a small number of proteins were performed in pools

of EPS-urine from EC and OC tumors. This data was compared to the expression changes of the

peptide candidates in Cohorts A and B of EPS-urines analyzed in the current study.

Encouragingly, the majority of candidates showed similar changes in direct-EPS and EPS-urine,

supporting the workflow that was used for candidate selection. Nonetheless, some candidates in

the EPS-urine did not reflect the trends observed in direct-EPS. This could be a result of patient

129

heterogeneity, instrumentation, and quantitative methods. Even when comparing the expression

levels of the candidates between EPS-urines from Cohorts A and B, there are some differences.

Interestingly, some pairs of peptides that represent a single protein demonstrate opposing trends.

For instance, the protein ANXA3 has one peptide (LTFDEYR) that is upregulated in the EC

group of EPS-urines in Cohort A, while the other peptide (SEIDLLDIR) is downregulated in the

EC group. This could be due to inaccurate quantification due to the use of crude peptide

standards of varying purities, to the presence of proteoforms (341) and modifications, or

variations in proteolytic digestion efficiencies of endogenous proteins (235, 360).

An interesting finding from the work presented in this chapter is the trend of lower abundance of

the majority of candidates with advancing disease. For instance, elevated serum levels of PSA

are indicative of prostate cancer; however, EPS levels of PSA have been consistently lower in

the EC group compared to the OC group, and in the pT3 group compared to the pT2 group (244).

Similarly, a trend of decreased PSA in EPS-urine from cancer patients compared to noncancer

individuals has also been noted by Drake et al. (255). This may be indicative of PSA leakage out

of the prostate gland and into the circulation, as described in Chapter 1 (129). Similarly, it can be

hypothesized that other proteins are leaking from the prostate and into the circulation with the

deteriorating structural integrity of advanced prostate cancers. It would be beneficial to measure

both EPS and serum levels of such proteins from the same patient, in order to test this

hypothesis.

In the comparison between BPH and normal individuals from Cohort B, only one protein

(IDHC) was found to demonstrate a significant difference in abundance. The remaining peptides

were found to show almost no differential expression. This may be explained by the fact that the

discovery cohort from Chapter 3 (244) lacked a noncancer group consisting of BPH and normal

individuals. Furthermore, the patients in Cohort A were not stratified on the basis of BPH versus

normal designations, and hence candidates that were selected for verification were not

specifically aimed at verifying their differential expression between BPH and normal EPS-

urines.

Currently, the analyses of the data presented in this chapter are ongoing. In collaboration with

Dr. Paul Boutros (OICR), a multi-feature diagnostic and prognostic signature is currently being

generated using machine learning approaches. The features that are being taken into

130

consideration include each of the 34 peptides that were measured in the 207 heterogeneous

verification sample cohort, as well as the clinical parameters that are available. For instance,

serum PSA levels that were taken at the time of sample draw may be combined with a protein

signature to provide enhanced prognostic performance of the signature. Although not obtained

for the current study, additional parameters such a tumor microenvironment parameters may also

provide information about patient outcome. Indeed, in a recent study by Lalonde et al., intra-

prostatic hypoxia was combined with DNA indices (copy number alterations) to predict 5-year

biochemical recurrence (BCR) (144). Such multi-feature signatures may provide a

comprehensive, powerful predictor of patient outcome. Patients enrolled in Cohort B of this

study have clinical information regarding BCR. In the pT2 group, 11 individuals out of 61

developed BCR at least 2 years post-RP; in the pT3 group, 10 individuals out of 29 developed

BCR (Table 4.3). Although not yet explored in this study, one approach to the analysis of the

data may be to comparatively analyze the protein expression changes between individuals who

develop recurrence and those that did not.

4.5 Materials and Methods

4.5.1 Sample collection and properties

Samples were obtained from patients following informed consent and use of Institutional Review

Board approved protocols at Urology of Virginia and Eastern Virginia Medical School and the

Research Ethics Review Board at the University Health Network. Twenty ml of EPS-urine was

centrifuged at 1100 g for 15 minutes at 4 °C to pellet debris. The resulting EPS-urine was

aliquoted in volumes of 3.5 ml and diluted with 2.5 ml of PBS (pH 7.4). The mixture was

vortexed, combined, and centrifuged at 1100 g for 15 minutes. The resulting supernatants were

stored at -40 °C. The clinical information for patients enrolled in the study are detailed in

Appendix Tables 4.4 and 4.5. Prostate cancer patients were selected on the basis of pathological

stage pT2 or pT3. Noncancer individuals had biopsy confirmed BPH or were considered as

individuals with no indication of prostatic disease based on biopsy results.

131

4.5.2 Sample preparation for mass spectrometry

Ultrapure-grade tetrafluoroethylene (TFE), trifluoroacetic acid (TFA), iodoacetamide (IAA), and

dithiotreitol (DTT) were from Sigma-Aldrich. HPLC-grade solvents (methanol, acetonitrile, and

water) and formic acid were from Fisher Scientific. Mass spectrometry-grade trypsin/lys-c was

from Promega (Madison, WI). Amicon spin filter columns, 0.5 ml, 3 kDa MWCO, were from

Millipore. Solid phase extraction C18 tips were from Agilent.

Four ml of urine and EPS-urine were concentrated to approximately 500 μl by using a spin

column with a molecular weight cutoff of 3 kDa, and proteins were precipitated overnight by the

addition of ice-cold 100% methanol. Protein pellets were washed twice with 100% methanol and

air dried. Protein resolubilization and extraction was performed by the addition of 50% TFE at

60 °C for 2 hours. Following reduction with DTT and alkylation with IAA, proteins were

digested overnight at 37 °C using mass trypsin/lys-c. The reaction was quenched by the addition

of TFA. Desalting was performed by solid phase extraction using C18 tips. Solvents were

removed by vacuum centrifugation and peptides were resolubilized in 5% acetonitrile, 0.1 %

formic acid. Peptide concentrations were determined by the micro BCA assay kit (Thermo Fisher

Scientific).

4.5.3 Target acquisition by selected reaction monitoring mass

spectrometry

Samples were analyzed on a TSQ Vantage triple quadrupole mass spectrometer (Thermo Fisher

Scientific) equipped with an EASY-Spray (Thermo Fisher Scientific) electrospray ion source.

Separations were performed on EASY-Spray columns (15 cm x 75 µm ID packed with 3µm C18

particles, Thermo Fisher Scientific) heated to 50 °C. Peptides were kept at 4° C and loaded onto

the column from an EASY-nLC (Thermo Fisher Scientific) autosampler. Chromatographic

conditions were as follows: 40 minute gradient at a flow rate of 300 nl/min starting with 100% A

(water), stepping up to 5% B (ACN) in 5 minutes, followed by 25% B at 35 minutes, followed

by a steep increase to 50% B at 38 minutes and 100% B at 40 minutes.

Targeted acquisition of eluting ions was performed by the mass spectrometer operated in SRM-

MS mode with Q1 and Q3 set to 0.7 m/z fwhm resolution and a cycle time of 1 second. For all

132

SRM-MS runs with the exception of the measurement of Phase 3 peptides EPS-urines from

Cohort B, unscheduled methods were used, each targeting approximately 200 transitions over a 1

second cycle time. For Cohort B, a single scheduled method was utilized (2 minute elution

window), monitoring approximately 200 transitions (3 transitions per peptide) over a 1 second

cycle time.

4.5.4 Phase 2 peptide selection

In Chapter 3, 133 differentially expressed proteins were identified when comparing the proteome

profiles of 16 direct-EPS samples from individuals with EC and OC prostatic tumors. A spectral

library was built from the resulting LTQ-Orbitrap XL data obtained using the Skyline software

tool (version 2.1.0) (350). All spectra had scores that passed a stringent peptide score as

determined by an X!Tandem target decoy search criteria of 0.5% decoy spectral matches. Protein

sequences were converted to FASTA format and uploaded into Skyline for the prediction of

proteotypic peptides. Peptides were chosen based on previously reported specifications (241). A

total of 232 proteotypic peptides (Phase 1 peptides) were selected and purchased as bulk heavy-

isotope labeled peptide standards (JPT Peptide Technologies). In order to assess the suitability of

the Phase 1 peptides for SRM-MS, 250 fmol of each heavy peptide standard was spiked into 1 ug

of EPS-urine-derived peptides, with 4-6 transitions monitored over a 40-minute chromatographic

gradient. Of the 232 Phase 1 peptides, 147 (Phase 2 peptides) were reproducibly detectable with

a minimum of three transitions in the complex EPS-urine background.

4.5.5 Phase 2 peptide quantification and Phase 3 peptide selection

A cohort of individual EPS-urines (n = 74) from a heterogeneous population of patients with EC,

OC, and control (BPH, normal) (Cohort A) was used to analyze the Phase 2 peptides. A total of 1

µg of peptide from each sample was spiked with 200 fmol of the heavy peptide standards.

Visualization and inspection of peaks was performed on Skyline. Each peptide was quantified in

a sample by integrating the quantifier ion (most intense ion) of the light peptide with its co-

eluting heavy peptide ion, in order to derive a light-to-heavy peptide ratio. The student’s T-test

was used to compare the ratios between cancer and controls, as well as EC and OC. A K-fold

cross-validation was performed to investigate the diagnostic and prognostic power of the

peptides at different p-value cutoffs. For the diagnostic and prognostic peptide candidates, p-

133

value cutoffs of 0.05 and 0.1 were used, respectively. Further refinements to the lists were made

by including peptides that did not meet the p-value cutoffs but were potentially promising. For

instance, peptides SSEDPNEDIVER from protein IGJ and TPAQFDADELR from protein

ANXA1 were added to the list of putative prognostic candidates because they were the only

candidates that were elevated in the EC tumor group (p-value = 0.25). Two KLK3 peptides

(HSQPWQVLVASR and LSEPAELTDAVK) were also added in order to monitor PSA levels in

EPS-urine. A total of 34 peptides (24 diagnostic and 14 prognostic peptides, of which 4

overlapped) were taken forward for verification. These peptides are henceforth referred to as

Phase 3 peptides.

4.5.6 Generation of response curves and determination of figures of

merit

Each of the Phase 3 peptides were obtained as highly purified, soluble light and heavy peptides

at defined concentrations (AQUA peptides, Thermo Fisher). A reversed response curve was

generated for each peptide by spiking various concentrations (0-250 fmol, 7 points) of the heavy

peptide into a background of 1 μg urine peptides and 50 fmol of the corresponding light peptide.

Each concentration point was measured in 3-5 technical replicates. The LOD and LLOQ for each

peptide were derived using the following equation: LOD or LOQ = (F x SDblank) / b, where F =

3.3 for LOD and 10 for LOQ and b = slope.

4.5.7 Multiplexed, scheduled selected reaction monitoring mass

spectrometry method

In order to increase throughput, a multiplexed SRM-MS assay was developed by scheduling all

34 candidates in a single 40-minute chromatographic gradient. A total of 3 transitions were

monitored for the light and heavy versions of each peptide for a total of 204 transitions per

analysis. A 2-minute acquisition time window was scheduled around the expected peptide

elution time.

4.5.8 Verification of Phase 3 peptides in EPS-urines

The verification a heterogeneous population of patients with pathological stage pT3 and pT2

prostate tumors, BPH, and normal individuals (n = 207) was enrolled (Cohort B). One μg of total

134

peptide from each sample was spiked with 100 fmol of heavy peptide and 10 fmol of

corresponding light peptide for all candidates with the exception of the KLK3 peptide,

HSQPWQVLVASR, which was spiked in at 500 fmol of heavy peptide. Visualization and

inspection of peaks was performed on Skyline. Each sample was analyzed in two technical

replicates using the same instrument parameters as described above.

4.5.9 Quantitative and statistical analyses

Each of the light and heavy peptides were checked for quality of data by observing co-elution of

all three transitions, alignment of light and heavy peptide elution times, and reproducibility

between technical replicates. Relative ratios of the area under the curve of the most predominant

ion (quantifier) of the light peptide versus the corresponding heavy quantifier were calculated.

For statistical analyses, the SRMstats software tool (version 1.6) (358) was utilized. Peptides

were comparatively tested for abundance differences between Cancer versus Normal; Cancer

versus BPH; Cancer versus Noncancer; BPH versus Normal; and pathological stages pT3 versus

pathological stage pT2. Peptides with a p-value ≤0.05 and a fold change ≥1.5 were considered

significant.

135

Chapter 5 Summary and future direction

5 Summary and future direction

5.1 Summary of key findings

In the current thesis, I used a proteomics platform encompassing the systematic discovery and

verification of putative protein biomarkers in prostate-proximal fluids. Overall, this body of work

provides a high-quality archive of proteins in expressed prostatic secretions (EPS). In addition,

proteins that are differentially expressed among subtypes of prostate cancer those that are

extracapsular (EC) and those that are organ-confined (OC) – were identified. By systematically

following up on these proteins, a number of peptides were verified to be differentially abundant

among these prostate cancer subtypes and a quantitative assay was developed. Ongoing

bioinformatics and statistical analyses are being used to generate a multi-feature signature. These

findings may facilitate the clinical management of prostate cancer by identifying patients who

may be spared from unnecessary medical procedures and have favourable prognoses, circumvent

the need for biopsies, and may be used for longitudinal tracking of disease progression.

Furthermore, proteomic analyses of additional EPS-urine-derived exosomes (126) were

performed during my candidacy, but were not covered in depth in this thesis. Taken together,

over 2000 proteins from prostate-proximal fluids have been comprehensively catalogued, with an

emphasis on the identification of prostate cancer biomarkers (Figure 5.1). A full database of

these datasets is available and can be further interrogated to answer various clinical questions.

5.1.1 In-depth proteomic analyses characterizing the direct and urinary expressed prostatic secretion proteomes

In the first investigation into the EPS proteome in our lab, myself and colleagues analyzed direct-

EPS from nine prostate cancer patients (260). Using multidimensional protein identification

technology (MudPIT) and spectral counting, 916 unique proteins were identified. This was

integrated with publicly available datasets, such as mRNA expression levels across various

human tissues, antibody availability, other proteomics datasets from related fluids and prostate

cancer cell secretomes, in order to characterize this proteome.

136

To describe the EPS-urine proteome, myself and colleagues identified 1022 unique proteins from

EPS-urines from prostate cancer and benign prostatic hyperplasia (BPH) patients (127).

Additionally, in order to identify prostate-enriched proteins, five patients with prostate cancer

and five noncancer individuals were screened pre- and post-digital rectal examination to obtain

urine and EPS-urine, respectively. This valuable, internally controlled sample set provided a total

of 444 unique proteins of which 49 proteins were significantly enriched in EPS-urine compared

to urine using a semi-quantitative method.

5.1.2 Discovery phase: identification of candidate biomarkers of extracapsular prostate cancer

In the discovery phase, a shotgun approach was taken to catalogue the proteomes of a prostate-

proximal fluid, direct expressed prostatic secretions (direct-EPS), from EC and OC prostate

cancers. In total, over 600 proteins were identified, of which 133 were differentially abundant in

the two groups by spectral counting. In an initial verification step, a systematic data mining

strategy was used to select a small subset of differentially expressed proteins. These 14

candidates were then analyzed by Western blotting in a cohort of direct-EPS linked with clinical

outcome information related to biochemical recurrence (BCR) within a two-year period post-

prostatectomy. Five candidates – SFN, MME, PARK7, TIMP1, TGM4 − showed a similar

pattern of differential expression between recurrent and non-recurrent patients as they did for EC

and OC, respectively. As one of the goals of my thesis was to demonstrate the detection of

potential candidates in a clinically relevant biofluid for non-invasive detection, EPS-urine was

obtained from EC and OC prostate cancers. As a proof-of-concept, selected reaction monitoring

mass spectrometry (SRM-MS) was employed for quantification of these candidates. By targeting

at least one proteotypic peptide, all candidates were quantified in the pooled EPS-urines by

SRM-MS. In conclusion, I established EPS as a promising resource of prostate cancer

biomarkers. Furthermore, I demonstrated SRM-MS as a valid method of rapid verification of

putative candidates in biological fluids.

5.1.3 Verification phase: targeted quantification of selected prostate cancer biomarker candidates

The successful application of SRM-MS to EPS-urines in the discovery phase prompted me to

utilize this high-throughput method for the accurate quantification of a larger number of

137

biomarker candidates. By revisiting the 133 differentially abundant proteins determined by

spectral counting, 147 proteotypic peptides corresponding to 82 proteins, were selected and

relatively quantified by SRM-MS in a cohort of individual EPS-urines from EC and OC patients,

as well as noncancer control individuals. From this analysis, 34 candidates were demonstrated to

have diagnostic and prognostic potential. Next, to verify these candidates, absolute quantification

was performed utilizing highly purified peptide standards and a single multiplexed SRM-MS

assay in a new cohort of EPS-urines. This high-quality information is being utilized to generate a

multi-feature diagnostic and prognostic signature with a high potential for facilitating the clinical

management of prostate cancer patients.

5.1.4 Additional proteomics analysis of prostate-proximal fluids

Although not covered in detail in this thesis, I carried out an additional proteomics investigation

into EPS-urine derived exosomes (126). In this study, myself and colleagues analyzed EPS-

urine-derived exosomes from prostate cancer patients and noncancer individuals using mass

spectrometry (126). A total of 877 unique proteins were identified. Characterization of this

proteome revealed a fraction of proteins that were not previously detected in the EPS (direct-EPS

and EPS-urine) proteome or other exosome proteomes.

Taken together, I comprehensively characterized the proteomes of various prostate-proximal

fluids. Through this, I demonstrated that these biofluids are rich sources of prostate-related

proteins and may be a valuable resource of potential disease biomarkers (Figure 5.1).

138

Figure 5.1 Overview of all proteomics datasets generated for various prostate-proximal

fluids. A total of approximately 2000 proteins are available in an in-house database. PCa,

prostate cancer; NC, noncancer; EC, extracapsular; OC, organ-confined; BPH, benign prostatic

hyperplasia.

5.2 Future directions

The data presented in this thesis provides verified proteomic information to generate a potential

non-invasive biomarker panel for the prognosis of prostate cancer. The studies fall into Phases 1

(preclinical exploratory studies) and 2 (clinical assay development) out of the 5 phases of the

biomarker development pipeline as set out by the Early Detection Research Network (112). The

results fulfill the primary aims of Phase 1 by identifying and prioritizing potential biomarkers in

a systematic manner, and providing quantitative measures of protein abundance changes between

subtypes of prostate cancers. In Phase 2, an assay is developed for a non-invasive specimen. Its

primary aim is to assess the clinical characteristics of the assay (such as its sensitivity and

specificity) and its ability to distinguish patient groups. In partial fulfillment of this, a single

multiplexed SRM-MS assay was systematically developed and utilized to generate a biomarker

signature.

139

Following the work presented in the current thesis, a number of critical steps need to be

undertaken in order to further validate the biomarkers for clinical use. First, the verified

differentially expressed peptides are currently being statistically evaluated to generate a multi-

feature signature for the non-invasive diagnosis and prognosis of prostate cancer. Following this,

a multi-institutional evaluation of the signature should be performed. For this, standardized

procedures for the harmonization of sample collection, processing, and storage need to be

instituted. The SRM-MS assay should be optimized for a clinical laboratory setting and standard

operating procedures should be drafted. Once implemented, the intra- and inter-laboratory

reproducibility of the standardized assay, and its analytical robustness must be validated.

Furthermore, the biomarkers presented in the thesis are continuous variables; thus clinically

relevant thresholds and time points that can accurately stratify patients need to be established and

validated (361). One of the major advantages of utilizing EPS-urine is the ease with which the

specimens can be drawn from patients. Therefore, the longitudinal collection of EPS-urine at

regular intervals should be performed in order to monitor changes in the levels of biomarkers and

their relationship to disease progression. Prospective and cancer-control studies should be

performed in large patient cohorts in order to determine the clinical utility of the biomarkers. The

primary aim of a prospective study is to determine the detection rate and the false referral rate of

the biomarker in a relevant population (112). In cancer-control studies, the aim is to estimate the

reduction in cancer burden in the population as a result of testing. Factors that may inhibit a

biomarker from being taken into the clinic include morbidity associated with the test, poor

compliance, and over-diagnosis of cancers that, if left undetected, would pose no threat to

patients (112).

One area that was not investigated in this thesis is the biological functions of putative candidates

and the molecular mechanisms that they are involved in. In Chapter 3, it was shown that a

number of different pathways are enriched in my dataset, including those that regulate the cell

cycle and metabolism (Figures 3.9 and 3.10). These and other candidates can be followed up by

studying the effects of silencing or over-expressing them in prostate cancer cell lines.

Characteristics such as proliferation, apoptosis, migration, and invasiveness can be evaluated

using methods such as proliferation and colony formation assays, annexin V/propidium iodide

apoptosis assay, wound healing, and Matrigel invasion assays. Furthermore, by integrating

140

genomic, transcriptomic, and epigenetic information into the analysis, a systems level view of

prostate cancer biology might be uncovered.

Our laboratory has been cataloguing the proteome of EPS for over 6 years (126, 127, 244, 260),

having identified over 2000 proteins in total (Figure 5.1). Advancing techniques are expected to

provide even deeper insight into this immensely complex, yet highly valuable class of biofluids.

Subfractionation, by extracting exosomes or glycoproteins, for instance, is an active area of

investigation for the identification of novel secreted biomarkers and molecular players in disease

progression. Strategies that enrich for exosomes (e.g., differential ultracentrifugation) and

glycoprotein capture (e.g., solid phase extraction of glycopeptides) greatly enhance the dynamic

range and limit of detection of proteins in highly complex patient specimens.

Extracellular vesicles are small (40-5000 nm diameter) membrane-bound vesicles that carry

biomolecular cargo (proteins, nucleic acids, and lipids) from their cell of origin (178, 362), and

are involved in a myriad of biological processes (179, 363-367). Cancer cells are reported to

increase the release of extracellular vesicles into the extracellular environment compared to

noncancer cells (368-370), and thus these factors may represent an important source of fluid-

derived cancer biomarkers. Indeed, studies have demonstrated the presence of “prostasomes” –

prostate-derived extracellular vesicles – in body fluids, cancer cell lines, and tissue (126, 371-

373). In our lab, myself and colleagues have characterized the proteomes of EPS-urine-derived

exosomes using shotgun proteomics (126). Almost 900 proteins were identified, of which

approximately 200 were not previously detected in direct-EPS and EPS-urines analyzed in our

lab.

Protein glycosylation, the post-translational modification of proteins by the covalent addition of

sugar moieties, plays a fundamental role in a wide range of biological processes (374, 375).

Indeed, alterations in protein glycosylation patterns is a recognized hallmark of cancer

progression (376, 377). Profiling the changes that occur in this subproteome as a result of disease

can provide novel biomarkers for diagnosis and prognosis of cancer. The majority of membrane

and secreted proteins are glycosylated (378); in fact, many of the clinical biomarkers in use

today, such as PSA in prostate cancer, CA125 in ovarian cancer, and Her2/neu in breast cancer,

are glycoproteins.

141

As a continuation of the work presented in this thesis, an in-depth proteomic profiling of such

EPS subproteomes will provide additional putative biomarker candidates that were not

previously identified. As mentioned above, by enriching for these proteins, low abundance

species may be more readily resolved.

Proteomics technologies are continuously evolving to provide even deeper proteome coverage,

better quantitation, and higher throughput. The data independent acquisition (DIA) mode of mass

spectrometry analysis has recently emerged as a powerful tool for simultaneous peptide

identification and quantification (379). In DIA mode, all peptides in a mass-to-charge (m/z)

window of a defined width and a retention time range are fragmented, and MS/MS data is

acquired from all detectable precursor ions in the isolation window. This process is repeated for

the next m/z window in an iterative fashion across the full m/z range, producing a complete and

permanent time-resolved record of all fragment ions of detectable peptide precursors in a sample

(380). With recent improvements in DIA data analysis platforms, the highly complex, composite

fragment ion spectra can be linked back to the precursors from which they originate using a

priori information contained in MS/MS spectral libraries (381). This method increases coverage

and reproducibility of the data by acquiring MS/MS data from all detectable precursor ions in a

sample, while making retrospective data analysis a possibility.

Some of the limitations of SRM-MS assays can be circumvented by a high-resolution, accurate-

mass mass spectrometer. For instance, the Thermo Scientific Q Exactive is a hybrid quadrupole-

Orbitrap mass spectrometer that can be utilized for sensitive targeted quantification of proteins in

complex biological material. In parallel reaction monitoring (PRM) mode, a target precursor ion

is isolated in a quadrupole, fragmented by high energy collisional dissociation, and all resulting

product ions are detected in an Orbitrap mass analyzer, yielding a MS/MS spectrum (382, 383).

Quantification is achieved post-acquisition, by extracting one or more of the fragment ions.

Peterson and colleagues demonstrated similar analytical performance of PRM to that of SRM-

MS in terms of dynamic range, linearity, and precision (382). The major advantage of PRM over

SRM-MS is the improved selectivity by recording all fragment ions in the MS/MS scan, over the

few transitions recorded in SRM-MS. In this way, the assay development time is also reduced

since there is no need to preselect target transitions.

142

5.3 Concluding remarks

Personalized medicine, a field of health care that utilizes each individual’s unique clinical,

genetic, genomic, and environmental information to optimize and guide clinical decision making

(384), is emerging as a feasible and promising modality of cancer management. This is largely

due to tremendous progress made in genomic medicine that encompasses information from

genomes and their cognate factors such as RNA, proteins, and metabolites. This information is

helping researchers navigate the complex landscape of tumor biology and providing a

comprehensive view, enabling more precise risk assessment, diagnosis and prognosis, and

response to therapy (385). For example, protein expression patterns, such as those that were

presented in this thesis, can definitively classify patients into different prognostic subgroups,

facilitating strategic health planning. Continued scientific efforts are expected to lead the way in

personalized medicine and shift the paradigm of healthcare from disease treatment to disease

prevention (384).

143

References

1. McNeal, J. E. (1988) Normal histology of the prostate. The American journal of surgical

pathology 12, 619-633

2. Abate-Shen, C., and Shen, M. M. (2000) Molecular genetics of prostate cancer. Genes Dev

14, 2410-2434

3. Brawer, M. K., Peehl, D. M., Stamey, T. A., and Bostwick, D. G. (1985) Keratin

immunoreactivity in the benign and neoplastic human prostate. Cancer research 45, 3663-

3667

4. Liu, A. Y., True, L. D., LaTray, L., Nelson, P. S., Ellis, W. J., Vessella, R. L., Lange, P. H.,

Hood, L., and van den Engh, G. (1997) Cell-cell interaction in prostate gene regulation and

cytodifferentiation. Proc Natl Acad Sci U S A 94, 10705-10710

5. Nagle, R. B., Ahmann, F. R., McDaniel, K. M., Paquin, M. L., Clark, V. A., and Celniker,

A. (1987) Cytokeratin characterization of human prostatic carcinoma and its derived cell

lines. Cancer research 47, 281-286

6. Sherwood, E. R., Berg, L. A., Mitchell, N. J., McNeal, J. E., Kozlowski, J. M., and Lee, C.

(1990) Differential cytokeratin expression in normal, hyperplastic and malignant epithelial

cells from human prostate. J Urol 143, 167-171

7. Verhagen, A. P., Aalders, T. W., Ramaekers, F. C., Debruyne, F. M., and Schalken, J. A.

(1988) Differential expression of keratins in the basal and luminal compartments of rat

prostatic epithelium during degeneration and regeneration. The Prostate 13, 25-38

8. Bui, M., and Reiter, R. E. (1998) Stem cell genes in androgen-independent prostate cancer.

Cancer metastasis reviews 17, 391-399

9. Abrahamsson, P. A., Cockett, A. T., and di Sant'Agnese, P. A. (1998) Prognostic

significance of neuroendocrine differentiation in clinically localized prostatic carcinoma.

The Prostate. Supplement 8, 37-42

10. di Sant'Agnese, P. A. (1998) Neuroendocrine cells of the prostate and neuroendocrine

differentiation in prostatic carcinoma: a review of morphologic aspects. Urology 51, 121-

124

11. (2014) Canadian Cancer Statistics. In: Society, C. C., ed.

12. Sakr, W. A., Haas, G. P., Cassin, B. F., Pontes, J. E., and Crissman, J. D. (1993) The

frequency of carcinoma and intraepithelial neoplasia of the prostate in young male patients.

J Urol 150, 379-385

13. (1998-2002) World Health Organization Mortality Database. World Health Organization

Mortality Database

144

14. Gronberg, H. (2003) Prostate cancer epidemiology. Lancet 361, 859-864

15. Greene, K. L., Cowan, J. E., Cooperberg, M. R., Meng, M. V., DuChane, J., and Carroll, P.

R. (2005) Who is the average patient presenting with prostate cancer? Urology 66, 76-82

16. Schroder, F. H., Hugosson, J., Roobol, M. J., Tammela, T. L., Ciatto, S., Nelen, V.,

Kwiatkowski, M., Lujan, M., Lilja, H., Zappa, M., Denis, L. J., Recker, F., Berenguer, A.,

Maattanen, L., Bangma, C. H., Aus, G., Villers, A., Rebillard, X., van der Kwast, T.,

Blijenberg, B. G., Moss, S. M., de Koning, H. J., and Auvinen, A. (2009) Screening and

prostate-cancer mortality in a randomized European study. N Engl J Med 360, 1320-1328

17. Parkin, D. M., Bray, F. I., and Devesa, S. S. (2001) Cancer burden in the year 2000. The

global picture. European journal of cancer 37 Suppl 8, S4-66

18. Potosky, A. L., Miller, B. A., Albertsen, P. C., and Kramer, B. S. (1995) The role of

increasing detection in the rising incidence of prostate cancer. JAMA 273, 548-552

19. Osegbe, D. N. (1997) Prostate cancer in Nigerians: facts and nonfacts. J Urol 157, 1340-

1343

20. Ruijter, E., van de Kaa, C., Miller, G., Ruiter, D., Debruyne, F., and Schalken, J. (1999)

Molecular genetics and epidemiology of prostate carcinoma. Endocrine reviews 20, 22-45

21. Meikle, A. W., Smith, J. A., and West, D. W. (1985) Familial factors affecting prostatic

cancer risk and plasma sex-steroid levels. The Prostate 6, 121-128

22. Isaacs, S. D., Kiemeney, L. A., Baffoe-Bonnie, A., Beaty, T. H., and Walsh, P. C. (1995)

Risk of cancer in relatives of prostate cancer probands. Journal of the National Cancer

Institute 87, 991-996

23. Carter, B. S., Bova, G. S., Beaty, T. H., Steinberg, G. D., Childs, B., Isaacs, W. B., and

Walsh, P. C. (1993) Hereditary prostate cancer: epidemiologic and clinical features. J Urol

150, 797-802

24. Gronberg, H., Isaacs, S. D., Smith, J. R., Carpten, J. D., Bova, G. S., Freije, D., Xu, J.,

Meyers, D. A., Collins, F. S., Trent, J. M., Walsh, P. C., and Isaacs, W. B. (1997)

Characteristics of prostate cancer in families potentially linked to the hereditary prostate

cancer 1 (HPC1) locus. JAMA 278, 1251-1255

25. Serrano, M., Hannon, G. J., and Beach, D. (1993) A new regulatory motif in cell-cycle

control causing specific inhibition of cyclin D/CDK4. Nature 366, 704-707

26. Morton, R. A., Jr., Watkins, J. J., Bova, G. S., Wales, M. M., Baylin, S. B., and Isaacs, W.

B. (1996) Hypermethylation of chromosome 17P locus D17S5 in human prostate tissue. J

Urol 156, 512-516

27. Wilbert, D. M., Griffin, J. E., and Wilson, J. D. (1983) Characterization of the cytosol

androgen receptor of the human prostate. The Journal of clinical endocrinology and

metabolism 56, 113-120

145

28. Taplin, M. E., Bubley, G. J., Shuster, T. D., Frantz, M. E., Spooner, A. E., Ogata, G. K.,

Keer, H. N., and Balk, S. P. (1995) Mutation of the androgen-receptor gene in metastatic

androgen-independent prostate cancer. N Engl J Med 332, 1393-1398

29. Labrie, F., Belanger, A., Dupont, A., Luu-The, V., Simard, J., and Labrie, C. (1993) Science

behind total androgen blockade: from gene to combination therapy. Clinical and

investigative medicine. Medecine clinique et experimentale 16, 475-492

30. Culig, Z., Klocker, H., Eberle, J., Kaspar, F., Hobisch, A., Cronauer, M. V., and Bartsch, G.

(1993) DNA sequence of the androgen receptor in prostatic tumor cell lines and tissue

specimens assessed by means of the polymerase chain reaction. The Prostate 22, 11-22

31. Hobisch, A., Culig, Z., Radmayr, C., Bartsch, G., Klocker, H., and Hittmair, A. (1995)

Distant metastases from prostatic carcinoma express androgen receptor protein. Cancer

research 55, 3068-3072

32. Gottlieb, B., Lehvaslaiho, H., Beitel, L. K., Lumbroso, R., Pinsky, L., and Trifiro, M. (1998)

The Androgen Receptor Gene Mutations Database. Nucleic Acids Res 26, 234-238

33. Aihara, M., Wheeler, T. M., Ohori, M., and Scardino, P. T. (1994) Heterogeneity of prostate

cancer in radical prostatectomy specimens. Urology 43, 60-66; discussion 66-67

34. Bostwick, D. G., Shan, A., Qian, J., Darson, M., Maihle, N. J., Jenkins, R. B., and Cheng, L.

(1998) Independent origin of multiple foci of prostatic intraepithelial neoplasia: comparison

with matched foci of prostate carcinoma. Cancer 83, 1995-2002

35. Clark, J., Attard, G., Jhavar, S., Flohr, P., Reid, A., De-Bono, J., Eeles, R., Scardino, P.,

Cuzick, J., Fisher, G., Parker, M. D., Foster, C. S., Berney, D., Kovacs, G., and Cooper, C.

S. (2008) Complex patterns of ETS gene alteration arise during cancer development in the

human prostate. Oncogene 27, 1993-2003

36. Macintosh, C. A., Stower, M., Reid, N., and Maitland, N. J. (1998) Precise microdissection

of human prostate cancers reveals genotypic heterogeneity. Cancer research 58, 23-28

37. Mehra, R., Han, B., Tomlins, S. A., Wang, L., Menon, A., Wasco, M. J., Shen, R., Montie,

J. E., Chinnaiyan, A. M., and Shah, R. B. (2007) Heterogeneity of TMPRSS2 gene

rearrangements in multifocal prostate adenocarcinoma: molecular evidence for an

independent group of diseases. Cancer research 67, 7991-7995

38. Liu, W., Laitinen, S., Khan, S., Vihinen, M., Kowalski, J., Yu, G., Chen, L., Ewing, C. M.,

Eisenberger, M. A., Carducci, M. A., Nelson, W. G., Yegnasubramanian, S., Luo, J., Wang,

Y., Xu, J., Isaacs, W. B., Visakorpi, T., and Bova, G. S. (2009) Copy number analysis

indicates monoclonal origin of lethal metastatic prostate cancer. Nature medicine 15, 559-

565

39. Mehra, R., Tomlins, S. A., Yu, J., Cao, X., Wang, L., Menon, A., Rubin, M. A., Pienta, K.

J., Shah, R. B., and Chinnaiyan, A. M. (2008) Characterization of TMPRSS2-ETS gene

aberrations in androgen-independent metastatic prostate cancer. Cancer research 68, 3584-

3590

146

40. Shen, M. M., and Abate-Shen, C. (2010) Molecular genetics of prostate cancer: new

prospects for old challenges. Genes Dev 24, 1967-2000

41. Sakr, W. A., Grignon, D. J., Crissman, J. D., Heilbrun, L. K., Cassin, B. J., Pontes, J. J., and

Haas, G. P. (1994) High grade prostatic intraepithelial neoplasia (HGPIN) and prostatic

adenocarcinoma between the ages of 20-69: an autopsy study of 249 cases. In vivo 8, 439-

443

42. Shiraishi, T., Watanabe, M., Matsuura, H., Kusano, I., Yatani, R., and Stemmermann, G. N.

(1994) The frequency of latent prostatic carcinoma in young males: the Japanese experience.

In vivo 8, 445-447

43. Yatani, R., Kusano, I., Shiraishi, T., Hayashi, T., and Stemmermann, G. N. (1989) Latent

prostatic carcinoma: pathological and epidemiological aspects. Japanese journal of clinical

oncology 19, 319-326

44. Bostwick, D. G., and Cheng, L. (2012) Precursors of prostate cancer. Histopathology 60, 4-

27

45. Bostwick, D. G. (1989) Prostatic intraepithelial neoplasia (PIN). Urology 34, 16-22

46. De Marzo, A. M., Meeker, A. K., Zha, S., Luo, J., Nakayama, M., Platz, E. A., Isaacs, W.

B., and Nelson, W. G. (2003) Human prostate cancer precursors and pathobiology. Urology

62, 55-62

47. Bostwick, D. G., and Brawer, M. K. (1987) Prostatic intra-epithelial neoplasia and early

invasion in prostate cancer. Cancer 59, 788-794

48. Haggman, M. J., Macoska, J. A., Wojno, K. J., and Oesterling, J. E. (1997) The relationship

between prostatic intraepithelial neoplasia and prostate cancer: critical issues. J Urol 158,

12-22

49. Haggman, M. J., Wojno, K. J., Pearsall, C. P., and Macoska, J. A. (1997) Allelic loss of 8p

sequences in prostatic intraepithelial neoplasia and carcinoma. Urology 50, 643-647

50. Nagle, R. B., Brawer, M. K., Kittelson, J., and Clark, V. (1991) Phenotypic relationships of

prostatic intraepithelial neoplasia to invasive prostatic carcinoma. The American journal of

pathology 138, 119-128

51. Qian, J., Bostwick, D. G., Takahashi, S., Borell, T. J., Herath, J. F., Lieber, M. M., and

Jenkins, R. B. (1995) Chromosomal anomalies in prostatic intraepithelial neoplasia and

carcinoma detected by fluorescence in situ hybridization. Cancer research 55, 5408-5414

52. Sakr, W. A., Macoska, J. A., Benson, P., Grignon, D. J., Wolman, S. R., Pontes, J. E., and

Crissman, J. D. (1994) Allelic loss in locally metastatic, multisampled prostate cancer.

Cancer research 54, 3273-3277

53. Vocke, C. D., Pozzatti, R. O., Bostwick, D. G., Florence, C. D., Jennings, S. B., Strup, S. E.,

Duray, P. H., Liotta, L. A., Emmert-Buck, M. R., and Linehan, W. M. (1996) Analysis of 99

147

microdissected prostate carcinomas reveals a high frequency of allelic loss on chromosome

8p12-21. Cancer research 56, 2411-2416

54. Lavery, H. J., and Droller, M. J. (2012) Do Gleason patterns 3 and 4 prostate cancer

represent separate disease states? J Urol 188, 1667-1675

55. Bubendorf, L., Schopfer, A., Wagner, U., Sauter, G., Moch, H., Willi, N., Gasser, T. C., and

Mihatsch, M. J. (2000) Metastatic patterns of prostate cancer: an autopsy study of 1,589

patients. Human pathology 31, 578-583

56. Logothetis, C. J., and Lin, S. H. (2005) Osteoblasts in prostate cancer metastasis to bone.

Nat Rev Cancer 5, 21-28

57. Attard, G., Swennenhuis, J. F., Olmos, D., Reid, A. H., Vickers, E., A'Hern, R., Levink, R.,

Coumans, F., Moreira, J., Riisnaes, R., Oommen, N. B., Hawche, G., Jameson, C.,

Thompson, E., Sipkema, R., Carden, C. P., Parker, C., Dearnaley, D., Kaye, S. B., Cooper,

C. S., Molina, A., Cox, M. E., Terstappen, L. W., and de Bono, J. S. (2009) Characterization

of ERG, AR and PTEN gene status in circulating tumor cells from patients with castration-

resistant prostate cancer. Cancer research 69, 2912-2918

58. Holcomb, I. N., Grove, D. I., Kinnunen, M., Friedman, C. L., Gallaher, I. S., Morgan, T. M.,

Sather, C. L., Delrow, J. J., Nelson, P. S., Lange, P. H., Ellis, W. J., True, L. D., Young, J.

M., Hsu, L., Trask, B. J., and Vessella, R. L. (2008) Genomic alterations indicate tumor

origin and varied metastatic potential of disseminated cells from prostate cancer patients.


59. Leversha, M. A., Han, J., Asgari, Z., Danila, D. C., Lin, O., Gonzalez-Espinoza, R., Anand,

A., Lilja, H., Heller, G., Fleisher, M., and Scher, H. I. (2009) Fluorescence in situ

hybridization analysis of circulating tumor cells in metastatic prostate cancer. Clin Cancer

Res 15, 2091-2097

60. Ploussard, G., Epstein, J. I., Montironi, R., Carroll, P. R., Wirth, M., Grimm, M. O., Bjartell,

A. S., Montorsi, F., Freedland, S. J., Erbersdobler, A., and van der Kwast, T. H. (2011) The

contemporary concept of significant versus insignificant prostate cancer. Eur Urol 60, 291-

303

61. Epstein, J. I., Walsh, P. C., Carmichael, M., and Brendler, C. B. (1994) Pathologic and

clinical findings to predict tumor extent of nonpalpable (stage T1c) prostate cancer. JAMA

271, 368-374

62. Loeb, S., Bjurlin, M. A., Nicholson, J., Tammela, T. L., Penson, D. F., Carter, H. B.,

Carroll, P., and Etzioni, R. (2014) Overdiagnosis and overtreatment of prostate cancer. Eur

Urol 65, 1046-1055

63. Epstein, J. I., Pizov, G., and Walsh, P. C. (1993) Correlation of pathologic findings with

progression after radical retropubic prostatectomy. Cancer 71, 3582-3593

64. Epstein, J. I., Chan, D. W., Sokoll, L. J., Walsh, P. C., Cox, J. L., Rittenhouse, H., Wolfert,

R., and Carter, H. B. (1998) Nonpalpable stage T1c prostate cancer: prediction of

148

insignificant disease using free/total prostate specific antigen levels and needle biopsy

findings. J Urol 160, 2407-2411

65. Kapoor, J., Namdarian, B., Pedersen, J., Hovens, C., Moon, D., Peters, J., Costello, A. J.,

Ruljancich, P., and Corcoran, N. M. (2013) Extraprostatic extension into periprostatic fat is

a more important determinant of prostate cancer recurrence than an invasive phenotype. J

Urol 190, 2061-2066

66. van Oort, I. M., Kok, D. E., Kiemeney, L. A., Hulsbergen-van de Kaa, C. A., and Witjes, J.

A. (2009) A single institution experience with biochemical recurrence after radical

prostatectomy for tumors that on pathology are of small volume or "insignificant". Urol

Oncol 27, 509-513

67. Moul, J. W. (2000) Prostate specific antigen only progression of prostate cancer. J Urol 163,

1632-1642

68. Polascik, T. J., Oesterling, J. E., and Partin, A. W. (1999) Prostate specific antigen: a decade

of discovery--what we have learned and where we are going. J Urol 162, 293-306

69. Boorjian, S. A., Thompson, R. H., Tollefson, M. K., Rangel, L. J., Bergstralh, E. J., Blute,

M. L., and Karnes, R. J. (2011) Long-term risk of clinical progression after biochemical

recurrence following radical prostatectomy: the impact of time from surgery to recurrence.

Eur Urol 59, 893-899

70. Pound, C. R., Partin, A. W., Eisenberger, M. A., Chan, D. W., Pearson, J. D., and Walsh, P.

C. (1999) Natural history of progression after PSA elevation following radical

prostatectomy. JAMA 281, 1591-1597

71. Kapadia, N. S., Olson, K., Sandler, H. M., Feng, F. Y., and Hamstra, D. A. (2012) Interval

to biochemical failure as a biomarker for cause-specific and overall survival after dose-

escalated external beam radiation therapy for prostate cancer. Cancer 118, 2059-2068

72. Freedland, S. J., Humphreys, E. B., Mangold, L. A., Eisenberger, M., Dorey, F. J., Walsh, P.

C., and Partin, A. W. (2005) Risk of prostate cancer-specific mortality following

biochemical recurrence after radical prostatectomy. JAMA 294, 433-439

73. Baca, S. C., Prandi, D., Lawrence, M. S., Mosquera, J. M., Romanel, A., Drier, Y., Park, K.,

Kitabayashi, N., MacDonald, T. Y., Ghandi, M., Van Allen, E., Kryukov, G. V., Sboner, A.,

Theurillat, J. P., Soong, T. D., Nickerson, E., Auclair, D., Tewari, A., Beltran, H., Onofrio,

R. C., Boysen, G., Guiducci, C., Barbieri, C. E., Cibulskis, K., Sivachenko, A., Carter, S. L.,

Saksena, G., Voet, D., Ramos, A. H., Winckler, W., Cipicchio, M., Ardlie, K., Kantoff, P.

W., Berger, M. F., Gabriel, S. B., Golub, T. R., Meyerson, M., Lander, E. S., Elemento, O.,

Getz, G., Demichelis, F., Rubin, M. A., and Garraway, L. A. (2013) Punctuated evolution of

prostate cancer genomes. Cell 153, 666-677

74. Schoenborn, J. R., Nelson, P., and Fang, M. (2013) Genomic profiling defines subtypes of

prostate cancer with the potential for therapeutic stratification. Clin Cancer Res 19, 4058-

4066

149

75. Beroukhim, R., Mermel, C. H., Porter, D., Wei, G., Raychaudhuri, S., Donovan, J.,

Barretina, J., Boehm, J. S., Dobson, J., Urashima, M., Mc Henry, K. T., Pinchback, R. M.,

Ligon, A. H., Cho, Y. J., Haery, L., Greulich, H., Reich, M., Winckler, W., Lawrence, M.

S., Weir, B. A., Tanaka, K. E., Chiang, D. Y., Bass, A. J., Loo, A., Hoffman, C., Prensner,

J., Liefeld, T., Gao, Q., Yecies, D., Signoretti, S., Maher, E., Kaye, F. J., Sasaki, H., Tepper,

J. E., Fletcher, J. A., Tabernero, J., Baselga, J., Tsao, M. S., Demichelis, F., Rubin, M. A.,

Janne, P. A., Daly, M. J., Nucera, C., Levine, R. L., Ebert, B. L., Gabriel, S., Rustgi, A. K.,

Antonescu, C. R., Ladanyi, M., Letai, A., Garraway, L. A., Loda, M., Beer, D. G., True, L.

D., Okamoto, A., Pomeroy, S. L., Singer, S., Golub, T. R., Lander, E. S., Getz, G., Sellers,

W. R., and Meyerson, M. (2010) The landscape of somatic copy-number alteration across

human cancers. Nature 463, 899-905

76. Taylor, B. S., Schultz, N., Hieronymus, H., Gopalan, A., Xiao, Y., Carver, B. S., Arora, V.

K., Kaushik, P., Cerami, E., Reva, B., Antipin, Y., Mitsiades, N., Landers, T., Dolgalev, I.,

Major, J. E., Wilson, M., Socci, N. D., Lash, A. E., Heguy, A., Eastham, J. A., Scher, H. I.,

Reuter, V. E., Scardino, P. T., Sander, C., Sawyers, C. L., and Gerald, W. L. (2010)

Integrative genomic profiling of human prostate cancer. Cancer Cell 18, 11-22

77. Tomlins, S. A., Rhodes, D. R., Perner, S., Dhanasekaran, S. M., Mehra, R., Sun, X. W.,

Varambally, S., Cao, X., Tchinda, J., Kuefer, R., Lee, C., Montie, J. E., Shah, R. B., Pienta,

K. J., Rubin, M. A., and Chinnaiyan, A. M. (2005) Recurrent fusion of TMPRSS2 and ETS

transcription factor genes in prostate cancer. Science 310, 644-648

78. Barbieri, C. E., Baca, S. C., Lawrence, M. S., Demichelis, F., Blattner, M., Theurillat, J. P.,

White, T. A., Stojanov, P., Van Allen, E., Stransky, N., Nickerson, E., Chae, S. S., Boysen,

G., Auclair, D., Onofrio, R. C., Park, K., Kitabayashi, N., MacDonald, T. Y., Sheikh, K.,

Vuong, T., Guiducci, C., Cibulskis, K., Sivachenko, A., Carter, S. L., Saksena, G., Voet, D.,

Hussain, W. M., Ramos, A. H., Winckler, W., Redman, M. C., Ardlie, K., Tewari, A. K.,

Mosquera, J. M., Rupp, N., Wild, P. J., Moch, H., Morrissey, C., Nelson, P. S., Kantoff, P.

W., Gabriel, S. B., Golub, T. R., Meyerson, M., Lander, E. S., Getz, G., Rubin, M. A., and

Garraway, L. A. (2012) Exome sequencing identifies recurrent SPOP, FOXA1 and MED12

mutations in prostate cancer. Nature genetics 44, 685-689

79. Kan, Z., Jaiswal, B. S., Stinson, J., Janakiraman, V., Bhatt, D., Stern, H. M., Yue, P.,

Haverty, P. M., Bourgon, R., Zheng, J., Moorhead, M., Chaudhuri, S., Tomsho, L. P.,

Peters, B. A., Pujara, K., Cordes, S., Davis, D. P., Carlton, V. E., Yuan, W., Li, L., Wang,

W., Eigenbrot, C., Kaminker, J. S., Eberhard, D. A., Waring, P., Schuster, S. C., Modrusan,

Z., Zhang, Z., Stokoe, D., de Sauvage, F. J., Faham, M., and Seshagiri, S. (2010) Diverse

somatic mutation patterns and pathway alterations in human cancers. Nature 466, 869-873

80. Kumar, A., White, T. A., MacKenzie, A. P., Clegg, N., Lee, C., Dumpit, R. F., Coleman, I.,

Ng, S. B., Salipante, S. J., Rieder, M. J., Nickerson, D. A., Corey, E., Lange, P. H.,

Morrissey, C., Vessella, R. L., Nelson, P. S., and Shendure, J. (2011) Exome sequencing

identifies a spectrum of mutation frequencies in advanced and lethal prostate cancers. Proc

Natl Acad Sci U S A 108, 17087-17092

81. Grasso, C. S., Wu, Y. M., Robinson, D. R., Cao, X., Dhanasekaran, S. M., Khan, A. P.,

Quist, M. J., Jing, X., Lonigro, R. J., Brenner, J. C., Asangani, I. A., Ateeq, B., Chun, S. Y.,

150

Siddiqui, J., Sam, L., Anstett, M., Mehra, R., Prensner, J. R., Palanisamy, N., Ryslik, G. A.,

Vandin, F., Raphael, B. J., Kunju, L. P., Rhodes, D. R., Pienta, K. J., Chinnaiyan, A. M.,

and Tomlins, S. A. (2012) The mutational landscape of lethal castration-resistant prostate

cancer. Nature 487, 239-243

82. Eastham, J. A., Stapleton, A. M., Gousse, A. E., Timme, T. L., Yang, G., Slawin, K. M.,

Wheeler, T. M., Scardino, P. T., and Thompson, T. C. (1995) Association of p53 mutations

with metastatic prostate cancer. Clin Cancer Res 1, 1111-1118

83. Cairns, P., Okami, K., Halachmi, S., Halachmi, N., Esteller, M., Herman, J. G., Jen, J.,

Isaacs, W. B., Bova, G. S., and Sidransky, D. (1997) Frequent inactivation of

PTEN/MMAC1 in primary prostate cancer. Cancer research 57, 4997-5000

84. Herschman, J. D., Smith, D. S., and Catalona, W. J. (1997) Effect of ejaculation on serum

total and free prostate-specific antigen concentrations. Urology 50, 239-243

85. Wolf, A. M., Wender, R. C., Etzioni, R. B., Thompson, I. M., D'Amico, A. V., Volk, R. J.,

Brooks, D. D., Dash, C., Guessous, I., Andrews, K., DeSantis, C., Smith, R. A., and

American Cancer Society Prostate Cancer Advisory, C. (2010) American Cancer Society

guideline for the early detection of prostate cancer: update 2010. CA: a cancer journal for

clinicians 60, 70-98

86. Brawer, M. K., Chetner, M. P., Beatie, J., Buchner, D. M., Vessella, R. L., and Lange, P. H.

(1992) Screening for prostatic carcinoma with prostate specific antigen. J Urol 147, 841-845

87. Catalona, W. J., Richie, J. P., Ahmann, F. R., Hudson, M. A., Scardino, P. T., Flanigan, R.

C., deKernion, J. B., Ratliff, T. L., Kavoussi, L. R., and Dalkin, B. L. (1994) Comparison of

digital rectal examination and serum prostate specific antigen in the early detection of

prostate cancer: results of a multicenter clinical trial of 6,630 men. J Urol 151, 1283-1290

88. Schroder, F. H., van der Cruijsen-Koeter, I., de Koning, H. J., Vis, A. N., Hoedemaeker, R.

F., and Kranse, R. (2000) Prostate cancer detection at low prostate specific antigen. J Urol

163, 806-812

89. Coley, C. M., Barry, M. J., Fleming, C., and Mulley, A. G. (1997) Early detection of

prostate cancer. Part I: Prior probability and effectiveness of tests. The American College of

Physicians. Annals of internal medicine 126, 394-406

90. Thompson, I. M., Pauler, D. K., Goodman, P. J., Tangen, C. M., Lucia, M. S., Parnes, H. L.,

Minasian, L. M., Ford, L. G., Lippman, S. M., Crawford, E. D., Crowley, J. J., and Coltman,

C. A., Jr. (2004) Prevalence of prostate cancer among men with a prostate-specific antigen

level < or =4.0 ng per milliliter. N Engl J Med 350, 2239-2246

91. Loeb, S., Vonesh, E. F., Metter, E. J., Carter, H. B., Gann, P. H., and Catalona, W. J. (2011)

What is the true number needed to screen and treat to save a life with prostate-specific

antigen testing? J Clin Oncol 29, 464-467

92. Grisanzio, C., and Signoretti, S. (2008) p63 in prostate biology and pathology. Journal of

cellular biochemistry 103, 1354-1368

151

93. Humphrey, P. A. (2007) Diagnosis of adenocarcinoma in prostate needle biopsy tissue.

Journal of clinical pathology 60, 35-42

94. Jiang, Z., Li, C., Fischer, A., Dresser, K., and Woda, B. A. (2005) Using an AMACR

(P504S)/34betaE12/p63 cocktail for the detection of small focal prostate carcinoma in

needle biopsy specimens. American journal of clinical pathology 123, 231-236

95. Luo, J., Zha, S., Gage, W. R., Dunn, T. A., Hicks, J. L., Bennett, C. J., Ewing, C. M., Platz,

E. A., Ferdinandusse, S., Wanders, R. J., Trent, J. M., Isaacs, W. B., and De Marzo, A. M.

(2002) Alpha-methylacyl-CoA racemase: a new molecular marker for prostate cancer.


96. Hayes, D. F., Bast, R. C., Desch, C. E., Fritsche, H., Jr., Kemeny, N. E., Jessup, J. M.,

Locker, G. Y., Macdonald, J. S., Mennel, R. G., Norton, L., Ravdin, P., Taube, S., and

Winn, R. J. (1996) Tumor marker utility grading system: a framework to evaluate clinical

utility of tumor markers. Journal of the National Cancer Institute 88, 1456-1466

97. Gleason, D. F., and Mellinger, G. T. (1974) Prediction of prognosis for prostatic

adenocarcinoma by combined histological grading and clinical staging. J Urol 111, 58-64

98. Gleason, D. F. (1990) Histologic grading of prostatic carcinoma. In Pathology of the

Prostate, Churchill Livingstone, New York

99. Martin, N. E., Mucci, L. A., Loda, M., and Depinho, R. A. (2011) Prognostic determinants

in prostate cancer. Cancer J 17, 429-437

100. D'Amico, A. V., Whittington, R., Malkowicz, S. B., Schultz, D., Blank, K., Broderick, G.

A., Tomaszewski, J. E., Renshaw, A. A., Kaplan, I., Beard, C. J., and Wein, A. (1998)

Biochemical outcome after radical prostatectomy, external beam radiation therapy, or

interstitial radiation therapy for clinically localized prostate cancer. JAMA 280, 969-974

101. Stark, J. R., Perner, S., Stampfer, M. J., Sinnott, J. A., Finn, S., Eisenstein, A. S., Ma, J.,

Fiorentino, M., Kurth, T., Loda, M., Giovannucci, E. L., Rubin, M. A., and Mucci, L. A.

(2009) Gleason score and lethal prostate cancer: does 3 + 4 = 4 + 3? J Clin Oncol 27, 3459-

3464

102. Wright, J. L., Salinas, C. A., Lin, D. W., Kolb, S., Koopmeiners, J., Feng, Z., and Stanford,

J. L. (2009) Prostate cancer specific mortality and Gleason 7 disease differences in prostate

cancer outcomes between cases with Gleason 4 + 3 and Gleason 3 + 4 tumors in a

population based cohort. J Urol 182, 2702-2707

103. Bostwick, D. G., Leske, D. A., Qian, J., and Sinha, A. A. (1995) Prostatic intraepithelial

neoplasia and well differentiated adenocarcinoma maintain an intact basement membrane.

Pathology, research and practice 191, 850-855

104. Epstein, J. I., Partin, A. W., Sauvageot, J., and Walsh, P. C. (1996) Prediction of progression

following radical prostatectomy. A multivariate analysis of 721 men with long-term follow-

up. The American journal of surgical pathology 20, 286-292

152

105. Albertsen, P. C., Hanley, J. A., Gleason, D. F., and Barry, M. J. (1998) Competing risk

analysis of men aged 55 to 74 years at diagnosis managed conservatively for clinically

localized prostate cancer. JAMA 280, 975-980

106. Barry, M. J., Albertsen, P. C., Bagshaw, M. A., Blute, M. L., Cox, R., Middleton, R. G.,

Gleason, D. F., Zincke, H., Bergstralh, E. J., and Jacobsen, S. J. (2001) Outcomes for men

with clinically nonmetastatic prostate carcinoma managed with radical prostactectomy,

external beam radiotherapy, or expectant management: a retrospective analysis. Cancer 91,

2302-2314

107. Humphrey, P. A. (2004) Gleason grading and prognostic factors in carcinoma of the

prostate. Modern pathology : an official journal of the United States and Canadian

Academy of Pathology, Inc 17, 292-306

108. Ruijter, E. T., van de Kaa, C. A., Schalken, J. A., Debruyne, F. M., and Ruiter, D. J. (1996)

Histological grade heterogeneity in multifocal prostate cancer. Biological and clinical

implications. J Pathol 180, 295-299

109. Pan, C. C., Potter, S. R., Partin, A. W., and Epstein, J. I. (2000) The prognostic significance

of tertiary Gleason patterns of higher grade in radical prostatectomy specimens: a proposal

to modify the Gleason grading system. The American journal of surgical pathology 24, 563-

569

110. Srigley, J. R., Amin, M. B., Bostwick, D. G., Grignon, D. J., and Hammond, M. E. (2000)

Updated protocol for the examination of specimens from patients with carcinomas of the

prostate gland: a basis for checklists. Cancer Committee. Arch Pathol Lab Med 124, 1034-

1039

111. Prensner, J. R., Rubin, M. A., Wei, J. T., and Chinnaiyan, A. M. (2012) Beyond PSA: the

next generation of prostate cancer biomarkers. Sci Transl Med 4, 127rv123

112. Pepe, M. S., Etzioni, R., Feng, Z., Potter, J. D., Thompson, M. L., Thornquist, M., Winget,

M., and Yasui, Y. (2001) Phases of biomarker development for early detection of cancer.

Journal of the National Cancer Institute 93, 1054-1061

113. Fuzery, A. K., Levin, J., Chan, M. M., and Chan, D. W. (2013) Translation of proteomic

biomarkers into FDA approved cancer diagnostics: issues and challenges. Clin Proteomics

10, 13

114. Zweig, M. H. (1993) ROC plots display test accuracy, but are still limited by the study

design. Clin Chem 39, 1345-1346

115. Zweig, M. H., and Campbell, G. (1993) Receiver-operating characteristic (ROC) plots: a

fundamental evaluation tool in clinical medicine. Clin Chem 39, 561-577

116. Scott, M. G. (2010) When do new biomarkers make economic sense? Scandinavian journal

of clinical and laboratory investigation. Supplementum 242, 90-95

153

117. Mani, D. R., Abbatiello, S. E., and Carr, S. A. (2012) Statistical characterization of

multiple-reaction monitoring mass spectrometry (MRM-MS) assays for quantitative

proteomics. BMC Bioinformatics 13 Suppl 16, S9

118. Rifai, N., Gillette, M. A., and Carr, S. A. (2006) Protein biomarker discovery and validation:

the long and uncertain path to clinical utility. Nat Biotechnol 24, 971-983

119. Kim, Y., and Kislinger, T. (2013) Novel approaches for the identification of biomarkers of

aggressive prostate cancer. Genome medicine 5, 56

120. Rosty, C., Christa, L., Kuzdzal, S., Baldwin, W. M., Zahurak, M. L., Carnot, F., Chan, D.

W., Canto, M., Lillemoe, K. D., Cameron, J. L., Yeo, C. J., Hruban, R. H., and Goggins, M.

(2002) Identification of hepatocarcinoma-intestine-pancreas/pancreatitis-associated protein I

as a biomarker for pancreatic ductal adenocarcinoma by protein biochip technology. Cancer

research 62, 1868-1875

121. Sedlaczek, P., Frydecka, I., Gabrys, M., Van Dalen, A., Einarsson, R., and Harlozinska, A.

(2002) Comparative analysis of CA125, tissue polypeptide specific antigen, and soluble

interleukin-2 receptor alpha levels in sera, cyst, and ascitic fluids from patients with ovarian

carcinoma. Cancer 95, 1886-1893

122. Srivastava, A., Goldberger, H., Dimtchev, A., Ramalinga, M., Chijioke, J., Marian, C.,

Oermann, E. K., Uhm, S., Kim, J. S., Chen, L. N., Li, X., Berry, D. L., Kallakury, B. V.,

Chauhan, S. C., Collins, S. P., Suy, S., and Kumar, D. (2013) MicroRNA profiling in

prostate cancer--the diagnostic potential of urinary miR-205 and miR-214. PLoS One 8,

e76994

123. Lee, B., Mazar, J., Aftab, M. N., Qi, F., Shelley, J., Li, J. L., Govindarajan, S., Valerio, F.,

Rivera, I., Thurn, T., Tran, T. A., Kameh, D., Patel, V., and Perera, R. J. (2014) Long

noncoding RNAs as putative biomarkers for prostate cancer detection. The Journal of

molecular diagnostics : JMD 16, 615-626

124. Adachi, J., Kumar, C., Zhang, Y., Olsen, J. V., and Mann, M. (2006) The human urinary

proteome contains more than 1500 proteins, including a large proportion of membrane

proteins. Genome Biol 7, R80

125. Gonzales, P. A., Zhou, H., Pisitkun, T., Wang, N. S., Star, R. A., Knepper, M. A., and Yuen,

P. S. (2010) Isolation and purification of exosomes in urine. Methods in molecular biology

641, 89-99

126. Principe, S., Jones, E. E., Kim, Y., Sinha, A., Nyalwidhe, J. O., Brooks, J., Semmes, O. J.,

Troyer, D. A., Lance, R. S., Kislinger, T., and Drake, R. R. (2013) In-depth proteomic

analyses of exosomes isolated from expressed prostatic secretions in urine. Proteomics 13,

1667-1671

127. Principe, S., Kim, Y., Fontana, S., Ignatchenko, V., Nyalwidhe, J. O., Lance, R. S., Troyer,

D. A., Alessandro, R., Semmes, O. J., Kislinger, T., Drake, R. R., and Medin, J. A. (2012)

Identification of prostate-enriched proteins by in-depth proteomic analyses of expressed

prostatic secretions in urine. J Proteome Res 11, 2386-2396

154

128. Theodorescu, D., Schiffer, E., Bauer, H. W., Douwes, F., Eichhorn, F., Polley, R., Schmidt,

T., Schofer, W., Zurbig, P., Good, D. M., Coon, J. J., and Mischak, H. (2008) Discovery and

validation of urinary biomarkers for prostate cancer. Proteomics Clin Appl 2, 556-570

129. Kulasingam, V., and Diamandis, E. P. (2008) Strategies for discovering novel cancer

biomarkers through utilization of emerging technologies. Nature clinical practice. Oncology

5, 588-599

130. Jarjanazi, H., Savas, S., Pabalan, N., Dennis, J. W., and Ozcelik, H. (2008) Biological

implications of SNPs in signal peptide domains of human proteins. Proteins 70, 394-403

131. Kay, P., Yang, Y. C., and Paraoan, L. (2013) Directional protein secretion by the retinal

pigment epithelium: roles in retinal health and the development of age-related macular

degeneration. Journal of cellular and molecular medicine 17, 833-843

132. Scholzen, T., and Gerdes, J. (2000) The Ki-67 protein: from the known and the unknown. J

Cell Physiol 182, 311-322

133. Berney, D. M., Gopalan, A., Kudahetti, S., Fisher, G., Ambroisine, L., Foster, C. S., Reuter,

V., Eastham, J., Moller, H., Kattan, M. W., Gerald, W., Cooper, C., Scardino, P., and

Cuzick, J. (2009) Ki-67 and outcome in clinically localised prostate cancer: analysis of

conservatively treated prostate cancer patients from the Trans-Atlantic Prostate Group

study. Br J Cancer 100, 888-893

134. Khor, L. Y., Bae, K., Paulus, R., Al-Saleem, T., Hammond, M. E., Grignon, D. J., Che, M.,

Venkatesan, V., Byhardt, R. W., Rotman, M., Hanks, G. E., Sandler, H. M., and Pollack, A.

(2009) MDM2 and Ki-67 predict for distant metastasis and mortality in men treated with

radiotherapy and androgen deprivation for prostate cancer: RTOG 92-02. J Clin Oncol 27,

3177-3184

135. Bettencourt, M. C., Bauer, J. J., Sesterhenn, I. A., Mostofi, F. K., McLeod, D. G., and Moul,

J. W. (1996) Ki-67 expression is a prognostic marker of prostate cancer recurrence after

radical prostatectomy. J Urol 156, 1064-1068

136. Fisher, G., Yang, Z. H., Kudahetti, S., Moller, H., Scardino, P., Cuzick, J., and Berney, D.

M. (2013) Prognostic value of Ki-67 for prostate cancer death in a conservatively managed

cohort. Br J Cancer 108, 271-277

137. Murphy, A. J., Hughes, C. A., Lannigan, G., Sheils, O., O'Leary, J., and Loftus, B. (2007)

Heterogeneous expression of alpha-methylacyl-CoA racemase in prostatic cancer correlates

with Gleason score. Histopathology 50, 243-251

138. Rubin, M. A., Bismar, T. A., Andren, O., Mucci, L., Kim, R., Shen, R., Ghosh, D., Wei, J.

T., Chinnaiyan, A. M., Adami, H. O., Kantoff, P. W., and Johansson, J. E. (2005) Decreased

alpha-methylacyl CoA racemase expression in localized prostate cancer is associated with

an increased rate of biochemical recurrence and cancer-specific death. Cancer Epidemiol

Biomarkers Prev 14, 1424-1432

155

139. Chang, S. S. (2004) Overview of prostate-specific membrane antigen. Rev Urol 6 Suppl 10,

S13-18

140. Ross, J. S., Sheehan, C. E., Fisher, H. A., Kaufman, R. P., Jr., Kaur, P., Gray, K., Webb, I.,

Gray, G. S., Mosher, R., and Kallakury, B. V. (2003) Correlation of primary tumor prostate-

specific membrane antigen expression with disease recurrence in prostate cancer. Clin

Cancer Res 9, 6357-6362

141. Perner, S., Hofer, M. D., Kim, R., Shah, R. B., Li, H., Moller, P., Hautmann, R. E.,

Gschwend, J. E., Kuefer, R., and Rubin, M. A. (2007) Prostate-specific membrane antigen

expression as a predictor of prostate cancer progression. Human pathology 38, 696-701

142. Cuzick, J., Swanson, G. P., Fisher, G., Brothman, A. R., Berney, D. M., Reid, J. E., Mesher,

D., Speights, V. O., Stankiewicz, E., Foster, C. S., Moller, H., Scardino, P., Warren, J. D.,

Park, J., Younus, A., Flake, D. D., 2nd, Wagner, S., Gutin, A., Lanchbury, J. S., and Stone,

S. (2011) Prognostic value of an RNA expression signature derived from cell cycle

proliferation genes in patients with prostate cancer: a retrospective study. Lancet Oncol 12,

245-255

143. Ding, Z., Wu, C. J., Chu, G. C., Xiao, Y., Ho, D., Zhang, J., Perry, S. R., Labrot, E. S., Wu,

X., Lis, R., Hoshida, Y., Hiller, D., Hu, B., Jiang, S., Zheng, H., Stegh, A. H., Scott, K. L.,

Signoretti, S., Bardeesy, N., Wang, Y. A., Hill, D. E., Golub, T. R., Stampfer, M. J., Wong,

W. H., Loda, M., Mucci, L., Chin, L., and DePinho, R. A. (2011) SMAD4-dependent barrier

constrains prostate cancer growth and metastatic progression. Nature 470, 269-273

144. Lalonde, E., Ishkanian, A. S., Sykes, J., Fraser, M., Ross-Adams, H., Erho, N., Dunning, M.

J., Halim, S., Lamb, A. D., Moon, N. C., Zafarana, G., Warren, A. Y., Meng, X., Thoms, J.,

Grzadkowski, M. R., Berlin, A., Have, C. L., Ramnarine, V. R., Yao, C. Q., Malloff, C. A.,

Lam, L. L., Xie, H., Harding, N. J., Mak, D. Y., Chu, K. C., Chong, L. C., Sendorek, D. H.,

P'ng, C., Collins, C. C., Squire, J. A., Jurisica, I., Cooper, C., Eeles, R., Pintilie, M., Dal Pra,

A., Davicioni, E., Lam, W. L., Milosevic, M., Neal, D. E., van der Kwast, T., Boutros, P. C.,

and Bristow, R. G. (2014) Tumour genomic and microenvironmental heterogeneity for

integrated prediction of 5-year biochemical recurrence of prostate cancer: a retrospective

cohort study. Lancet Oncol 15, 1521-1532

145. Penney, K. L., Sinnott, J. A., Fall, K., Pawitan, Y., Hoshida, Y., Kraft, P., Stark, J. R.,

Fiorentino, M., Perner, S., Finn, S., Calza, S., Flavin, R., Freedman, M. L., Setlur, S., Sesso,

H. D., Andersson, S. O., Martin, N., Kantoff, P. W., Johansson, J. E., Adami, H. O., Rubin,

M. A., Loda, M., Golub, T. R., Andren, O., Stampfer, M. J., and Mucci, L. A. (2011)

mRNA expression signature of Gleason grade predicts lethal prostate cancer. J Clin Oncol

29, 2391-2396

146. Cottrell, S., Jung, K., Kristiansen, G., Eltze, E., Semjonow, A., Ittmann, M., Hartmann, A.,

Stamey, T., Haefliger, C., and Weiss, G. (2007) Discovery and validation of 3 novel DNA

methylation markers of prostate cancer prognosis. J Urol 177, 1753-1758

147. Porkka, K. P., Pfeiffer, M. J., Waltering, K. K., Vessella, R. L., Tammela, T. L., and

Visakorpi, T. (2007) MicroRNA expression profiling in prostate cancer. Cancer research

67, 6130-6135

156

148. Mitchell, P. S., Parkin, R. K., Kroh, E. M., Fritz, B. R., Wyman, S. K., Pogosova-

Agadjanyan, E. L., Peterson, A., Noteboom, J., O'Briant, K. C., Allen, A., Lin, D. W.,

Urban, N., Drescher, C. W., Knudsen, B. S., Stirewalt, D. L., Gentleman, R., Vessella, R.

L., Nelson, P. S., Martin, D. B., and Tewari, M. (2008) Circulating microRNAs as stable

blood-based markers for cancer detection. Proc Natl Acad Sci U S A 105, 10513-10518

149. Khan, A. P., Poisson, L. M., Bhat, V. B., Fermin, D., Zhao, R., Kalyana-Sundaram, S.,

Michailidis, G., Nesvizhskii, A. I., Omenn, G. S., Chinnaiyan, A. M., and Sreekumar, A.

(2010) Quantitative proteomic profiling of prostate cancer reveals a role for miR-128 in

prostate cancer. Mol Cell Proteomics 9, 298-312

150. Gupta, A., Lotan, Y., Ashfaq, R., Roehrborn, C. G., Raj, G. V., Aragaki, C. C., Montorsi, F.,

and Shariat, S. F. (2009) Predictive value of the differential expression of the urokinase

plasminogen activation axis in radical prostatectomy patients. Eur Urol 55, 1124-1133

151. Shariat, S. F., Menesses-Diaz, A., Kim, I. Y., Muramoto, M., Wheeler, T. M., and Slawin,

K. M. (2004) Tissue expression of transforming growth factor-beta1 and its receptors:

correlation with pathologic features and biochemical progression in patients undergoing

radical prostatectomy. Urology 63, 1191-1197

152. Shariat, S. F., Kattan, M. W., Traxel, E., Andrews, B., Zhu, K., Wheeler, T. M., and Slawin,

K. M. (2004) Association of pre- and postoperative plasma levels of transforming growth

factor beta(1) and interleukin 6 and its soluble receptor with prostate cancer progression.

Clin Cancer Res 10, 1992-1999

153. Wood, D. P., Jr., and Banerjee, M. (1997) Presence of circulating prostate cells in the bone

marrow of patients undergoing radical prostatectomy is predictive of disease-free survival. J

Clin Oncol 15, 3451-3457

154. Berg, A., Berner, A., Lilleby, W., Bruland, O. S., Fossa, S. D., Nesland, J. M., and

Kvalheim, G. (2007) Impact of disseminated tumor cells in bone marrow at diagnosis in

patients with nonmetastatic prostate cancer treated by definitive radiotherapy. Int J Cancer

120, 1603-1609

155. Kantoff, P. W., Halabi, S., Farmer, D. A., Hayes, D. F., Vogelzang, N. A., and Small, E. J.

(2001) Prognostic significance of reverse transcriptase polymerase chain reaction for

prostate-specific antigen in men with hormone-refractory prostate cancer. J Clin Oncol 19,

3025-3028

156. Goodman, O. B., Jr., Fink, L. M., Symanowski, J. T., Wong, B., Grobaski, B., Pomerantz,

D., Ma, Y., Ward, D. C., and Vogelzang, N. J. (2009) Circulating tumor cells in patients

with castration-resistant prostate cancer baseline values and correlation with prognostic

factors. Cancer Epidemiol Biomarkers Prev 18, 1904-1913

157. Bussemakers, M. J., van Bokhoven, A., Verhaegh, G. W., Smit, F. P., Karthaus, H. F.,

Schalken, J. A., Debruyne, F. M., Ru, N., and Isaacs, W. B. (1999) DD3: a new prostate-

specific gene, highly overexpressed in prostate cancer. Cancer research 59, 5975-5979

157

158. Groskopf, J., Aubin, S. M., Deras, I. L., Blase, A., Bodrug, S., Clark, C., Brentano, S.,

Mathis, J., Pham, J., Meyer, T., Cass, M., Hodge, P., Macairan, M. L., Marks, L. S., and

Rittenhouse, H. (2006) APTIMA PCA3 molecular urine test: development of a method to

aid in the diagnosis of prostate cancer. Clin Chem 52, 1089-1095

159. Nakanishi, H., Groskopf, J., Fritsche, H. A., Bhadkamkar, V., Blase, A., Kumar, S. V.,

Davis, J. W., Troncoso, P., Rittenhouse, H., and Babaian, R. J. (2008) PCA3 molecular

urine assay correlates with prostate cancer tumor volume: implication in selecting

candidates for active surveillance. J Urol 179, 1804-1809; discussion 1809-1810

160. Whitman, E. J., Groskopf, J., Ali, A., Chen, Y., Blase, A., Furusato, B., Petrovics, G.,

Ibrahim, M., Elsamanoudi, S., Cullen, J., Sesterhenn, I. A., Brassell, S., Rittenhouse, H.,

Srivastava, S., and McLeod, D. G. (2008) PCA3 score before radical prostatectomy predicts

extracapsular extension and tumor volume. J Urol 180, 1975-1978; discussion 1978-1979

161. Augustin, H., Mayrhofer, K., Pummer, K., and Mannweiler, S. (2013) Relationship between

prostate cancer gene 3 (PCA3) and characteristics of tumor aggressiveness. The Prostate 73,

203-210

162. Laxman, B., Tomlins, S. A., Mehra, R., Morris, D. S., Wang, L., Helgeson, B. E., Shah, R.

B., Rubin, M. A., Wei, J. T., and Chinnaiyan, A. M. (2006) Noninvasive detection of

TMPRSS2:ERG fusion transcripts in the urine of men with prostate cancer. Neoplasia 8,

885-888

163. Hessels, D., Smit, F. P., Verhaegh, G. W., Witjes, J. A., Cornel, E. B., and Schalken, J. A.

(2007) Detection of TMPRSS2-ERG fusion transcripts and prostate cancer antigen 3 in

urinary sediments may improve diagnosis of prostate cancer. Clin Cancer Res 13, 5103-

5108

164. Shay, J. W., and Wright, W. E. (1996) Telomerase activity in human cancer. Curr Opin

Oncol 8, 66-71

165. Meid, F. H., Gygi, C. M., Leisinger, H. J., Bosman, F. T., and Benhattar, J. (2001) The use

of telomerase activity for the detection of prostatic cancer cells after prostatic massage. J

Urol 165, 1802-1805

166. Kim, N. W., and Wu, F. (1997) Advances in quantification and characterization of

telomerase activity by the telomeric repeat amplification protocol (TRAP). Nucleic Acids

Res 25, 2595-2597

167. Schostak, M., Schwall, G. P., Poznanovic, S., Groebe, K., Muller, M., Messinger, D.,

Miller, K., Krause, H., Pelzer, A., Horninger, W., Klocker, H., Hennenlotter, J., Feyerabend,

S., Stenzl, A., and Schrattenholz, A. (2009) Annexin A3 in urine: a highly specific

noninvasive marker for prostate cancer early detection. J Urol 181, 343-353

168. Egeblad, M., and Werb, Z. (2002) New functions for the matrix metalloproteinases in cancer

progression. Nat Rev Cancer 2, 161-174

158

169. Roy, R., Louis, G., Loughlin, K. R., Wiederschain, D., Kilroy, S. M., Lamb, C. C.,

Zurakowski, D., and Moses, M. A. (2008) Tumor-specific urinary matrix metalloproteinase

fingerprinting: identification of high molecular weight urinary matrix metalloproteinase

species. Clin Cancer Res 14, 6610-6617

170. Di Carlo, A., Mariano, A., Terracciano, D., Ferro, M., Montanaro, V., Marsicano, M., Di

Lorenzo, G., Altieri, V., and Macchia, V. (2010) Matrix metalloproteinase-2 and -9 in the

urine of prostate cancer patients. Oncol Rep 24, 3-8

171. Zhou, H., Yuen, P. S., Pisitkun, T., Gonzales, P. A., Yasuda, H., Dear, J. W., Gross, P.,

Knepper, M. A., and Star, R. A. (2006) Collection, storage, preservation, and normalization

of human urinary exosomes for biomarker discovery. Kidney Int 69, 1471-1476

172. Duijvesz, D., Luider, T., Bangma, C. H., and Jenster, G. (2011) Exosomes as biomarker

treasure chests for prostate cancer. Eur Urol 59, 823-831

173. Nilsson, J., Skog, J., Nordstrand, A., Baranov, V., Mincheva-Nilsson, L., Breakefield, X. O.,

and Widmark, A. (2009) Prostate cancer-derived urine exosomes: a novel approach to

biomarkers for prostate cancer. Br J Cancer 100, 1603-1607

174. Vlassov, A. V., Magdaleno, S., Setterquist, R., and Conrad, R. (2012) Exosomes: current

knowledge of their composition, biological functions, and diagnostic and therapeutic

potentials. Biochim Biophys Acta 1820, 940-948

175. Wang, Z., Hill, S., Luther, J. M., Hachey, D. L., and Schey, K. L. (2012) Proteomic analysis

of urine exosomes by multidimensional protein identification technology (MudPIT).

Proteomics 12, 329-338

176. Mitchell, P. J., Welton, J., Staffurth, J., Court, J., Mason, M. D., Tabi, Z., and Clayton, A.

(2009) Can urinary exosomes act as treatment response markers in prostate cancer? J Transl

Med 7, 4

177. Skog, J., Wurdinger, T., van Rijn, S., Meijer, D. H., Gainche, L., Sena-Esteves, M., Curry,

W. T., Jr., Carter, B. S., Krichevsky, A. M., and Breakefield, X. O. (2008) Glioblastoma

microvesicles transport RNA and proteins that promote tumour growth and provide

diagnostic biomarkers. Nat Cell Biol 10, 1470-1476

178. Taylor, D. D., and Gercel-Taylor, C. (2008) MicroRNA signatures of tumor-derived

exosomes as diagnostic biomarkers of ovarian cancer. Gynecol Oncol 110, 13-21

179. Valadi, H., Ekstrom, K., Bossios, A., Sjostrand, M., Lee, J. J., and Lotvall, J. O. (2007)

Exosome-mediated transfer of mRNAs and microRNAs is a novel mechanism of genetic

exchange between cells. Nat Cell Biol 9, 654-659

180. Wasinger, V. C., Cordwell, S. J., Cerpa-Poljak, A., Yan, J. X., Gooley, A. A., Wilkins, M.

R., Duncan, M. W., Harris, R., Williams, K. L., and Humphery-Smith, I. (1995) Progress

with gene-product mapping of the Mollicutes: Mycoplasma genitalium. Electrophoresis 16,

1090-1094

159

181. Camerini, S., and Mauri, P. (2015) The role of protein and peptide separation before mass

spectrometry analysis in clinical proteomics. Journal of chromatography. A 1381, 1-12

182. Wu, S., Lourette, N. M., Tolic, N., Zhao, R., Robinson, E. W., Tolmachev, A. V., Smith, R.

D., and Pasa-Tolic, L. (2009) An integrated top-down and bottom-up strategy for broadly

characterizing protein isoforms and modifications. J Proteome Res 8, 1347-1357

183. Zhang, Y., Fonslow, B. R., Shan, B., Baek, M. C., and Yates, J. R., 3rd (2013) Protein

analysis by shotgun/bottom-up proteomics. Chem Rev 113, 2343-2394

184. Wolters, D. A., Washburn, M. P., and Yates, J. R., 3rd (2001) An automated

multidimensional protein identification technology for shotgun proteomics. Analytical

chemistry 73, 5683-5690

185. Yates, J. R., 3rd (2004) Mass spectral analysis in proteomics. Annual review of biophysics

and biomolecular structure 33, 297-316

186. Link, A. J., Eng, J., Schieltz, D. M., Carmack, E., Mize, G. J., Morris, D. R., Garvik, B. M.,

and Yates, J. R., 3rd (1999) Direct analysis of protein complexes using mass spectrometry.

Nat Biotechnol 17, 676-682

187. Aebersold, R., and Mann, M. (2003) Mass spectrometry-based proteomics. Nature 422, 198-

207

188. Fenn, J. B., Mann, M., Meng, C. K., Wong, S. F., and Whitehouse, C. M. (1989)

Electrospray ionization for mass spectrometry of large biomolecules. Science 246, 64-71

189. Tanaka, K., Waki, H., Ido, Y., Akita, S., Yoshida, Y., Yoshida, T., and Matsuo, T. (1988)

Protein and polymer analyses up to m/z 100 000 by laser ionization time-of-flight mass

spectrometry. Rapid Commun. Mass Spectrom. 2, 151-153

190. Wiley, W. C., and Mclaren, I. H. (1955) Time-of-Flight Mass Spectrometer with Improved

Resolution. Rev Sci Instrum 26, 1150-1157

191. Katzenstein, H. S., and Friedland, S. S. (1955) New Time-of-Flight Mass Spectrometer. Rev

Sci Instrum 26, 324-327

192. Wolff, M. M., and Stephens, W. E. (1953) A Pulsed Mass Spectrometer with Time

Dispersion. Rev Sci Instrum 24, 616-617

193. Cameron, A. E., and Eggers, D. F. (1948) An Ion Velocitron. Rev Sci Instrum 19, 605-607

194. Stephens, W. E. (1946) A Pulsed Mass Spectrometer with Time Dispersion. Phys Rev 69,

691-691

195. Paul, W., and Steinwedel, H. (1953) *Ein Neues Massenspektrometer Ohne Magnetfeld. Z

Naturforsch A 8, 448-450

160

196. Steen, H., and Mann, M. (2004) The ABC's (and XYZ's) of peptide sequencing. Nature

reviews. Molecular cell biology 5, 699-711

197. Hu, Q. Z., Noll, R. J., Li, H. Y., Makarov, A., Hardman, M., and Cooks, R. G. (2005) The

Orbitrap: a new mass spectrometer. J Mass Spectrom 40, 430-443

198. Hardman, M., and Makarov, A. A. (2003) Interfacing the orbitrap mass analyzer to an

electrospray ion source. Analytical chemistry 75, 1699-1705

199. Makarov, A. (2000) Electrostatic axially harmonic orbital trapping: a high-performance

technique of mass analysis. Analytical chemistry 72, 1156-1162

200. Senko, M. W., Canterbury, J. D., Guan, S., and Marshall, A. G. (1996) A high-performance

modular data system for Fourier transform ion cyclotron resonance mass spectrometry.

Rapid Commun Mass Spectrom 10, 1839-1844

201. Marshall, A. G., Hendrickson, C. L., and Jackson, G. S. (1998) Fourier transform ion

cyclotron resonance mass spectrometry: A primer. Mass Spectrom Rev 17, 1-35

202. Liu, H., Sadygov, R. G., and Yates, J. R., 3rd (2004) A model for random sampling and

estimation of relative protein abundance in shotgun proteomics. Analytical chemistry 76,

4193-4201

203. Washburn, M. P., Wolters, D., and Yates, J. R., 3rd (2001) Large-scale analysis of the yeast

proteome by multidimensional protein identification technology. Nat Biotechnol 19, 242-

247

204. Taylor, P., Nielsen, P. A., Trelle, M. B., Horning, O. B., Andersen, M. B., Vorm, O., Moran,

M. F., and Kislinger, T. (2009) Automated 2D peptide separation on a 1D nano-LC-MS

system. J Proteome Res 8, 1610-1616

205. Taylor, P., Nielsen, P. A., Trelle, M. B., Horning, O. B., Andersen, M. B., Vorm, O., Moran,

M. F., and Kislinger, T. (2009) Automated 2D Peptide Separation on a 1D Nano-LC-MS

System. Journal of Proteome Research 8, 1610-1616

206. Maiolica, A., Junger, M. A., Ezkurdia, I., and Aebersold, R. (2012) Targeted proteome

investigation via selected reaction monitoring mass spectrometry. J Proteomics 75, 3495-

3513

207. Sadygov, R. G., Cociorva, D., and Yates, J. R., 3rd (2004) Large-scale database searching

using tandem mass spectra: looking up the answer in the back of the book. Nature methods

1, 195-202

208. Schiess, R., Wollscheid, B., and Aebersold, R. (2009) Targeted proteomic strategy for

clinical biomarker discovery. Molecular oncology 3, 33-44

209. Anderson, N. L., and Anderson, N. G. (2002) The human plasma proteome: history,

character, and diagnostic prospects. Mol Cell Proteomics 1, 845-867

161

210. Brand, J., Haslberger, T., Zolg, W., Pestlin, G., and Palme, S. (2006) Depletion efficiency

and recovery of trace markers from a multiparameter immunodepletion column. Proteomics

6, 3236-3242

211. Fonslow, B. R., Stein, B. D., Webb, K. J., Xu, T., Choi, J., Park, S. K., and Yates, J. R., 3rd

(2013) Digestion and depletion of abundant proteins improves proteomic coverage. Nature

methods 10, 54-56

212. Huttenhain, R., Surinova, S., Ossola, R., Sun, Z., Campbell, D., Cerciello, F., Schiess, R.,

Bausch-Fluck, D., Rosenberger, G., Chen, J., Rinner, O., Kusebauch, U., Hajduch, M.,

Moritz, R. L., Wollscheid, B., and Aebersold, R. (2013) N-glycoprotein SRMAtlas: a

resource of mass spectrometric assays for N-glycosites enabling consistent and multiplexed

protein quantification for clinical applications. Mol Cell Proteomics 12, 1005-1016

213. Gilchrist, A., Au, C. E., Hiding, J., Bell, A. W., Fernandez-Rodriguez, J., Lesimple, S.,

Nagaya, H., Roy, L., Gosline, S. J., Hallett, M., Paiement, J., Kearney, R. E., Nilsson, T.,

and Bergeron, J. J. (2006) Quantitative proteomics analysis of the secretory pathway. Cell

127, 1265-1281

214. Cutillas, P. R., and Vanhaesebroeck, B. (2007) Quantitative profile of five murine core

proteomes using label-free functional proteomics. Mol Cell Proteomics 6, 1560-1573

215. Old, W. M., Meyer-Arendt, K., Aveline-Wolf, L., Pierce, K. G., Mendoza, A., Sevinsky, J.

R., Resing, K. A., and Ahn, N. G. (2005) Comparison of label-free methods for quantifying

human proteins by shotgun proteomics. Mol Cell Proteomics 4, 1487-1502

216. Chelius, D., and Bondarenko, P. V. (2002) Quantitative profiling of proteins in complex

mixtures using liquid chromatography and mass spectrometry. J Proteome Res 1, 317-323

217. Wang, W., Zhou, H., Lin, H., Roy, S., Shaler, T. A., Hill, L. R., Norton, S., Kumar, P.,

Anderle, M., and Becker, C. H. (2003) Quantification of proteins and metabolites by mass

spectrometry without isotopic labeling or spiked standards. Analytical chemistry 75, 4818-

4826

218. Wang, M., You, J., Bemis, K. G., Tegeler, T. J., and Brown, D. P. (2008) Label-free mass

spectrometry-based protein quantification technologies in proteomic analysis. Brief Funct

Genomic Proteomic 7, 329-339

219. Bondarenko, P. V., Chelius, D., and Shaler, T. A. (2002) Identification and relative

quantitation of protein mixtures by enzymatic digestion followed by capillary reversed-

phase liquid chromatography-tandem mass spectrometry. Analytical chemistry 74, 4741-

4749

220. Wang, G., Wu, W. W., Zeng, W., Chou, C. L., and Shen, R. F. (2006) Label-free protein

quantification using LC-coupled ion trap or FT mass spectrometry: Reproducibility,

linearity, and application with complex proteomes. J Proteome Res 5, 1214-1223

162

221. Higgs, R. E., Knierman, M. D., Gelfanova, V., Butler, J. P., and Hale, J. E. (2005)

Comprehensive label-free method for the relative quantification of proteins from biological

samples. J Proteome Res 4, 1442-1450

222. Ono, M., Shitashige, M., Honda, K., Isobe, T., Kuwabara, H., Matsuzuki, H., Hirohashi, S.,

and Yamada, T. (2006) Label-free quantitative proteomics using large peptide data sets

generated by nanoflow liquid chromatography and mass spectrometry. Mol Cell Proteomics

5, 1338-1347

223. Wiener, M. C., Sachs, J. R., Deyanova, E. G., and Yates, N. A. (2004) Differential mass

spectrometry: a label-free LC-MS method for finding significant differences in complex

peptide and protein mixtures. Analytical chemistry 76, 6085-6096

224. Ong, S. E., Blagoev, B., Kratchmarova, I., Kristensen, D. B., Steen, H., Pandey, A., and

Mann, M. (2002) Stable isotope labeling by amino acids in cell culture, SILAC, as a simple

and accurate approach to expression proteomics. Mol Cell Proteomics 1, 376-386

225. Gygi, S. P., Rist, B., Gerber, S. A., Turecek, F., Gelb, M. H., and Aebersold, R. (1999)

Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat

Biotechnol 17, 994-999

226. Ross, P. L., Huang, Y. N., Marchese, J. N., Williamson, B., Parker, K., Hattan, S.,

Khainovski, N., Pillai, S., Dey, S., Daniels, S., Purkayastha, S., Juhasz, P., Martin, S.,

Bartlet-Jones, M., He, F., Jacobson, A., and Pappin, D. J. (2004) Multiplexed protein

quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents.

Mol Cell Proteomics 3, 1154-1169

227. Thompson, A., Schafer, J., Kuhn, K., Kienle, S., Schwarz, J., Schmidt, G., Neumann, T.,

Johnstone, R., Mohammed, A. K., and Hamon, C. (2003) Tandem mass tags: a novel

quantification strategy for comparative analysis of complex protein mixtures by MS/MS.

Analytical chemistry 75, 1895-1904

228. Bowers, G. N., Jr., Fassett, J. D., and White, E. t. (1993) Isotope dilution mass spectrometry

and the National Reference System. Analytical chemistry 65, 475R-479R

229. Viswanathan, C. T., Bansal, S., Booth, B., DeStefano, A. J., Rose, M. J., Sailstad, J., Shah,

V. P., Skelly, J. P., Swann, P. G., and Weiner, R. (2007) Quantitative bioanalytical methods

validation and implementation: best practices for chromatographic and ligand binding

assays. Pharmaceutical research 24, 1962-1973

230. Gerber, S. A., Rush, J., Stemman, O., Kirschner, M. W., and Gygi, S. P. (2003) Absolute

quantification of proteins and phosphoproteins from cell lysates by tandem MS. Proc Natl

Acad Sci U S A 100, 6940-6945

231. Beynon, R. J., Doherty, M. K., Pratt, J. M., and Gaskell, S. J. (2005) Multiplexed absolute

quantification in proteomics using artificial QCAT proteins of concatenated signature

peptides. Nature methods 2, 587-589

163

232. Brun, V., Dupuis, A., Adrait, A., Marcellin, M., Thomas, D., Court, M., Vandenesch, F.,

and Garin, J. (2007) Isotope-labeled protein standards: toward absolute quantitative

proteomics. Mol Cell Proteomics 6, 2139-2149

233. Coppinger, J. A., Hutt, D. M., Razvi, A., Koulov, A. V., Pankow, S., Yates, J. R., 3rd, and

Balch, W. E. (2012) A chaperone trap contributes to the onset of cystic fibrosis. PLoS One

7, e37682

234. Hanke, S., Besir, H., Oesterhelt, D., and Mann, M. (2008) Absolute SILAC for accurate

quantitation of proteins in complex mixtures down to the attomole level. J Proteome Res 7,

1118-1130

235. Arsene, C. G., Ohlendorf, R., Burkitt, W., Pritchard, C., Henrion, A., O'Connor, G., Bunk,

D. M., and Guttler, B. (2008) Protein quantification by isotope dilution mass spectrometry

of proteolytic fragments: cleavage rate and accuracy. Analytical chemistry 80, 4154-4160

236. Kondrat, K. L., McClusky, G. A., and Cooks, R. G. (1978) Multiple reaction monitoring

mass spectrometry/mass spectrometry for direct analysis of complex mixtures. Analytical

chemistry, 2017-2021

237. Anderson, L., and Hunter, C. L. (2006) Quantitative mass spectrometric multiple reaction

monitoring assays for major plasma proteins. Mol Cell Proteomics 5, 573-588

238. Keshishian, H., Addona, T., Burgess, M., Kuhn, E., and Carr, S. A. (2007) Quantitative,

multiplexed assays for low abundance proteins in plasma by targeted mass spectrometry and

stable isotope dilution. Mol Cell Proteomics 6, 2212-2229

239. Addona, T. A., Abbatiello, S. E., Schilling, B., Skates, S. J., Mani, D. R., Bunk, D. M.,

Spiegelman, C. H., Zimmerman, L. J., Ham, A. J., Keshishian, H., Hall, S. C., Allen, S.,

Blackman, R. K., Borchers, C. H., Buck, C., Cardasis, H. L., Cusack, M. P., Dodder, N. G.,

Gibson, B. W., Held, J. M., Hiltke, T., Jackson, A., Johansen, E. B., Kinsinger, C. R., Li, J.,

Mesri, M., Neubert, T. A., Niles, R. K., Pulsipher, T. C., Ransohoff, D., Rodriguez, H.,

Rudnick, P. A., Smith, D., Tabb, D. L., Tegeler, T. J., Variyath, A. M., Vega-Montoto, L. J.,

Wahlander, A., Waldemarson, S., Wang, M., Whiteaker, J. R., Zhao, L., Anderson, N. L.,

Fisher, S. J., Liebler, D. C., Paulovich, A. G., Regnier, F. E., Tempst, P., and Carr, S. A.

(2009) Multi-site assessment of the precision and reproducibility of multiple reaction

monitoring-based measurements of proteins in plasma. Nat Biotechnol 27, 633-641

240. Cima, I., Schiess, R., Wild, P., Kaelin, M., Schuffler, P., Lange, V., Picotti, P., Ossola, R.,

Templeton, A., Schubert, O., Fuchs, T., Leippold, T., Wyler, S., Zehetner, J., Jochum, W.,

Buhmann, J., Cerny, T., Moch, H., Gillessen, S., Aebersold, R., and Krek, W. (2011) Cancer

genetics-guided discovery of serum biomarker signatures for diagnosis and prognosis of

prostate cancer. Proc Natl Acad Sci U S A 108, 3342-3347

241. Elschenbroich, S., Ignatchenko, V., Clarke, B., Kalloger, S. E., Boutros, P. C., Gramolini,

A. O., Shaw, P., Jurisica, I., and Kislinger, T. (2011) In-depth proteomics of ovarian cancer

ascites: combining shotgun proteomics and selected reaction monitoring mass spectrometry.

J Proteome Res 10, 2286-2299

164

242. Selevsek, N., Matondo, M., Sanchez Carbayo, M., Aebersold, R., and Domon, B. (2011)

Systematic quantification of peptides/proteins in urine using selected reaction monitoring.


243. Wei, Y., Tong, J., Taylor, P., Strumpf, D., Ignatchenko, V., Pham, N. A., Yanagawa, N.,

Liu, G., Jurisica, I., Shepherd, F. A., Tsao, M. S., Kislinger, T., and Moran, M. F. (2011)

Primary tumor xenografts of human lung adeno and squamous cell carcinoma express

distinct proteomic signatures. J Proteome Res 10, 161-174

244. Kim, Y., Ignatchenko, V., Yao, C. Q., Kalatskaya, I., Nyalwidhe, J. O., Lance, R. S.,

Gramolini, A. O., Troyer, D. A., Stein, L. D., Boutros, P. C., Medin, J. A., Semmes, O. J.,

Drake, R. R., and Kislinger, T. (2012) Identification of differentially expressed proteins in

direct expressed prostatic secretions of men with organ-confined versus extracapsular

prostate cancer. Mol Cell Proteomics 11, 1870-1884

245. Mallick, P., Schirle, M., Chen, S. S., Flory, M. R., Lee, H., Martin, D., Ranish, J., Raught,

B., Schmitt, R., Werner, T., Kuster, B., and Aebersold, R. (2007) Computational prediction

of proteotypic peptides for quantitative proteomics. Nat Biotechnol 25, 125-131

246. Rost, H., Malmstrom, L., and Aebersold, R. (2012) A computational tool to detect and avoid

redundancy in selected reaction monitoring. Mol Cell Proteomics 11, 540-549

247. Stahl-Zeng, J., Lange, V., Ossola, R., Eckhardt, K., Krek, W., Aebersold, R., and Domon, B.

(2007) High sensitivity detection of plasma proteins by multiple reaction monitoring of N-

glycosites. Mol Cell Proteomics 6, 1809-1817

248. Wang, P., Whiteaker, J. R., and Paulovich, A. G. (2009) The evolving role of mass

spectrometry in cancer biomarker discovery. Cancer biology & therapy 8, 1083-1094

249. Zhi, W., Sharma, A., Purohit, S., Miller, E., Bode, B., Anderson, S. W., Reed, J. C., Steed,

R. D., Steed, L., Hopkins, D., and She, J. X. (2011) Discovery and validation of serum

protein changes in type 1 diabetes patients using high throughput two dimensional liquid

chromatography-mass spectrometry and immunoassays. Mol Cell Proteomics 10, M111

012203

250. Fortin, T., Salvador, A., Charrier, J. P., Lenz, C., Lacoux, X., Morla, A., Choquet-

Kastylevsky, G., and Lemoine, J. (2009) Clinical quantitation of prostate-specific antigen

biomarker in the low nanogram/milliliter range by conventional bore liquid

chromatography-tandem mass spectrometry (multiple reaction monitoring) coupling and

correlation with ELISA tests. Mol Cell Proteomics 8, 1006-1015

251. Vegvari, A., Rezeli, M., Welinder, C., Malm, J., Lilja, H., Marko-Varga, G., and Laurell, T.

(2010) Identification of prostate-specific antigen (PSA) isoforms in complex biological

samples utilizing complementary platforms. J Proteomics 73, 1137-1147

252. Picotti, P., Bodenmiller, B., Mueller, L. N., Domon, B., and Aebersold, R. (2009) Full

dynamic range proteome analysis of S. cerevisiae by targeted proteomics. Cell 138, 795-806

165

253. Huttenhain, R., Soste, M., Selevsek, N., Rost, H., Sethi, A., Carapito, C., Farrah, T.,

Deutsch, E. W., Kusebauch, U., Moritz, R. L., Nimeus-Malmstrom, E., Rinner, O., and

Aebersold, R. (2012) Reproducible quantification of cancer-associated proteins in body

fluids using targeted proteomics. Sci Transl Med 4, 142ra194

254. Eggener, S. E., Scardino, P. T., Carroll, P. R., Zelefsky, M. J., Sartor, O., Hricak, H.,

Wheeler, T. M., Fine, S. W., Trachtenberg, J., Rubin, M. A., Ohori, M., Kuroiwa, K.,

Rossignol, M., Abenhaim, L., International Task Force on Prostate, C., and the Focal

Lesion, P. (2007) Focal therapy for localized prostate cancer: a critical appraisal of rationale

and modalities. J Urol 178, 2260-2267

255. Drake, R. R., White, K. Y., Fuller, T. W., Igwe, E., Clements, M. A., Nyalwidhe, J. O.,

Given, R. W., Lance, R. S., and Semmes, O. J. (2009) Clinical collection and protein

properties of expressed prostatic secretions as a source for biomarkers of prostatic disease. J


256. Collins, A. T., and Maitland, N. J. (2006) Prostate cancer stem cells. European journal of

cancer 42, 1213-1218

257. Deras, I. L., Aubin, S. M., Blase, A., Day, J. R., Koo, S., Partin, A. W., Ellis, W. J., Marks,

L. S., Fradet, Y., Rittenhouse, H., and Groskopf, J. (2008) PCA3: a molecular urine assay

for predicting prostate biopsy outcome. J Urol 179, 1587-1592

258. Hessels, D., and Schalken, J. A. (2009) The use of PCA3 in the diagnosis of prostate cancer.

Nat Rev Urol 6, 255-261

259. Wang, R., Chinnaiyan, A. M., Dunn, R. L., Wojno, K. J., and Wei, J. T. (2009) Rational

approach to implementation of prostate cancer antigen 3 into clinical care. Cancer 115,

3879-3886

260. Drake, R. R., Elschenbroich, S., Lopez-Perez, O., Kim, Y., Ignatchenko, V., Ignatchenko,

A., Nyalwidhe, J. O., Basu, G., Wilkins, C. E., Gjurich, B., Lance, R. S., Semmes, O. J.,

Medin, J. A., and Kislinger, T. (2010) In-depth proteomic analyses of direct expressed

prostatic secretions. J Proteome Res 9, 2109-2116

261. Wu, C., Orozco, C., Boyer, J., Leglise, M., Goodale, J., Batalov, S., Hodge, C. L., Haase, J.,

Janes, J., Huss, J. W., 3rd, and Su, A. I. (2009) BioGPS: an extensible and customizable

portal for querying and organizing gene annotation resources. Genome Biol 10, R130

262. Ponten, F., Gry, M., Fagerberg, L., Lundberg, E., Asplund, A., Berglund, L., Oksvold, P.,

Bjorling, E., Hober, S., Kampf, C., Navani, S., Nilsson, P., Ottosson, J., Persson, A.,

Wernerus, H., Wester, K., and Uhlen, M. (2009) A global view of protein expression in

human cells, tissues, and organs. Mol Syst Biol 5, 337

263. Uhlen, M., Bjorling, E., Agaton, C., Szigyarto, C. A., Amini, B., Andersen, E., Andersson,

A. C., Angelidou, P., Asplund, A., Asplund, C., Berglund, L., Bergstrom, K., Brumer, H.,

Cerjan, D., Ekstrom, M., Elobeid, A., Eriksson, C., Fagerberg, L., Falk, R., Fall, J.,

Forsberg, M., Bjorklund, M. G., Gumbel, K., Halimi, A., Hallin, I., Hamsten, C., Hansson,

M., Hedhammar, M., Hercules, G., Kampf, C., Larsson, K., Lindskog, M., Lodewyckx, W.,

166

Lund, J., Lundeberg, J., Magnusson, K., Malm, E., Nilsson, P., Odling, J., Oksvold, P.,

Olsson, I., Oster, E., Ottosson, J., Paavilainen, L., Persson, A., Rimini, R., Rockberg, J.,

Runeson, M., Sivertsson, A., Skollermo, A., Steen, J., Stenvall, M., Sterky, F., Stromberg,

S., Sundberg, M., Tegel, H., Tourle, S., Wahlund, E., Walden, A., Wan, J., Wernerus, H.,

Westberg, J., Wester, K., Wrethagen, U., Xu, L. L., Hober, S., and Ponten, F. (2005) A

human protein atlas for normal and cancer tissues based on antibody proteomics. Mol Cell


264. Li, R., Guo, Y., Han, B. M., Yan, X., Utleg, A. G., Li, W., Tu, L. C., Wang, J., Hood, L.,

Xia, S., and Lin, B. (2008) Proteomics cataloging analysis of human expressed prostatic

secretions reveals rich source of biomarker candidates. Proteomics Clin Appl 2, 543-555

265. Su, A. I., Wiltshire, T., Batalov, S., Lapp, H., Ching, K. A., Block, D., Zhang, J., Soden, R.,

Hayakawa, M., Kreiman, G., Cooke, M. P., Walker, J. R., and Hogenesch, J. B. (2004) A

gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci U

S A 101, 6062-6067

266. Ponten, F., Jirstrom, K., and Uhlen, M. (2008) The Human Protein Atlas--a tool for

pathology. J Pathol 216, 387-393

267. Pilch, B., and Mann, M. (2006) Large-scale and high-confidence proteomic analysis of

human seminal plasma. Genome Biol 7, R40

268. Sardana, G., Jung, K., Stephan, C., and Diamandis, E. P. (2008) Proteomic analysis of

conditioned media from the PC3, LNCaP, and 22Rv1 prostate cancer cell lines: discovery

and validation of candidate prostate cancer biomarkers. J Proteome Res 7, 3329-3338

269. Cox, B., Kotlyar, M., Evangelou, A. I., Ignatchenko, V., Ignatchenko, A., Whiteley, K.,

Jurisica, I., Adamson, S. L., Rossant, J., and Kislinger, T. (2009) Comparative systems

biology of human and mouse as a tool to guide the modeling of human placental pathology.

Mol Syst Biol 5, 279

270. Kislinger, T., Cox, B., Kannan, A., Chung, C., Hu, P., Ignatchenko, A., Scott, M. S.,

Gramolini, A. O., Morris, Q., Hallett, M. T., Rossant, J., Hughes, T. R., Frey, B., and Emili,

A. (2006) Global survey of organ and organelle protein expression in mouse: combined

proteomic and transcriptomic profiling. Cell 125, 173-186

271. Thongboonkerd, V. (2007) Practical points in urinary proteomics. J Proteome Res 6, 3881-

3890

272. Court, M., Selevsek, N., Matondo, M., Allory, Y., Garin, J., Masselon, C. D., and Domon,

B. (2011) Toward a standardized urine proteome analysis methodology. Proteomics Clin

Appl 5, 547

273. Ashburner, M., Ball, C. A., Blake, J. A., Botstein, D., Butler, H., Cherry, J. M., Davis, A. P.,

Dolinski, K., Dwight, S. S., Eppig, J. T., Harris, M. A., Hill, D. P., Issel-Tarver, L.,

Kasarskis, A., Lewis, S., Matese, J. C., Richardson, J. E., Ringwald, M., Rubin, G. M., and

Sherlock, G. (2000) Gene ontology: tool for the unification of biology. The Gene Ontology

Consortium. Nature genetics 25, 25-29

167

274. Kanehisa, M., and Goto, S. (2000) KEGG: kyoto encyclopedia of genes and genomes.

Nucleic Acids Res 28, 27-30

275. Choi, H., Fermin, D., and Nesvizhskii, A. I. (2008) Significance analysis of spectral count

data in label-free shotgun proteomics. Mol Cell Proteomics 7, 2373-2385

276. Cox, B., Kislinger, T., Wigle, D. A., Kannan, A., Brown, K., Okubo, T., Hogan, B., Jurisica,

I., Frey, B., Rossant, J., and Emili, A. (2007) Integrated proteomic and transcriptomic

profiling of mouse lung development and Nmyc target genes. Mol Syst Biol 3, 109

277. Gygi, S. P., Rochon, Y., Franza, B. R., and Aebersold, R. (1999) Correlation between

protein and mRNA abundance in yeast. Mol Cell Biol 19, 1720-1730

278. UniProt, C. (2010) The Universal Protein Resource (UniProt) in 2010. Nucleic Acids Res

38, D142-148

279. Whitaker, H. C., Warren, A. Y., Eeles, R., Kote-Jarai, Z., and Neal, D. E. (2010) The

potential value of microseminoprotein-beta as a prostate cancer biomarker and therapeutic

target. The Prostate 70, 333-340

280. Cooner, W. H., Mosley, B. R., Rutherford, C. L., Jr., Beard, J. H., Pond, H. S., Terry, W. J.,

Igel, T. C., and Kidd, D. D. (1990) Prostate cancer detection in a clinical urological practice

by ultrasonography, digital rectal examination and prostate specific antigen. J Urol 143,

1146-1152; discussion 1152-1144

281. Thompson, I. M., Ankerst, D. P., Chi, C., Lucia, M. S., Goodman, P. J., Crowley, J. J.,

Parnes, H. L., and Coltman, C. A., Jr. (2005) Operating characteristics of prostate-specific

antigen in men with an initial PSA level of 3.0 ng/ml or lower. JAMA 294, 66-70

282. Han, M., Piantadosi, S., Zahurak, M. L., Sokoll, L. J., Chan, D. W., Epstein, J. I., Walsh, P.

C., and Partin, A. W. (2001) Serum acid phosphatase level and biochemical recurrence

following radical prostatectomy for men with clinically localized prostate cancer. Urology

57, 707-711

283. Veeramani, S., Yuan, T. C., Chen, S. J., Lin, F. F., Petersen, J. E., Shaheduzzaman, S.,

Srivastava, S., MacDonald, R. G., and Lin, M. F. (2005) Cellular prostatic acid phosphatase:

a protein tyrosine phosphatase involved in androgen-independent proliferation of prostate

cancer. Endocr Relat Cancer 12, 805-822

284. Chen, Y. W., Lee, M. S., Lucht, A., Chou, F. P., Huang, W., Havighurst, T. C., Kim, K.,

Wang, J. K., Antalis, T. M., Johnson, M. D., and Lin, C. Y. (2010) TMPRSS2, a serine

protease expressed in the prostate on the apical surface of luminal epithelial cells and

released into semen in prostasomes, is misregulated in prostate cancer cells. The American

journal of pathology 176, 2986-2996

285. Murphy, G. P., Kenny, G. M., Ragde, H., Wolfert, R. L., Boynton, A. L., Holmes, E. H.,

Misrock, S. L., Bartsch, G., Klocker, H., Pointner, J., Reissigl, A., McLeod, D. G., Douglas,

T., Morgan, T., and Gilbaugh, J., Jr. (1998) Measurement of serum prostate-specific

membrane antigen, a new prognostic marker for prostate cancer. Urology 51, 89-97

168

286. Hood, B. L., Malehorn, D. E., Conrads, T. P., and Bigbee, W. L. (2009) Serum proteomics

using mass spectrometry. Methods in molecular biology 520, 107-128

287. Byrne, J. C., Downes, M. R., O'Donoghue, N., O'Keane, C., O'Neill, A., Fan, Y.,

Fitzpatrick, J. M., Dunn, M., and Watson, R. W. (2009) 2D-DIGE as a strategy to identify

serum markers for the progression of prostate cancer. J Proteome Res 8, 942-957

288. Garbis, S. D., Tyritzis, S. I., Roumeliotis, T., Zerefos, P., Giannopoulou, E. G., Vlahou, A.,

Kossida, S., Diaz, J., Vourekas, S., Tamvakopoulos, C., Pavlakis, K., Sanoudou, D., and

Constantinides, C. A. (2008) Search for potential markers for prostate cancer diagnosis,

prognosis and treatment in clinical tissue specimens using amine-specific isobaric tagging

(iTRAQ) with two-dimensional liquid chromatography and tandem mass spectrometry. J

Proteome Res 7, 3146-3158

289. Laxman, B., Morris, D. S., Yu, J., Siddiqui, J., Cao, J., Mehra, R., Lonigro, R. J., Tsodikov,

A., Wei, J. T., Tomlins, S. A., and Chinnaiyan, A. M. (2008) A first-generation multiplex

biomarker analysis of urine for the early detection of prostate cancer. Cancer research 68,

645-649

290. van Gils, M. P., Hessels, D., van Hooij, O., Jannink, S. A., Peelen, W. P., Hanssen, S. L.,

Witjes, J. A., Cornel, E. B., Karthaus, H. F., Smits, G. A., Dijkman, G. A., Mulders, P. F.,

and Schalken, J. A. (2007) The time-resolved fluorescence-based PCA3 test on urinary

sediments after digital rectal examination; a Dutch multicenter validation of the diagnostic

performance. Clin Cancer Res 13, 939-943

291. Barry, M. J. (2001) Clinical practice. Prostate-specific-antigen testing for early diagnosis of

prostate cancer. N Engl J Med 344, 1373-1377

292. Draisma, G., Boer, R., Otto, S. J., van der Cruijsen, I. W., Damhuis, R. A., Schroder, F. H.,

and de Koning, H. J. (2003) Lead times and overdetection due to prostate-specific antigen

screening: estimates from the European Randomized Study of Screening for Prostate

Cancer. Journal of the National Cancer Institute 95, 868-878

293. Esserman, L., Shieh, Y., and Thompson, I. (2009) Rethinking screening for breast cancer

and prostate cancer. Jama 302, 1685-1692

294. Roobol, M. J., Zappa, M., Maattanen, L., and Ciatto, S. (2007) The value of different

screening tests in predicting prostate biopsy outcome in screening for prostate cancer data

from a multicenter study (ERSPC). The Prostate 67, 439-446

295. Elschenbroich, S., Ignatchenko, V., Sharma, P., Schmitt-Ulms, G., Gramolini, A. O., and

Kislinger, T. (2009) Peptide separations by on-line MudPIT compared to isoelectric

focusing in an off-gel format: application to a membrane-enriched fraction from C2C12

mouse skeletal muscle cells. J Proteome Res 8, 4860-4869

296. Gortzak-Uzan, L., Ignatchenko, A., Evangelou, A. I., Agochiya, M., Brown, K. A., St Onge,

P., Kireeva, I., Schmitt-Ulms, G., Brown, T. J., Murphy, J., Rosen, B., Shaw, P., Jurisica, I.,

and Kislinger, T. (2008) A proteome resource of ovarian cancer ascites: integrated

169

proteomic and bioinformatic analyses to identify putative biomarkers. J Proteome Res 7,

339-351

297. McNeal, J. E. (1992) Cancer volume and site of origin of adenocarcinoma in the prostate:

relationship to local and distant spread. Human pathology 23, 258-266

298. Sodek, K. L., Evangelou, A. I., Ignatchenko, A., Agochiya, M., Brown, T. J., Ringuette, M.

J., Jurisica, I., and Kislinger, T. (2008) Identification of pathways associated with invasive

behavior by ovarian cancer cells using multidimensional protein identification technology

(MudPIT). Mol Biosyst 4, 762-773

299. Huang da, W., Sherman, B. T., and Lempicki, R. A. (2009) Systematic and integrative

analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4, 44-57

300. Ploussard, G., and de la Taille, A. (2010) Urine biomarkers in prostate cancer. Nat Rev Urol

7, 101-109

301. Chung, B. I., Tarin, T. V., Ferrari, M., and Brooks, J. D. (2011) Comparison of prostate

cancer tumor volume and percent cancer in prediction of biochemical recurrence and cancer

specific survival. Urol Oncol 29, 314-318

302. D'Amico, A. V., Whittington, R., Malkowicz, S. B., Schultz, D., Kaplan, I., Beard, C. J.,

Tomaszewski, J. E., Renshaw, A. A., Loughlin, K. R., Richie, J. P., and Wein, A. (1998)

Calculated prostate cancer volume greater than 4.0 cm3 identifies patients with localized

prostate cancer who have a poor prognosis following radical prostatectomy or external-

beam radiation therapy. J Clin Oncol 16, 3094-3100

303. Yossepowitch, O., Eggener, S. E., Bianco, F. J., Jr., Carver, B. S., Serio, A., Scardino, P. T.,

and Eastham, J. A. (2007) Radical prostatectomy for clinically localized, high risk prostate

cancer: critical analysis of risk assessment methods. J Urol 178, 493-499; discussion 499

304. Han, M., Partin, A. W., Pound, C. R., Epstein, J. I., and Walsh, P. C. (2001) Long-term

biochemical disease-free and cancer-specific survival following anatomic radical retropubic

prostatectomy. The 15-year Johns Hopkins experience. Urol Clin North Am 28, 555-565

305. Roehl, K. A., Han, M., Ramos, C. G., Antenor, J. A., and Catalona, W. J. (2004) Cancer

progression and survival rates following anatomical radical retropubic prostatectomy in

3,478 consecutive patients: long-term results. J Urol 172, 910-914

306. Hull, G. W., Rabbani, F., Abbas, F., Wheeler, T. M., Kattan, M. W., and Scardino, P. T.

(2002) Cancer control with radical prostatectomy alone in 1,000 consecutive patients. J Urol

167, 528-534

307. Amling, C. L., Blute, M. L., Bergstralh, E. J., Seay, T. M., Slezak, J., and Zincke, H. (2000)

Long-term hazard of progression after radical prostatectomy for clinically localized prostate

cancer: continued risk of biochemical failure after 5 years. J Urol 164, 101-105

308. Stephenson, A. J., and Kattan, M. W. (2006) Nomograms for prostate cancer. BJU Int 98,

39-46

170

309. Chodak, G. W., and Warren, K. S. (2006) Watchful waiting for prostate cancer: a review

article. Prostate Cancer Prostatic Dis 9, 25-29

310. Klotz, L. (2008) Active surveillance for prostate cancer: trials and tribulations. World J Urol

26, 437-442

311. Paulovich, A. G., Whiteaker, J. R., Hoofnagle, A. N., and Wang, P. (2008) The interface

between biomarker discovery and clinical validation: The tar pit of the protein biomarker

pipeline. Proteomics Clin Appl 2, 1386-1402

312. Batruch, I., Lecker, I., Kagedan, D., Smith, C. R., Mullen, B. J., Grober, E., Lo, K. C.,

Diamandis, E. P., and Jarvi, K. A. (2011) Proteomic analysis of seminal plasma from

normal volunteers and post-vasectomy patients identifies over 2000 proteins and candidate

biomarkers of the urogenital system. J Proteome Res 10, 941-953

313. Dowling, P., Wormald, R., Meleady, P., Henry, M., Curran, A., and Clynes, M. (2008)

Analysis of the saliva proteome from patients with head and neck squamous cell carcinoma

reveals differences in abundance levels of proteins associated with tumour progression and

metastasis. J Proteomics 71, 168-175

314. Hu, S., Yu, T., Xie, Y., Yang, Y., Li, Y., Zhou, X., Tsung, S., Loo, R. R., Loo, J. R., and

Wong, D. T. (2007) Discovery of oral fluid biomarkers for human oral cancer by mass

spectrometry. Cancer Genomics Proteomics 4, 55-64

315. Zhao, T., Zeng, X., Bateman, N. W., Sun, M., Teng, P. N., Bigbee, W. L., Dhir, R., Nelson,

J. B., Conrads, T. P., and Hood, B. L. (2012) Relative quantitation of proteins in expressed

prostatic secretion with a stable isotope labeled secretome standard. J Proteome Res 11,

1089-1099

316. Woodson, K., O'Reilly, K. J., Hanson, J. C., Nelson, D., Walk, E. L., and Tangrea, J. A.

(2008) The usefulness of the detection of GSTP1 methylation in urine as a biomarker in the

diagnosis of prostate cancer. J Urol 179, 508-511; discussion 511-502

317. Hessels, D., Klein Gunnewiek, J. M., van Oort, I., Karthaus, H. F., van Leenders, G. J., van

Balken, B., Kiemeney, L. A., Witjes, J. A., and Schalken, J. A. (2003) DD3(PCA3)-based

molecular urine analysis for the diagnosis of prostate cancer. Eur Urol 44, 8-15; discussion

15-16

318. Irani, J., Salomon, L., Soulie, M., Zlotta, A., de la Taille, A., Dore, B., and Millet, C. (2005)

Urinary/serum prostate-specific antigen ratio: comparison with free/total serum prostate-

specific antigen ratio in improving prostate cancer detection. Urology 65, 533-537

319. Bolduc, S., Lacombe, L., Naud, A., Gregoire, M., Fradet, Y., and Tremblay, R. R. (2007)

Urinary PSA: a potential useful marker when serum PSA is between 2.5 ng/mL and 10

ng/mL. Can Urol Assoc J 1, 377-381

320. Sumitomo, M., Iwase, A., Zheng, R., Navarro, D., Kaminetzky, D., Shen, R., Georgescu, M.

M., and Nanus, D. M. (2004) Synergy in tumor suppression by direct interaction of neutral

endopeptidase with PTEN. Cancer Cell 5, 67-78

171

321. Osman, I., Yee, H., Taneja, S. S., Levinson, B., Zeleniuch-Jacquotte, A., Chang, C., Nobert,

C., and Nanus, D. M. (2004) Neutral endopeptidase protein expression and prognosis in

localized prostate cancer. Clin Cancer Res 10, 4096-4100

322. De Marzo, A. M., Platz, E. A., Sutcliffe, S., Xu, J., Gronberg, H., Drake, C. G., Nakai, Y.,

Isaacs, W. B., and Nelson, W. G. (2007) Inflammation in prostate carcinogenesis. Nat Rev

Cancer 7, 256-269

323. Sfanos, K. S., and De Marzo, A. M. (2012) Prostate cancer and inflammation: the evidence.

Histopathology 60, 199-215

324. Nonomura, N., Takayama, H., Nakayama, M., Nakai, Y., Kawashima, A., Mukai, M.,

Nagahara, A., Aozasa, K., and Tsujimura, A. (2011) Infiltration of tumour-associated

macrophages in prostate biopsy specimens is predictive of disease progression after

hormonal therapy for prostate cancer. BJU Int 107, 1918-1922

325. Gannon, P. O., Poisson, A. O., Delvoye, N., Lapointe, R., Mes-Masson, A. M., and Saad, F.

(2009) Characterization of the intra-prostatic immune cell infiltration in androgen-deprived

prostate cancer patients. J Immunol Methods 348, 9-17

326. Fredolini, C., Meani, F., Luchini, A., Zhou, W., Russo, P., Ross, M., Patanarut, A.,

Tamburro, D., Gambara, G., Ornstein, D., Odicino, F., Ragnoli, M., Ravaggi, A., Novelli,

F., Collura, D., D'Urso, L., Muto, G., Belluco, C., Pecorelli, S., Liotta, L., and Petricoin, E.

F., 3rd (2010) Investigation of the ovarian and prostate cancer peptidome for candidate early

detection markers using a novel nanoparticle biomarker capture technology. AAPS J 12,

504-518

327. Longo, C., Patanarut, A., George, T., Bishop, B., Zhou, W., Fredolini, C., Ross, M. M.,

Espina, V., Pellacani, G., Petricoin, E. F., 3rd, Liotta, L. A., and Luchini, A. (2009) Core-

shell hydrogel particles harvest, concentrate and preserve labile low abundance biomarkers.

PLoS One 4, e4763

328. Tamburro, D., Fredolini, C., Espina, V., Douglas, T. A., Ranganathan, A., Ilag, L., Zhou,

W., Russo, P., Espina, B. H., Muto, G., Petricoin, E. F., 3rd, Liotta, L. A., and Luchini, A.

(2011) Multifunctional core-shell nanoparticles: discovery of previously invisible

biomarkers. J Am Chem Soc 133, 19178-19188

329. Roth, J. (2002) Protein N-glycosylation along the secretory pathway: relationship to

organelle topography and function, protein quality control, and cell interactions. Chem Rev

102, 285-303

330. Tian, Y., Zhou, Y., Elliott, S., Aebersold, R., and Zhang, H. (2007) Solid-phase extraction

of N-linked glycopeptides. Nat Protoc 2, 334-339

331. Zhang, H., Li, X. J., Martin, D. B., and Aebersold, R. (2003) Identification and

quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling

and mass spectrometry. Nat Biotechnol 21, 660-666

172

332. Kaji, H., Yamauchi, Y., Takahashi, N., and Isobe, T. (2006) Mass spectrometric

identification of N-linked glycopeptides using lectin-mediated affinity capture and

glycosylation site-specific stable isotope tagging. Nat Protoc 1, 3019-3027

333. Sogabe, M., Nozaki, H., Tanaka, N., Kubota, T., Kaji, H., Kuno, A., Togayachi, A., Gotoh,

M., Nakanishi, H., Nakanishi, T., Mikami, M., Suzuki, N., Kiguchi, K., Ikehara, Y., and

Narimatsu, H. (2014) Novel glycobiomarker for ovarian cancer that detects clear cell

carcinoma. J Proteome Res 13, 1624-1635

334. Drake, R. R., Schwegler, E. E., Malik, G., Diaz, J., Block, T., Mehta, A., and Semmes, O. J.

(2006) Lectin capture strategies combined with mass spectrometry for the discovery of

serum glycoprotein biomarkers. Mol Cell Proteomics 5, 1957-1967

335. Anderson, N. L., Anderson, N. G., Haines, L. R., Hardie, D. B., Olafson, R. W., and

Pearson, T. W. (2004) Mass spectrometric quantitation of peptides and proteins using Stable

Isotope Standards and Capture by Anti-Peptide Antibodies (SISCAPA). J Proteome Res 3,

235-244

336. Anderson, N. L., Jackson, A., Smith, D., Hardie, D., Borchers, C., and Pearson, T. W.

(2009) SISCAPA peptide enrichment on magnetic beads using an in-line bead trap device.

Mol Cell Proteomics 8, 995-1005

337. Mataija-Botelho, D., Murphy, P., Pinto, D. M., Maclellan, D. L., Langlois, C., and Doucette,

A. A. (2009) A qualitative proteome investigation of the sediment portion of human urine:

Implications in the biomarker discovery process. Proteomics Clin Appl 3, 95-105

338. Nawaz, M., Camussi, G., Valadi, H., Nazarenko, I., Ekstrom, K., Wang, X., Principe, S.,

Shah, N., Ashraf, N. M., Fatima, F., Neder, L., and Kislinger, T. (2014) The emerging role

of extracellular vesicles as biomarkers for urogenital cancers. Nat Rev Urol 11, 688-701

339. Chen, Y. T., Chen, H. W., Domanski, D., Smith, D. S., Liang, K. H., Wu, C. C., Chen, C.

L., Chung, T., Chen, M. C., Chang, Y. S., Parker, C. E., Borchers, C. H., and Yu, J. S.

(2012) Multiplexed quantification of 63 proteins in human urine by multiple reaction

monitoring-based mass spectrometry for discovery of potential bladder cancer biomarkers. J


340. Zhi, W., Wang, M., and She, J. X. (2011) Selected reaction monitoring (SRM) mass

spectrometry without isotope labeling can be used for rapid protein quantification. Rapid

Commun Mass Spectrom 25, 1583-1588

341. Smith, L. M., Kelleher, N. L., and Proteomics, C. T. D. (2013) Proteoform: a single term

describing protein complexity. Nature methods 10, 186-187

342. Pavelka, N., Fournier, M. L., Swanson, S. K., Pelizzola, M., Ricciardi-Castagnoli, P.,

Florens, L., and Washburn, M. P. (2008) Statistical similarities between transcriptomics and

quantitative shotgun proteomics data. Mol Cell Proteomics 7, 631-644

173

343. Huang da, W., Sherman, B. T., and Lempicki, R. A. (2009) Bioinformatics enrichment

tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids

Res 37, 1-13

344. Wu, G., Feng, X., and Stein, L. (2010) A human functional protein interaction network and

its application to cancer data analysis. Genome Biol 11, R53

345. Gross, J. L., and Yellen, J. (1999) Graph theory and its applications, CRC Press, Boca

Raton, Fla.

346. Girvan, M., and Newman, M. E. (2002) Community structure in social and biological

networks. Proc Natl Acad Sci U S A 99, 7821-7826

347. Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., Amin, N.,

Schwikowski, B., and Ideker, T. (2003) Cytoscape: a software environment for integrated

models of biomolecular interaction networks. Genome Res 13, 2498-2504

348. Uhlen, M., Oksvold, P., Fagerberg, L., Lundberg, E., Jonasson, K., Forsberg, M., Zwahlen,

M., Kampf, C., Wester, K., Hober, S., Wernerus, H., Bjorling, L., and Ponten, F. (2010)

Towards a knowledge-based Human Protein Atlas. Nat Biotechnol 28, 1248-1250

349. Alexander, H., Stegner, A. L., Wagner-Mann, C., Du Bois, G. C., Alexander, S., and Sauter,

E. R. (2004) Proteomic analysis to identify breast cancer biomarkers in nipple aspirate fluid.

Clin Cancer Res 10, 7500-7510

350. MacLean, B., Tomazela, D. M., Shulman, N., Chambers, M., Finney, G. L., Frewen, B.,

Kern, R., Tabb, D. L., Liebler, D. C., and MacCoss, M. J. (2010) Skyline: an open source

document editor for creating and analyzing targeted proteomics experiments. Bioinformatics

26, 966-968

351. Han, B., and Higgs, R. E. (2008) Proteomics: from hypothesis to quantitative assay on a

single platform. Guidelines for developing MRM assays using ion trap mass spectrometers.

Brief Funct Genomic Proteomic 7, 340-354

352. Lange, V., Picotti, P., Domon, B., and Aebersold, R. (2008) Selected reaction monitoring

for quantitative proteomics: a tutorial. Mol Syst Biol 4, 222

353. Cooperberg, M. R., Broering, J. M., and Carroll, P. R. (2010) Time trends and local

variation in primary treatment of localized prostate cancer. J Clin Oncol 28, 1117-1123

354. Welch, H. G., and Albertsen, P. C. (2009) Prostate cancer diagnosis and treatment after the

introduction of prostate-specific antigen screening: 1986-2005. Journal of the National

Cancer Institute 101, 1325-1329

355. Thompson, I. M., and Klotz, L. (2010) Active surveillance for prostate cancer. JAMA 304,

2411-2412

356. Ganz, P. A., Barry, J. M., Burke, W., Col, N. F., Corso, P. S., Dodson, E., Hammond, M. E.,

Kogan, B. A., Lynch, C. F., Newcomer, L., Seifter, E. J., Tooze, J. A., Viswanath, K., and

174

Wessells, H. (2012) National Institutes of Health State-of-the-Science Conference: role of

active surveillance in the management of men with localized prostate cancer. Annals of

internal medicine 156, 591-595

357. Sieh, W., Lichtensztajn, D. Y., Nelson, D. O., Cockburn, M., West, D. W., Brooks, J. D.,

and Chang, E. T. (2013) Treatment and Mortality in Men with Localized Prostate Cancer: A

Population-Based Study in California. The open prostate cancer journal 6, 1-9

358. Chang, C. Y., Picotti, P., Huttenhain, R., Heinzelmann-Schwarz, V., Jovanovic, M.,

Aebersold, R., and Vitek, O. (2012) Protein significance analysis in selected reaction

monitoring (SRM) measurements. Mol Cell Proteomics 11, M111 014662

359. Polanski, M., and Anderson, N. L. (2007) A list of candidate cancer biomarkers for targeted

proteomics. Biomarker insights 1, 1-48

360. Schreiner, D., Simicevic, J., Ahrne, E., Schmidt, A., and Scheiffele, P. (2015) Quantitative

isoform-profiling of highly diversified recognition molecules. eLife 4

361. de Gramont, A., Watson, S., Ellis, L. M., Rodon, J., Tabernero, J., de Gramont, A., and

Hamilton, S. R. (2015) Pragmatic issues in biomarker evaluation for targeted therapies in

cancer. Nature reviews. Clinical oncology 12, 197-212

362. Kobayashi, M., Salomon, C., Tapia, J., Illanes, S. E., Mitchell, M. D., and Rice, G. E.

(2014) Ovarian cancer cell invasiveness is associated with discordant exosomal

sequestration of Let-7 miRNA and miR-200. J Transl Med 12, 4

363. Thery, C., Ostrowski, M., and Segura, E. (2009) Membrane vesicles as conveyors of

immune responses. Nature reviews. Immunology 9, 581-593

364. Shedden, K., Xie, X. T., Chandaroy, P., Chang, Y. T., and Rosania, G. R. (2003) Expulsion

of small molecules in vesicles shed by cancer cells: association with gene expression and

chemosensitivity profiles. Cancer research 63, 4331-4337

365. Gyorgy, B., Szabo, T. G., Pasztoi, M., Pal, Z., Misjak, P., Aradi, B., Laszlo, V., Pallinger,

E., Pap, E., Kittel, A., Nagy, G., Falus, A., and Buzas, E. I. (2011) Membrane vesicles,

current state-of-the-art: emerging role of extracellular vesicles. Cellular and molecular life

sciences : CMLS 68, 2667-2688

366. D'Souza-Schorey, C., and Clancy, J. W. (2012) Tumor-derived microvesicles: shedding

light on novel microenvironment modulators and prospective cancer biomarkers. Genes Dev

26, 1287-1299

367. Buzas, E. I., Gyorgy, B., Nagy, G., Falus, A., and Gay, S. (2014) Emerging role of

extracellular vesicles in inflammatory diseases. Nature reviews. Rheumatology 10, 356-364

368. Kim, H. K., Song, K. S., Park, Y. S., Kang, Y. H., Lee, Y. J., Lee, K. R., Kim, H. K., Ryu,

K. W., Bae, J. M., and Kim, S. (2003) Elevated levels of circulating platelet microparticles,

VEGF, IL-6 and RANTES in patients with gastric cancer: possible role of a metastasis

predictor. European journal of cancer 39, 184-191

175

369. Graves, L. E., Ariztia, E. V., Navari, J. R., Matzel, H. J., Stack, M. S., and Fishman, D. A.

(2004) Proinvasive properties of ovarian cancer ascites-derived membrane vesicles. Cancer

research 64, 7045-7049

370. van Doormaal, F. F., Kleinjan, A., Di Nisio, M., Buller, H. R., and Nieuwland, R. (2009)

Cell-derived microvesicles and cancer. The Netherlands journal of medicine 67, 266-273

371. Poliakov, A., Spilman, M., Dokland, T., Amling, C. L., and Mobley, J. A. (2009) Structural

heterogeneity and protein composition of exosome-like vesicles (prostasomes) in human

semen. The Prostate 69, 159-167

372. Drake, R. R., and Kislinger, T. (2014) The proteomics of prostate cancer exosomes. Expert

review of proteomics 11, 167-177

373. Aalberts, M., Stout, T. A., and Stoorvogel, W. (2014) Prostasomes: extracellular vesicles

from the prostate. Reproduction 147, R1-14

374. Moremen, K. W., Tiemeyer, M., and Nairn, A. V. (2012) Vertebrate protein glycosylation:

diversity, synthesis and function. Nature reviews. Molecular cell biology 13, 448-462

375. Varki, A. (1993) Biological roles of oligosaccharides: all of the theories are correct.

Glycobiology 3, 97-130

376. Hakomori, S. (2002) Glycosylation defining cancer malignancy: new wine in an old bottle.

Proc Natl Acad Sci U S A 99, 10231-10233

377. Pierce, M., Buckhaults, P., Chen, L., and Fregien, N. (1997) Regulation of N-

acetylglucosaminyltransferase V and Asn-linked oligosaccharide beta(1,6) branching by a

growth factor signaling pathway and effects on cell adhesion and metastatic potential.

Glycoconjugate journal 14, 623-630

378. Rudd, P. M., and Dwek, R. A. (1997) Glycosylation: heterogeneity and the 3D structure of

proteins. Critical reviews in biochemistry and molecular biology 32, 1-100

379. Venable, J. D., Dong, M. Q., Wohlschlegel, J., Dillin, A., and Yates, J. R. (2004) Automated

approach for quantitative analysis of complex peptide mixtures from tandem mass spectra.

Nature methods 1, 39-45

380. Rost, H. L., Rosenberger, G., Navarro, P., Gillet, L., Miladinovic, S. M., Schubert, O. T.,

Wolski, W., Collins, B. C., Malmstrom, J., Malmstrom, L., and Aebersold, R. (2014)

OpenSWATH enables automated, targeted analysis of data-independent acquisition MS

data. Nat Biotechnol 32, 219-223

381. Gillet, L. C., Navarro, P., Tate, S., Rost, H., Selevsek, N., Reiter, L., Bonner, R., and

Aebersold, R. (2012) Targeted data extraction of the MS/MS spectra generated by data-

independent acquisition: a new concept for consistent and accurate proteome analysis. Mol

Cell Proteomics 11, O111 016717

176

382. Peterson, A. C., Russell, J. D., Bailey, D. J., Westphall, M. S., and Coon, J. J. (2012)

Parallel reaction monitoring for high resolution and high mass accuracy quantitative,

targeted proteomics. Mol Cell Proteomics 11, 1475-1488

383. Gallien, S., Duriez, E., Crone, C., Kellmann, M., Moehring, T., and Domon, B. (2012)

Targeted proteomic quantification on quadrupole-orbitrap mass spectrometer. Mol Cell


384. Ginsburg, G. S., and Willard, H. F. (2009) Genomic and personalized medicine: foundations

and applications. Translational research : the journal of laboratory and clinical medicine

154, 277-287

385. Hamburg, M. A., and Collins, F. S. (2010) The path to personalized medicine. N Engl J Med

363, 301-304

proteomic identification and systematic verification of ... · proteomic identification and...

Documents