proteomic identification and systematic verification of ... · proteomic identification and...
TRANSCRIPT
Proteomic Identification and Systematic Verification of
Biomarkers for Aggressive Prostate Cancer
by
Yunee Kim
A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy
Medical Biophysics University of Toronto
© Copyright by Yunee Kim 2015
ii
Proteomic Identification and Systematic Verification of
Biomarkers for Aggressive Prostate Cancer
Yunee Kim
Doctor of Philosophy
Medical Biophysics
University of Toronto
2015
Abstract
Despite prostate cancer’s status as the most common cancer to affect men in the Western world,
only an estimated 20% of men will die from it. Due to the lack of effective means of
prognostication, many men are unnecessarily biopsied and treated. The early and non-invasive
identification of lethal disease would have a profound impact on the current practice of prostate
cancer management. To this end, the work presented in this thesis describes efforts to
characterize proteins in a prostatic fluid known as expressed prostatic secretions (EPS), and to
delineate factors in this fluid that demonstrate a high potential as prognostic biomarkers. Using
different mass spectrometry-based platforms, I present both a global view of the EPS proteome,
as well as targeted, quantitative measures of promising biomarker candidates in this fluid.
In order to generate an archive of soluble and secreted factors present in the proteome of EPS, I
used shotgun proteomics to analyze direct-EPS and EPS-urine. This led to the identification of
over 2000 proteins that comprise the EPS proteome. The data was integrated with other publicly
accessible information, such as gene expression profiles and immunohistochemistry images, to
characterize this proteome. Furthermore, a collection of prostate-enriched proteins was identified
iii
by comparing the proteomes of EPS-urine and urine, demonstrating the unique protein profile of
EPS.
Next, comparative proteomic profiling of direct-EPS from organ confined and extracapsular
tumors produced a long list of potential biomarker candidates. A targeted assay was
systematically developed in order to quantitatively assess the performance of the most promising
candidates in cohorts of EPS-urines from a heterogeneous population of prostate cancer patients
and controls. Utilizing the assay, a number of candidates were verified to be putative biomarkers
for prostate cancer diagnosis and prognosis. This information needs to be further validated and
holds promise for improved prostate cancer management.
iv
Acknowledgments
Let me begin with my deepest gratitude for the support and encouragement I received from my
supervisor, Dr. Thomas Kislinger. His integrity, courage, and commitment to contribute high
caliber research to advancing medical science have been great driving forces in my own pursuit
towards a productive Ph.D.
The members of my advisory committee, Drs. Theodorus Van der Kwast and Rama Khokha,
have been instrumental in my journey. I thank them for taking the time to engage in stimulating
discussions and offering their valuable insights.
This thesis is a culmination of the dedication of numerous individuals. I thank Drs. Richard
Drake, John Semmes, and Raymond Lance for organizing and providing the samples. Thanks to
Dr. Anthony Gramolini for permitting the unlimited use of the mass spectrometer. Thank you to
Drs. Paul Boutros, Clare Jeon, and Irina Kalatskaya, and Miss Cindy Yao for performing the
bioinformatics analyses and patiently answering my often naïve questions. Lastly, I thank the
patients and donors. None of this would have been possible without their generosity.
My time in grad school was made enjoyable in large part due to the many friends and groups that
became a part of my life. I have been fortunate to work alongside the past and present members
of the Kislinger lab: Ankit Sinha, Dr. Salvador Mejia, Alexandr Ignatchenko, Vladimir
Ignatchenko, Javier Alfaro, Dr. Erwin Schoof, Dr. Lusia Sepiashvili, Dr. Sarah Elschenbroich,
Lyudmila Nedyalkova, Dr. Simona Fontana and Dr. Simona Principe. I admire each and every
one of them for their bright minds and am grateful for the lasting friendships. A special shout out
to Sarah for patiently teaching me the way around a mass spectrometer and guiding me through
my first attempts at analyzing the data. The memories of the happy hours we spent together at
Café Volo after a hard day’s work will be forever cherished. I’m grateful to my lab mate and
friend Simona, with whom I shared countless lunches, washroom breaks, and random
adventures. I admire her tremendous abilities as a scientist and enormous warm heart. Thank you
to Lusia, for always having my back and taking the time to share your brilliant ideas with me. In
addition to my lab mates, I thank the Rottapel, Raught, and Gramolini labs for making the work
environment so pleasurable. The countless hours spent in the windowless mass spec room were
made enjoyable by the many friends that kept me company.
v
I am grateful to my best friends, Serge and Laura, for being my supporters and therapists, even
from half way around the world. I’m thankful to my #yologirls for being those with whom I can
be downright silly. Thanks to my MPC friends for their openness, kindness, generosity, and
deep, thought-provoking insights. They have inspired an exploration of myself to a depth and
truth I have never experienced before.
I thank my family for giving me the best foundation to grow from and showering me with their
warm love every day. With their strong support, I am fearless about whatever life throws my
way.
Last but not least, I thank my partner, Albert, for his quiet patience and unwavering love.
vi
Summary of Accomplishments
Publications (10)
Original research articles
1. Principe S, Jones E, Kim Y, et al. In-depth proteomic analyses of exosomes isolated from
expressed prostatic secretions in urine. 2013. Proteomics 13(10-11):1667. Underlined
authors contributed equally to this work.
2. Kim Y, et al. Identification of differentially expressed proteins in direct expressed prostatic
secretions of men with organ-confined versus extracapsular prostate cancer. 2012.
Molecular Cellular Proteomics 11(12):1870.
3. Principe S, Kim Y, Fontana S, et al. In-depth proteomic analyses of expressed prostatic
secretions in urine: identification of prostate-enriched proteins as potential biomarkers.
2012. Journal of Proteome Research 11(4): 2386. Underlined authors contributed equally
to this work.
4. Drake RR, Elschenbroich S, Lopez-Perez O, Kim Y, et al. In-depth proteomic analyses of
direct expressed prostatic secretions. 2010. Journal of Proteome Research 9:2109.
Reviews, book chapter, commentary
5. Kim Y, Kislinger T. Isoform-specific targeted proteomics sheds new light on synaptic
ligands. 2015. eLife (submitted)
6. Kim Y, Kislinger T. Novel approaches for the identification of biomarkers of aggressive
prostate cancer. 2013. Genome Medicine 5(6):56
7. Kim Y, Lance R, Drake R, Semmes O, Kislinger T. Proximal tissue fluids and targeted
proteomics: a novel approach to identify aggressive prostate cancers. Commentary in
UroToday, published February 26th, 2013 (http://urotoday.com)
vii
8. Cooper S, Tun H, Roper S, Kim Y. Current status of biomarker discovery in human clear
cell renal cell carcinoma. 2012. Journal of Molecular Biomarkers & Diagnosis S2:005.
doi:10.4172/2155-9929.S2-005.
9. Kim Y, et al. Use of Colloidal Silica-Beads for the Isolation of Cell Surface Proteins for
Mass Spectrometry-Based Proteomics. 2011. Methods in Molecular Biology 748:227.
10. Elschenbroich S, Kim Y, et al. Isolation of cell surface proteins for mass spectrometry-
based proteomics. 2010. Expert Review of Proteomics 7:141 Underlined authors
contributed equally to this work.
Presentations to Scientific Audiences (12)
Oral presentations
1. Canadian National Proteomics Network, 2014. Montreal, QC, Canada
2. International Conference on Urine-Omics, 2013. Costa da Caparica, Portugal
3. Canadian National Proteomics Network, 2012. Toronto, ON, Canada
4. Lake Louise Tandem Mass Spectrometry Workshop, 2011. Lake Louise, AB, Canada
5. James Lepock Memorial Symposium Ph.D. Talk, 2011. Toronto, ON, Canada
6. Ontario Cancer Institute Cancer Proteomics Group Meeting, 2010. Toronto, ON, Canada
7. Australian-Canadian Prostate Cancer Research Alliance Conference, 2010. Gold Coast,
QLD, Australia
Poster presentations
8. Canadian National Proteomics Network, 2014. Montreal, QC, Canada
9. Australian-Canadian Prostate Cancer Research Alliance Conference, 2010. Gold Coast,
QLD, Australia
10. Institute of Medical Science Day, 2010. Toronto, ON, Canada
viii
11. 22nd Annual Canadian Society for Immunology Conference, 2009. Whistler, BC, Canada
12. Centre for Genetic Improvement of Livestock Workshop, 2009. Guelph, ON, Canada
Abstracts (7)
1. Kim Y, et al., Biomarkers for the identification of biomarkers in expressed prostatic
secretions. International Conference in Urine-Omics, 2013. Costa da Caparica, Portugal.
2. Kim Y, et al., Identification of differentially expressed proteins in direct expressed
prostatic secretions of men with organ-confined versus extracapsular prostate cancer, 2012.
Canadian National Proteomics Network Conference, Toronto, ON, Canada.
3. Kim Y, et al., Identification of prognostic biomarkers of prostate cancer in prostatic fluids.
Lake Louise Tandem Mass Spectrometry Workshop, 2011. Lake Louise, AB, Canada.
4. Kim Y, et al., Proteomics of prostate cancer: from discovery to validation of putative
prognostic markers. Australian-Canadian Prostate Cancer Research Alliance Conference,
2010. Gold Coast, QLD, Australia.
5. Kim Y, et al., Proteomics of prostate cancer: from discovery to validation of putative
prognostic markers. Institute of Medical Science Day, 2010. Toronto, ON, Canada.
6. Fontana S, Drake R, Kim Y, et al., Proteome profiling of EPS-urine for the identification of
putative prostate cancer biomarkers. 10th Advanced Meeting on Cancer Omics, 2010.
Erice, Sicily, Italy.
7. Drake R, Elschenbroich S, Lopez-Perez L, Kim Y, et al., Comprehensive proteomic
profiling of direct expressed prostatic secretions: A novel strategy for the discovery of
prostate cancer biomarkers, 2009. Clinical Proteomics 5:275.
Academic Awards, Scholarships, Fellowships, and Honours (20)
1. 2014 Paul Starita Graduate Student Fellowships
2. 2014 Peterborough K.M. Hunter Graduate Studentships
ix
3. 2014 Frank Fletcher Memorial Fund
4. 2014 Canadian National Proteomics Network Travel Award
5. 2014 Canadian National Proteomics Network Best Oral Presentation by a Trainee
6. 2013 Ontario Graduate Scholarship
7. 2013 Scace Graduate Fellowship in Prostate Cancer Research
8. 2013 Paul Starita Graduate Student Fellowship
9. 2013 Medical Biophysics Travel Grant
10. 2012 Ontario Graduate Scholarship
11. 2012 Medical Biophysics Excellence Fellowship
12. 2011 James Lepock Memorial Symposium Best PhD Talk Award
13. 2011 Ontario Graduate Scholarship
14. 2011 Canadian Society for Mass Spectrometry Travel Award
15. 2010 Ontario Graduate Scholarship
16. 2010 Paul Starita Graduate Student Fellowship
17. 2010 Australian-Canadian Prostate Cancer Research Alliance Travel Award
18. 2010 Prostate Cancer Foundation of Australia Poster Contest
19. 2010 Paul Starita Graduate Student Fellowship
20. 2009 Institute of Medical Science Entry Award
x
Table of Contents
Acknowledgments .......................................................................................................................... iv
Summary of Accomplishments ...................................................................................................... vi
Table of Contents ............................................................................................................................ x
List of Tables ................................................................................................................................ xv
List of Figures .............................................................................................................................. xvi
List of Appendices ....................................................................................................................... xix
List of Abbreviations ................................................................................................................... xxi
Chapter 1 Introduction and thesis overview ................................................................................... 1
1 Introduction and thesis overview ............................................................................................... 1
1.1 The prostate gland ............................................................................................................... 1
1.1.1 Anatomy and histology of the prostate gland ......................................................... 1
1.2 Prostate cancer .................................................................................................................... 3
1.2.1 Prostate cancer epidemiology and statistics ............................................................ 3
1.2.2 Histopathology and clinical progression of prostate cancer ................................... 5
1.2.3 Current concepts of insignificant versus significant prostate cancer ...................... 6
1.2.4 Genetic alterations and molecular subtypes of prostate cancer .............................. 7
1.2.5 Current methods of prostate cancer and diagnosis ................................................. 8
1.3 Prostate cancer biomarkers ............................................................................................... 11
1.3.1 Introduction to cancer biomarkers ........................................................................ 11
1.3.2 Clinical and analytical considerations for cancer biomarker assay development . 12
1.3.3 Biological sources of biomarkers .......................................................................... 15
1.3.3.1 Tissue proximal fluids ............................................................................ 16
1.3.3.2 Biomarkers in biological fluids .............................................................. 17
1.3.4 Adjunct prostate cancer biomarkers ...................................................................... 18
xi
1.3.4.1 Cellular markers ..................................................................................... 19
1.3.4.2 Genetic aberrations as prognostic biomarkers ........................................ 20
1.3.4.3 MicroRNAs ............................................................................................ 21
1.3.4.4 Circulating biomarkers ........................................................................... 21
1.3.4.5 Urinary biomarkers ................................................................................. 22
1.4 Clinical proteomics for biomarker discovery ................................................................... 23
1.4.1 Discovery proteomics using shotgun, bottom-up proteomics ............................... 23
1.4.2 Protein quantification using mass spectrometry ................................................... 27
1.4.3 Targeted proteomics .............................................................................................. 29
1.4.3.1 Selected reaction monitoring mass spectrometry ................................... 30
1.4.3.2 Application of selected reaction monitoring mass spectrometry to
biomarker research ................................................................................. 31
1.5 Rationale and objectives ................................................................................................... 33
1.5.1 Rationale ............................................................................................................... 33
1.5.2 Hypotheses and aims ............................................................................................. 33
Chapter 2 In-depth proteomic analyses of prostate proximal tissue fluids ................................... 36
2 In-depth proteomic analyses of prostate proximal fluids ......................................................... 36
2.1 Abstract ............................................................................................................................. 36
2.2 Introduction ....................................................................................................................... 37
2.3 Results ............................................................................................................................... 39
2.3.1 In-depth proteomic analyses of direct expressed prostatic secretions .................. 39
2.3.1.1 MudPIT profiling of direct expressed prostatic secretions from
prostate cancer patients ........................................................................... 39
2.3.2 Integrative data mining and comparison to other public databases ...................... 43
2.3.2.1 Comparison of the direct-EPS proteome to other published proteomic
datasets ................................................................................................... 45
2.3.2.2 Multi-dataset integration......................................................................... 46
xii
2.3.3 Identification of prostate-enriched proteins by in-depth proteomic analyses of
expressed prostatic secretions in urine .................................................................. 50
2.3.3.1 MudPIT profiling of expressed prostatic secretions in urine ................. 50
2.3.3.2 Identification of prostate-enriched proteins ............................................ 57
2.3.3.3 Characterization of prostate-enriched proteins ....................................... 62
2.3.3.4 Generation of candidate shortlist ............................................................ 64
2.3.3.5 Verification of proteomic data ................................................................ 64
2.4 Discussion ......................................................................................................................... 66
2.5 Materials and methods ...................................................................................................... 67
2.5.1 Materials ............................................................................................................... 67
2.5.2 Sample collection and properties .......................................................................... 67
2.5.3 Protein digestion and sample preparation ............................................................. 69
2.5.4 MudPIT analyses .................................................................................................. 69
2.5.5 Protein identification and data analyses ................................................................ 71
2.5.5.1 Gene Ontology annotation and data comparisons .................................. 72
2.5.6 Prostate-enriched proteins characterization .......................................................... 73
2.5.6.1 SDS-PAGE and Western blot analysis on urine and EPS-urine pools ... 74
Chapter 3 Identification of candidate biomarkers of aggressive prostate cancer ......................... 75
3 Identification of candidate biomarkers of aggressive prostate cancer ..................................... 75
3.1 Abstract ............................................................................................................................. 75
3.2 Introduction ....................................................................................................................... 76
3.3 Results ............................................................................................................................... 78
3.3.1 Proteomic characterization of expressed prostatic secretions from organ-
confined and extracapsular prostate cancers ......................................................... 78
3.3.2 Semi-quantitative comparison of expressed prostatic secretions from
extracapsular and organ-confined prostate cancers .............................................. 84
3.3.3 Generation of candidate shortlist .......................................................................... 87
xiii
3.3.4 Verification of differentially expressed candidates in direct-EPS ........................ 90
3.3.5 Verification of differentially expressed candidates in EPS-urines ....................... 93
3.4 Discussion ......................................................................................................................... 99
3.5 Materials and methods .................................................................................................... 103
3.5.1 EPS Sample Properties ....................................................................................... 103
3.5.2 Protein digestion and sample preparation for mass spectrometry ...................... 104
3.5.3 Protein identification and data analysis .............................................................. 104
3.5.4 Gene Ontology annotation, pathway enrichment and network-based analysis
of EPS proteins ................................................................................................... 105
3.5.5 Integrative data mining ....................................................................................... 106
3.5.6 Immunoassays in prostatic fluids ........................................................................ 106
3.5.7 Statistical analysis ............................................................................................... 107
3.5.8 SRM-MS development ....................................................................................... 107
3.5.9 Relative quantification by SRM-MS in EPS-urines ........................................... 109
Chapter 4 Systematic development of a quantitative assay for putative prostate cancer
biomarkers in expressed prostatic secretions ......................................................................... 110
4 Systematic large-scale development of targeted proteomics assays for biomarker
verification in prostate-proximal fluids .................................................................................. 110
4.1 Abstract ........................................................................................................................... 110
4.2 Introduction ..................................................................................................................... 111
4.3 Results ............................................................................................................................. 112
4.3.1 Candidate selection for targeted quantification by selected reaction monitoring
mass spectrometry ............................................................................................... 112
4.3.2 Targeted relative quantification of candidates in a cohort of EPS-urines .......... 114
4.3.3 Selection of peptide candidates for verification .................................................. 116
4.3.4 Absolute quantification for the verification of candidates in EPS-urines ........... 119
4.3.5 Comparative analysis between subgroups of patients ........................................ 122
4.4 Discussion and future direction ...................................................................................... 127
xiv
4.5 Materials and Methods .................................................................................................... 130
4.5.1 Sample collection and properties ........................................................................ 130
4.5.2 Sample preparation for mass spectrometry ......................................................... 131
4.5.3 Target acquisition by selected reaction monitoring mass spectrometry ............. 131
4.5.4 Phase 2 peptide selection .................................................................................... 132
4.5.5 Phase 2 peptide quantification and Phase 3 peptide selection ............................ 132
4.5.6 Generation of response curves and determination of figures of merit ................ 133
4.5.7 Multiplexed, scheduled selected reaction monitoring mass spectrometry
method ................................................................................................................. 133
4.5.8 Verification of Phase 3 peptides in EPS-urines .................................................. 133
4.5.9 Quantitative and statistical analyses ................................................................... 134
Chapter 5 Summary and future direction .................................................................................... 135
5 Summary and future direction ................................................................................................ 135
5.1 Summary of key findings ................................................................................................ 135
5.1.1 In-depth proteomic analyses characterizing the direct and urinary expressed
prostatic secretion proteomes .............................................................................. 135
5.1.2 Discovery phase: identification of candidate biomarkers of extracapsular
prostate cancer .................................................................................................... 136
5.1.3 Verification phase: targeted quantification of selected prostate cancer
biomarker candidates .......................................................................................... 136
5.1.4 Additional proteomics analysis of prostate-proximal fluids ............................... 137
5.2 Future directions ............................................................................................................. 138
5.3 Concluding remarks ........................................................................................................ 142
References ................................................................................................................................... 143
xv
List of Tables
Table 1.1 Descriptors of disease biomarkers. ............................................................................... 14
Table 1.2 Common sources of prostate cancer biomarkers. ......................................................... 15
Table 1.3 Summary of common labeled and label-free protein quantification techniques using
MS. ................................................................................................................................................ 29
Table 2.1 Clinical information of the direct-EPS fluids analyzed by MudPIT proteomics. ......... 39
Table 2.2 Clinical information for the prostate cancer (5 samples) and benign prostatic
hyperplasia (6 samples) EPS-urines analyzed by MudPIT. .......................................................... 51
Table 2.3 Clinical information for the pooled urine and EPS-urine samples analyzed by
MudPIT.. ....................................................................................................................................... 58
Table 3.1 Clinical information of patients used for discovery proteomics. .................................. 80
Table 3.2 Fourteen candidates selected by data mining scheme. ................................................. 89
Table 3.3 Clinical information of patient samples used for Western blotting verification. .......... 91
Table 3.4 Clinical information of patient samples used to generate EPS-urine pools. ................. 95
Table 4.1 Overview of clinical information from patients enrolled in Cohort A of EPS-urines. 114
Table 4.2. Limits of detection and quantification for 34 peptides determined by response curves
generated using AQUA peptides spiked into urine. .................................................................... 118
Table 4.3 Overview of clinical information from patients enrolled in Cohort B of EPS-urines. 119
Table 4.4 Relative expression of Phase 3 proteins and peptides in EPS fluids using various
quantitative methods. .................................................................................................................. 126
xvi
List of Figures
Figure 1.1 The prostate gland. ........................................................................................................ 2
Figure 1.2 Cell types of the prostate gland.. ................................................................................... 3
Figure 1.3 Progression of human prostate cancer. .......................................................................... 6
Figure 1.4 Receiver operating characteristic curve. ...................................................................... 13
Figure 1.5 Prostate specific antigen access to circulation. ............................................................ 17
Figure 1.6 Various strategies for novel molecular biomarker identification. ............................... 19
Figure 1.7 Schematic of multidimensional protein identification technology (MudPIT). ........... 25
Figure 1.8 Dynamic concentration range of human plasma proteins. .......................................... 27
Figure 1.9 Selected reaction monitoring mass spectrometry. ....................................................... 30
Figure 2.1 Proteomics analyses of direct expressed prostatic secretions. ..................................... 41
Figure 2.2 Functional enrichment of proteins detected in EPS. ................................................... 42
Figure 2.3 Integrative data mining and comparison to other databases. ....................................... 44
Figure 2.4 Comparison to other published proteomes. ................................................................. 46
Figure 2.5 Integration of multiple data resources with the EPS proteome. .................................. 48
Figure 2.6 Immunohistochemistry images as obtained from the Human Protein Atlas. .............. 50
Figure 2.7 Study workflow. Proteomic analysis of EPS-urine and urine samples. ...................... 53
Figure 2.8 Characterization of the EPS-urine proteome.. ............................................................. 54
Figure 2.9 Functional enrichment of proteins detected in EPS-urine and comparison of the EPS-
urine proteome to other related body fluids. ................................................................................. 56
xvii
Figure 2.10 Comparison between the two proteomics data of the current study. ......................... 59
Figure 2.11 Data mining to identify prostate-enriched proteins. .................................................. 62
Figure 2.12 Additional annotations for prostate-enriched proteins. ............................................. 63
Figure 2.13 Validation of mass spectrometry data for seven selected proteins. ........................... 65
Figure 3.1 Workflow of study design from the discovery of differentially expressed proteins to
verification of candidates in prostatic fluids. ................................................................................ 79
Figure 3.2 Average number of unique proteins identified in the duplicate MudPIT runs for
discovery direct-EPS samples. ...................................................................................................... 81
Figure 3.3 Total number of peptides (A) and total number of spectra (B) recorded for each
individual direct-EPS analyzed in the discovery phase. ............................................................... 81
Figure 3.4 Venn diagram depicting the total number of proteins exclusively expressed in the EC
or OC groups. ................................................................................................................................ 82
Figure 3.5 The current proteomics dataset compared to prostate-related proteomics data from
various reports. .............................................................................................................................. 82
Figure 3.6 Proportion of proteins with predicted SPs and TMDs. ................................................ 83
Figure 3.7 Gene Ontology enrichment of the EC and OC direct-EPS. ......................................... 83
Figure 3.8 Distribution of all 624 proteins identified in the current direct-EPS data set. ............ 84
Figure 3.9 Two significantly enriched pathways within the 133 differentially expressed proteins
are involved in hemostatic and immunological processes. ........................................................... 86
Figure 3.10 Additional modules enriched by pathway analysis. .................................................. 87
Figure 3.11 Protein prioritization scheme for the selection of candidates for verification. .......... 88
Figure 3.12 Measurement of PSA and PAP in direct-EPS. .......................................................... 90
Figure 3.13 Differential expression of verified candidates. .......................................................... 92
xviii
Figure 3.14 Differential expression of candidates measured by immunoassays. ......................... 93
Figure 3.15 Standard curves generated for each peptide used for SRM-MS measurements. ....... 97
Figure 3.16 Verification of differential expression of candidates in EPS-urines pooled from
patients with EC or OC prostate tumors. ...................................................................................... 98
Figure 3.17 PSA measurements in individual EPS-urines from patients with EC and OC prostate
tumors by ELISA .......................................................................................................................... 99
Figure 4.1 Study workflow outlining the three major phases of the current project. ................. 113
Figure 4.2 Peptides in EPS-urine.. .............................................................................................. 115
Figure 4.3 Phase 3 peptides. ....................................................................................................... 117
Figure 4.4 Cohort B SRM-MS data quality. ............................................................................... 120
Figure 4.5 Absolute quantification of Phase 3 peptides in EPS-urine. ....................................... 122
Figure 4.6 Comparative quantification of candidates in different subgroups of EPS. ............... 125
Figure 5.1 Overview of all proteomics datasets generated for various prostate-proximal fluids..
..................................................................................................................................................... 138
xix
List of Appendices
Appendix Table 1.1 Definitions of the American Joint Committee on Cancer for prostate
cancer tumor staging
Appendix Table 2.1 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples
Appendix Table 2.2 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples that could be mapped to the Human Gene Atlas mRNA
microarray
Appendix Table 2.3 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples that could be mapped to an antibody in the Human Protein Atlas
Appendix Table 2.4 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples and 22Rv1 prostate cancer culture medium
Appendix Table 2.5 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples and PC3 prostate cancer cell culture medium
Appendix Table 2.6 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples and LNCaP prostate cancer cell culture medium
Appendix Table 2.7 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples and seminal plasma
Appendix Table 2.8 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples and normal urine
Appendix Table 2.9 Proteins confidently identified by MudPIT proteomics in 9 direct-EPS
samples and ≥ 10-fold above median in prostate tissue on the Human Gene
Atlas microarray
Appendix Table 2.10 Proteins identified in either the individual EPS-urines or the pooled
samples pre- and post-DRE
xx
Appendix Table 2.11 All 49 prostate-enriched proteins identified in the current study including
additional annotations
Appendix Table 3.1 Protein report
Appendix Table 3.2 Peptide report
Appendix Table 3.3 Pathway enrichment analysis for major network modules identified by
clustering algorithm
Appendix Table 3.4 Peptide sequences used for SRM-MS
Appendix Table 3.5 SRM-MS relative quantification of peptides
Appendix Figure 3.1 Transitions and integrated quantifier ions
Appendix Table 4.1 Information for the Phase 1 (232) peptides selected for SRM-MS
evaluation
Appendix Table 4.2 Fold changes of diagnostic and prognostic peptides in Cohort A measured
by SRM-MS
Appendix Table 4.3 Figures of merit for 34 peptides selected for verification
Appendix Table 4.4 Cohort A patient information
Appendix Table 4.5 Cohort B patient information
Appendix Table 4.6 Functions of Phase 3 protein candidates
Appendix Figure 4.1 Response curves for each of the 34 peptides analyzed in the verification
phase
Appendix Figure 4.2 Correlation between the 34 Phase 3 peptides from EPS-urine of Cohorts A
and B
xxi
List of Abbreviations
AMACR alpha-methylacyl-coenzyme A racemase
AR androgen receptor
AUC area under the curve
BCR biochemical recurrence
BPH benign prostatic hyperplasia
CA125 cancer antigen 125
CID collision induced dissociation
CNA copy number alteration
CV coefficient of variation
DIA data independent acquisition
DRE digital rectal examination
DTT dithiotreitol
EC extracapsular
ELISA enzyme linked immunosorbent assay
EPS expressed prostatic secretions
ESI electrospray ionization
FDR false discovery rate
FFPE formalin fixed paraffin embedded
FI functional interaction
FT-ICR Fourier transform ion cyclotron resonance
GO Gene Ontology
HGPIN high grade intraepithelial neoplasia
HPA Human Protein Atlas
ICAT isotope coded affinity tags
IL-6sR interleukin 6 receptor
iTRAQ isobaric tags for relative and absolute quantification
KLK3 kallikrein 3
LC liquid chromatography
LOD limit of detection
LOQ limit of quantification
m/z mass-to-charge ratio
MALDI matrix assisted laser desorption ionization
miRNA microRNA
MME membrane metalloendopeptidase
mRNA messenger RNA
MS mass spectrometry
MS/MS tandem mass spectra
MSMB microseminoprotein-β
MudPIT multidimensional protein identification technology
NC noncancer
xxii
NPV negative predictive value
NR nonrecurrent
NSAF normalized spectral abunance factor
OC organ-confined
PAP prostatic acid phosphatase
PARK7 Parkinson protein 7
PCA3 prostate cancer antigen 3
PPV positive predictive value
PRM parallel reaction monitoring
PSA prostate specific antigen
PSMA prostate specific membrane antigen
QconCAT quantification concatemer
QqQ tripe quandrupole
ROC receiver operating characteristic
RP radical prostatectomy
RP reverse phase
SCX strong cationic exchange
SD standard deviation
SD signal peptide
SFN stratifin
SID stable isotope dilution
SILAC stable isotope labeling with amino acids in cell culture
SRM selected reaction monitoring
TGF-β1 transforming growth factor beta 1
TGM4 transglutaminase 4
TIMP1 tissue inhibitor of metalloproteinase 1
TMD transmembrane domain
TMPRSS2:ERG transmembrane protease serine 2 gene fusion with E twenty-six transcription factors
TMT tandem mass tags
TOF time-of-flight
uPA urokinase plasminogen activator
1
Chapter 1
Introduction and thesis overview
Portions of this chapter have been reformatted from the original publication: Kim, Y., Kislinger
T. (2013). Novel approaches for the identification of biomarkers of aggressive prostate cancer.
Genome Medicine 5:56
1 Introduction and thesis overview
1.1 The prostate gland
1.1.1 Anatomy and histology of the prostate gland
The prostate is a secretory gland that sits at the base of the bladder in the male urogenital tract
(Figure 1.1). It surrounds the urethra and functions to produce fluid that consists of factors that
nourish sperm and aids in fertilization.
Three major regions of the prostate that differ histologically and biologically exist: the
peripheral, central, and transition zones (1). The transition zone is commonly the site of benign
prostatic hyperplasia, whereas carcinoma occurs primarily in the peripheral zone of the gland (1,
2). Surrounding the prostate is an integral fibromuscular band known as the prostatic capsule.
Within the epithelium of the prostate, three distinct cell types exist (Figure 1.2): secretory
luminal cells, basal cells, and neuroendorcrine cells (2). Secretory luminal cells predominate the
epithelium and function to produce prostatic proteins, including prostate specific antigen (PSA),
that are secreted into the lumen of the prostatic duct. They are characterized by the expression of
cytokeratin 8 and 18, CD57, and androgen receptor (2-7). The basal cells form a continuous
layer, separating luminal cells from the basement membrane and stroma (1). They do not
produce secretory proteins and are characterized by the expression of cytokeratins 5, 14, and
CD44 (3-8). Lastly, a small population of neuroendocrine cells are dispersed throughout the
basal cell layer (2). These cells are believed to support the growth of the secretory luminal cells
(9, 10), and can, in rare cases (1% of all prostate cancers), become neoplastic, producing highly
aggressive small cell carcinoma of the prostate (2).
2
Figure 1.1 The prostate gland. The secretory gland is located at the base of the bladder in the
male urogenital tract. The prostatic urethra runs through the organ, carrying urine through it.
Three major zones of the prostate exist: the central, transition, and peripheral zones.
3
Figure 1.2 Cell types of the prostate gland. Secretory luminal cells predominate the prostate
epithelium, releasing proteins into the lumen of the prostatic acini. Basal cells separate the
luminal cells from the basement membrane and stroma. A small population of neuroendocrine
cells are dispersed throughout the basal cell layer.
1.2 Prostate cancer
1.2.1 Prostate cancer epidemiology and statistics
Prostate cancer is the most common cancer among Canadian men (excluding non-melanoma skin
cancer), occurring in 1 of every 8 men and killing every 1 in 28. In 2015 alone, 24,000 new cases
and 4100 deaths are estimated in Canadian men (11). Most patients are diagnosed after age 65
and most men over 85 years of age have histological prostate cancer (12). The world-wide
incidence rate of this disease can vary considerably, with the highest rates occurring among
black American men and the lowest in India (13). These differences can be attributed to genetic
4
susceptibility, exposure to external risk factors, or the accuracy of cancer registration across
countries (14). Despite these daunting statistics, only 15-20% of men will present with
aggressive disease (15, 16), while the majority of patients remain asymptomatic for many years.
The incidence of prostate cancer is climbing in both high and low-risk populations around the
world for different reasons (17). For instance, in the US, the wide-spread use of PSA screening
has led to the identification of asymptomatic men who harbor the disease (18), while in Sweden,
where screening is much less common, the frequent use of transurethral resection of the prostate
and random biopsies has led to a staggering 100% increase in prostate cancer detection over the
last three decades (18). These practices cannot explain, however, the rising incidence of prostate
cancer in developing countries, where unidentified external risk factors likely play a role (19).
Epidemiological studies have demonstrated a stronger familial clustering of prostate cancer
amongst patients when compared to age and race-matched controls (20). For instance, brothers
of young prostate cancer patients (age at diagnosis <63 years) have a four-fold increased risk of
the disease compared to their brothers-in-law and the general population (21). Brothers and
fathers of prostate cancer patients have a 76% higher risk of prostate cancer than the male first-
degree relatives of control subjects (22). Interestingly, the risk of prostate cancer increases for
first-degree relatives of prostate cancer patients with decreasing age at diagnosis in the patients;
thus, suggesting that screening in men with a family history of prostate cancer should begin at an
earlier age (23, 24). Polymorphisms in genes regulating androgen metabolism and metabolites of
oxidative stress related to apoptosis (25-32) have been linked to prostate cancer. Androgens play
a vital role in the development of the prostate, and medical castration (withdrawal of
testosterone) is a method of treatment for patients with metastatic prostate cancer. Animal studies
have demonstrated that testosterone and dihydrotestosterone can induce the formation of prostate
tumors; however, a definitive study in humans correlating androgen concentrations and prostate
cancer has yet to surface (14). Dietary factors such as high intakes of alpha-linolenic acid and
calcium, as well as red meat cooked at high temperatures, may be linked to increased risk of
disease (14). Other factors such as smoking and alcohol consumption have also been
investigated with varying results.
5
1.2.2 Histopathology and clinical progression of prostate cancer
Prostate tumors predominantly arise from the malignant transformation of luminal epithelial cells
(adenocarcinomas), accounting for >95% of all cancers of the prostate. Other, much less
prevalent categories of prostate cancer are ductal adenocarcinoma, mucinous carcinoma, signet
ring carcinoma, and neuroendocrine. Primary prostate cancer consists of multiple independent
and genetically distinct foci (33-37), while different metastases within an individual may be
clonally related (38, 39). This suggests that metastatic cancer may arise from the selective
advantage of individual clones during cancer progression (40). Age is the greatest risk factor for
prostate cancer; even still, prostates from healthy men between the ages of 20 and 40 frequently
bear histologic foci of prostate cancer (41-43). This suggests that, although multiple neoplastic
transformation events may occur in the prostate, most of them only give rise to latent disease and
do not progress to clinical presentation. Furthermore, latent prostate cancer may arise from a
different pathogenic program from clinical prostate cancer, whereby activating events are
mitigated or there is sufficient tumor suppression to maintain the foci in a subclinical state (40).
High grade prostatic intraepithelial neoplasia (HGPIN), which occurs in 0.7-7.1% of needle
biopsies (44), is thought to be an immediate precursor of early invasive prostate cancer (2, 45,
46), although not a prerequisite for carcinomas (Figure 1.3). This is supported by their existence
in proximity to carcinomas within the peripheral zone and their resemblance to early invasive
carcinoma including multifocality, similarity in chromosomal abnormalities and reduction in the
expression of the differentiation markers E-cadherin and vimentin (2, 47-53). In contrast,
neoplasms can be distinguished from early invasive cancer by the disruption of the basal layer,
which is completely lost in cancer (54). Prostatic intraepithelial neoplasia is histologically
characterized by the presence of luminal epithelial hyperplasia, reduction in basal cells,
enlargement of nuclei and nucleoli, cytoplasmic hyperchromasia, nuclear atypia, and a marked
elevation of cellular proliferation markers (45).
6
Figure 1.3 Progression of human prostate cancer. Adapted Shen and Abate-Shen, Genes
Dev. 2010 © Cold Spring Harbor Laboratory Press. Prostate cancer arises as a result of
genetic events that culminate in the transformation of the secretory epithelial cells
(adenocarcinoma). Prostatic intraepithelial neoplasia is thought to be a non-obligatory precursor
for prostate cancer.
Secondary metastasis of prostate cancer most commonly occurs in bone, where it forms
characteristic osteoblastic lesions (55, 56). Although the mechanisms by which prostate cancer
cells form metastases in bone are poorly understood, tumor cells from patients with localized and
advanced disease can be detected in circulation as well as in bone marrow (57-59).
1.2.3 Current concepts of insignificant versus significant prostate cancer
Insignificant and indolent are terms used interchangeably to prostate describe cancer that is low-
grade, small volume, organ-confined (OC), and thus not likely to progress to clinical significance
in the absence of treatment (60). The Epstein preoperative criteria was developed in an attempt to
clinically and pathologically define subgroups of patients with insignificant disease who may be
spared treatment (61). Insignificant disease is defined as: PSA <10 ng/ml, organ-confinement
without extraprostatic extension, no Gleason pattern 4/5, and <33% positive biopsies (60-64).
Extraprostatic or extracapsular (EC) extension refers to the presence of tumor beyond the
confines of the prostate, and is defined as carcinoma mixed with periprostatic adipose tissue, or
bulging out beyond the contours of the gland. The presence of extraprostatic extension is
associated with an increased risk of biochemical recurrence (BCR) (65). The reported incidence
of clinically insignificant tumors is approximately 40% (62); however, about 30% of patients are
misclassified based on the criteria and present significant disease upon inspection of their radical
7
prostatectomy (RP) specimens (60). Furthermore, even insignificant tumor patients have a 10%
5-year risk of BCR after treatment (66). Therefore, there is a clear and urgent need for
identifying truly biologically significant prostate cancer.
Between 27% and 53% of patients who receive definitive treatment for prostate cancer develop
BCR within 10 years of treatment, as indicated by two consecutive PSA values of >0.2 ng/ml
(67, 68). However, only about 30% of patients with PSA relapse develop clinical recurrence and
6% of them die from it (69, 70). Interestingly, the rate of recurrence appears to be correlated with
the risk of disease. Kapadia et al., demonstrated that in patients treated with radiotherapy, BCR
occurred in 8%, 15%, and 36% of low-risk, intermediate-risk, and high-risk patients, respectively
(71). Time to post-RP BCR, PSA velocity and doubling time, pathological stage and Gleason
score are important clinical parameters for stratifying patients into risk groups for prostate
cancer-specific mortality (72). Approximately half of the patients develop local disease, while
the other half develops distant metastatic disease with or without local disease. Patients may
undergo diagnostic imaging and biopsies to evaluate clinical recurrence and may take a course of
salvage therapy (radiotherapy, hormone therapy, salvage RP), with varying outcomes depending
on factors such as age, pre-treatment PSA, and the presence of distant metastatic disease.
1.2.4 Genetic alterations and molecular subtypes of prostate cancer
Genetic mutations found in prostate cancers occur in abrupt periodic bursts, resulting in large
scale reshuffling of DNA leading to cancer (73). This phenomenon, known as “chromoplexy”, is
a process by which distant genomic regions are disrupted in a coordinated fashion that eliminates
cancer-suppressor genes, leaving genes that may drive cancer progression. Common mutations in
prostate cancers are somatic copy number alterations (CNA), point mutations, structural
rearrangements, and aneuploidy (74). Copy number alterations are present in 90% of prostate
cancers, and are more common in prostate cancer than other types of cancers (75).
Highly altered primary tumors are associated with increase risk of relapse (76). Deletions can be
seen on chromosomes 6q, 8p, 10q, and 13q and include tumor suppressing genes such as NKX3-
1, pTEN, BRCA2, and RB1. Meanwhile, frequent amplification of chromosomes X, 7, 8q, and
9q can be observed, which include the androgen receptor (AR) and MYC oncogenes (74).
Structural rearrangements are more commonly observed in high-risk prostate tumors, of which
8
the TMPRSS2-ERG and ETS fusions are present in 50% of cases (77). Interestingly, recurrent
mutations in SPOP (involved in JNK and Hedgehog signaling pathways) may define a new
molecular subtype of prostate cancer since this mutation is only found in ETS fusion-negative
tumors and exhibit a pattern of distinct alterations in 5q21 and 6q21 (78).
Like breast, renal, and ovarian cancers, primary prostate cancer has a somatic mutation rate of 1
x 10-6 to 2 x 10-6 (79-81), leading to thousands of point mutations that can exist in the prostate
tumor genome; however, only about 20 per genome may have influence on protein stability and
function (74). In prostate cancer, point mutations have been observed in TP53, PTEN, RB1, and
PIK3CA tumor suppressors (78, 81-83), and activating mutations have been reported in the
oncogenes KRAS and BRAF.
1.2.5 Current methods of prostate cancer and diagnosis
In the absence of symptoms, early screening of prostate cancer begins with a digital rectal
examination (DRE) and a PSA blood test. The DRE is a physical examination in which the
physician examines the prostate for nodules through the rectum. Prostate specific antigen, also
known as kallikrein 3 (KLK3), is a glycoprotein and serine protease that is produced by the
secretory epithelial cells of the prostate. Elevated levels of PSA in the bloodstream (the most
widely accepted cutoff for PSA is 4.0 ng/ml) is associated with prostate cancer. However, PSA
increases with benign prostatic conditions such as benign prostatic hyperplasia (BPH) and
prostatitis, as well as transient causes such as acute urinary retention and ejaculation within 48
hours of testing (84).
The estimated sensitivity and specificity of the PSA test at a cutoff of 4.0 ng/ml was ~21% and
91%, respectively, for detecting prostate cancer (85). The positive predictive value (PPV; i.e., the
proportion of men with PSA >4.0 ng/ml who have prostate cancer) is ~30% (86-88), thus about 1
man in 3 with an elevated PSA will have prostate cancer upon biopsy; with PSA >10 ng/ml, the
PPV increases to ~40-60% (87, 89). The negative predictive value (NPV; i.e., the proportion of
men with a PSA ≤4ng/ml who do not have prostate cancer) is 85% (90). Prostate specific antigen
screening has been shown to reduce prostate cancer-specific mortality by 20% but the number of
men needed to be treated in order to prevent one death is estimated to be 18-29 (91).
9
The confirmation of a prostate cancer diagnosis can only be established by examining biopsy
samples. The biopsy is performed with transrectal ultrasound (TRUS) imaging to help guide the
biopsy needles in the prostate. Typically, 6 or more cores are taken to sample different areas of
the organ. Prostate cancer from biopsy specimens is marked by the absence of p63 and
cytokeratin 5 and 14, markers for basal cells (92, 93), while α-methylacyl-CoA racemase is
upregulated in cancerous lesions (93-95). If cancer is identified, the pathologist will determine
the grade and stage of the cancer.
Disease prognosis is defined as the prediction of malignancy in the absence of or post therapy
(96). Prognostic factors are those that, independent of treatment, predict relapse or progression
(96). Currently, pathologic Gleason grading is considered to be the best predictor of outcome
(97). Devised by Dr. Donald Gleason, this procedure involves the histologic evaluation of the
extent of glandular differentiation and pattern of arrangement of carcinoma cells in hematoxylin
and eosin (H&E) stained tissue sections (98). Five grades, from 1 (well differentiated) to 5 (most
poorly differentiated), are assigned to the most predominant growth pattern (primary grade) and
the second most common pattern (secondary grade), which are summed to generate a Gleason
score ranging from 2 to 10.
Patients with Gleason scores 7 or higher are at increased risk of extraprostatic extension and
recurrence after therapy (99, 100). Furthermore, individuals with Gleason 4+3 tumors may be at
greater risk of prostate cancer-specific mortality than Gleason 3+4 patients (101). Poorer
prognosis is associated with Gleason 4 or 5 cancers, while patients with tumors composed of
Gleason 3 pattern are unlikely to progress and die of the disease (101, 102). The defining
difference between Gleason 3 and 4 is a focal loss of integrity of discrete glandular units of the
tumor. Gleason 3 cancers consist of acini that have a smooth and rounded edge with an intact
basement membrane, while Gleason 4 tumors are composed of poorly differentiated, fused
glands, frequently with irregular borders that invade the stroma (103). Gleason 5 pattern is
represented by sheets of cancerous cells due to the complete loss of any rounded glandular shape
(54).
Gleason grading has also been linked to various clinical end points such as clinical stage,
response to therapy, biochemical failure, progression to metastatic disease, and cancer-specific
and overall survival (104-106). Needle biopsy Gleason grade is typically combined with serum
10
PSA and clinical stage in Partin tables, which predict risk for extraprostatic extension, seminal
vesicle invasion, lymph node metastasis, and aid in treatment planning (107). Needle biopsy
Gleason grading provides some idea of the pathologic stage, but is not absolutely accurate;
patients with low-grade prostate cancer may still progress and develop tumors that spread outside
of the prostate, while some high grade tumors may never extend outside of the gland. Although
Gleason grading allows for the assignment of two separate patterns in an individual sample, thus
controlling for some degree of tumor heterogeneity within the gland, an average of 2.7 Gleason
grade patterns are observed in whole prostates (33), with some even having four different
patterns (108). Even tertiary patterns of high grade Gleason pattern 4 or 5 that make up <5% of
the tumor have been found to impact stage and progression (109), so high grade tertiary patterns
are now recommended to be included in reports (107). In addition, it has been recommended that
the most predominant pattern and the highest grade should be summed as the Gleason score, in
cases where the highest grade is not the most predominant nor the second most common pattern
observed (110).
Staging is performed in order to assess the extent of spread of the cancer within the prostate or to
other parts of the body and to facilitate treatment planning. Various tests are performed in order
to gather sufficient information for an accurate assessment of the disease, including imaging and
biopsies. The American Joint Committee on Cancer TNM cancer staging system (Appendix
Table 1.1) is most widely used and is based on five factors: 1) the extent of the primary tumor (T
category); 2) the involvement of lymph nodes (N category); 3) the presence of metastasis (M
category); 4) serum PSA at diagnosis; and 5) the Gleason score (obtained from biopsy or
prostatectomy). Nomograms are multivariate prognostic algorithms that combine different
prognostic factors and facilitate treatment decision-making. Preoperative nomograms include
information such as ethnicity, prostate weight, serum PSA, clinical stage, and Gleason score.
Postoperative nomograms incorporate additional information such as surgical margins, seminal
vesicle invasion, and regional lymph node involvement.
11
1.3 Prostate cancer biomarkers
1.3.1 Introduction to cancer biomarkers
Accurate and timely assessment of prostate cancer prognosis remains one of the most significant
clinical challenges in prostate cancer management. Rapid advances in molecular technologies are
likely to lead to significant progress in the foreseeable future. Despite these technological strides,
prostate cancer is still over-diagnosed and many patients are unnecessarily treated. Possible
reasons are the multifocal and heterogeneous nature of this disease leading to frequent
misclassification of patients, intra-institutional variability, and patient variability; all of which
contribute to the lack of well-defined and validated prognostic biomarkers.
A biomarker is a measureable biological indicator that can provide information about the
presence or progression of a disease, or the effects of a given treatment. A clinically useful
biomarker should have high sensitivity and specificity, as well as high PPV and NPV, and
facilitate clinical decisions to administer optimal care (111) (Table 1.1).
According to the Early Detection Research Network (112), a biomarker should undergo five
major phases of development before it can be confidently utilized under clinical settings and
benefit the population. These phases are: i) preclinical exploratory studies, where
tumor/aggressive disease-associated samples are compared to non-tumor/indolent disease
specimens in order to identify molecular characteristics that distinguish both cohorts and can be
further explored; ii) clinical assay development and validation, whereby an assay that can
accurately measure the biomarker and reliably segregate tumor from non-tumor specimens is
developed; iii) retrospective longitudinal studies that utilize specimens from individuals who
were monitored over time for the development or progression of disease (e.g., patients who
progress from indolent to aggressive prostate cancer) are compared to individuals who do not
develop disease or do not progress; iv) prospective screening studies that are performed using the
assay in order to evaluate the extent of disease at the time of detection; and v) randomized
control studies that are performed in order to determine the reduction of disease burden in the
population, as a result of performing the assay.
12
1.3.2 Clinical and analytical considerations for cancer biomarker assay
development
Once a putative cancer biomarker has been determined, a clinically useful assay must be
developed. Clinical validation involves determining the sensitivity and specificity, which
measure the diagnostic accuracy of a test (113) (Table 1.1). Clinical sensitivity refers to the
ability of a test to correctly classify individuals affected by the disease. Specificity is the ability
of the test to correctly identify individuals without the disease. These parameters depend on the
decision threshold used for a test; that is, a predetermined threshold that, when exceeded,
indicates disease or state (113). Receiver operating characteristic (ROC) analysis can be used to
evaluate the performance of a clinical assay and is used for determining the threshold (Figure
1.4). Various methods of calculating the optimal decision threshold exist. The most common
method is to find the point on the curve closest x = 0, y = 1 coordinates (114, 115). The area
under the curve (AUC) of the ROC curve is a measure of the sensitivity and specificity of a test
over all threshold values. An ideal assay would have an AUC of 1.0 (100% sensitivity and
specificity), while an assay with no discriminatory power would have an AUC of 0.5.
13
Figure 1.4 Receiver operating characteristic curve. A biomarker with good discriminatory
power has an AUC closer to 1.0 (e.g., marker A). A biomarker with no discriminatory power has
an AUC of 0.5 (e.g., marker B).
The sensitivity and specificity of a biomarker tell us about the accuracy of the test, but do not
provide information about the probability of disease. The clinical prevalence of the disease in
question also plays an important role in the performance of an assay. If the prevalence of the
disease is high in the population, it is more likely that an individual who tests positive for a
disease will truly have the disease than if the prevalence of the disease is low. The parameters
that need to be evaluated are the PPV and the NPV. The PPV indicates the likelihood that, given
a positive test result, the patient actually has the disease; the NPV indicates the likelihood that,
given a negative test result, the patient is free of the disease in question. Although an assay may
have high accuracy, low prevalence of the disease may limit its adoption into clinical use (113).
For instance, the ovarian cancer biomarker, cancer antigen 125 (CA-125), has a PPV of 4%;
hence, only 4 out of 100 positive test will indicate actual disease (113, 116).
14
Table 1.1 Descriptors of disease biomarkers.
Assays targeting a single or a multitude of biomarker candidates must also meet rigorous
analytical standards for reliable quantification and reproducibility. Parameters that should be
accounted for include precision, limit of detection (LOD), limit of quantification (LOQ),
linearity and working range, and specificity (113). These so called Figures of Merit (117) may be
affected by differences in sample matrix and external factors that affect the assay’s sensitivity
(117). Therefore, a calibration curve, represented by the response versus the concentration of a
given analyte (117), should be generated for each analyte for which an assay is being developed.
Calibration is defined as the procedure for determining the relationship between the assay’s
generated signal for a given analyte and the analyte’s concentration (113). Preferably in the
sample matrix that the assay will be used in, a series of concentrations of the analyte (spanning at
least two orders of concentration magnitude) is introduced into the matrix. The data is plotted as
the measured concentration versus the spiked-in concentration. The slope of the curve represents
the analytical sensitivity of the assay for the analyte. A slope of 1 and an intercept of 0 is ideal,
signifying a perfect correlation between the theoretical and measured concentrations of the
analyte.
The calibration curve is also used to define the linear range of detection, the LOD and LOQ, and
the reproducibility of the assay. The lowest concentration at which an analyte can be reliably
distinguished from noise is represented by the LOD (113). The lower LOQ is the lowest analyte
concentration at which a quantitative measurement can be reliably made, while the upper LOQ is
described as the highest concentration of the analyte that falls within the linear range of the assay
(117). The calibration curve can also provide information about possible interference or
endogenous signals from the sample matrix, determined by measurements of the blank sample.
15
Precision, usually expressed as standard deviation (SD), variance, or the coefficient of variation
(CV), is determined by the closeness of agreement between a series of measurements for an
analyte in a sample under specific conditions. Within-run precision (repeatability) refers to
repeated measurements performed under the same conditions. Within-laboratory precision
(reproducibility) is determined by measurements made across laboratories, under different
conditions such as analysis by different technicians (113). Low analyte concentrations may
hamper precision as the detection limit of the assay is reached. Biological variation – i.e., the
fluctuation of an analyte concentration around a homeostatic set point – can greatly affect
precision and thus should be taken into consideration.
1.3.3 Biological sources of biomarkers
When faced with the task of identifying disease biomarkers, “gold standard” samples would be
those in which cases and controls differ absolutely and exclusively for the disease of concern
(118). This would require that cases and controls be identical in all respects except for the
condition for which the biomarker is being identified. Model systems, including cell lines and
genetically homogeneous animals that reflect human conditions may be useful in this respect
(118). These systems allow for ample longitudinal sample collection, have uniform genetic
background, and are typically subjected to highly controlled environments that greatly abrogate
the vast variation seen amongst human subjects. However, models can only provide insight into
potential biomarkers candidates, and should not be wholly relied upon for subsequent analyses.
Table 1.2 Common sources of prostate cancer biomarkers.
16
Human tissues represent an essential class of biomarker sources (Table 1.2). Indeed, tissue can
provide histopathological features, molecular factors, and genetic aberrations or patterns that
may be indicative of disease, patient outcome, and facilitate treatment design. Allowing for
direct evaluation of protein or gene expression, tissue samples can provide important insights
into the biological mechanisms of a given condition (119). However, tissues are obtained
through invasive means that are associated with risks for the patients and limit repeat
measurements. Blood (i.e., serum or plasma) is an attractive alternative to tissue specimens that
is already in routine clinical use. Factors such as circulating tumor cells and DNA, microRNA,
metabolites, and soluble proteins can be non-invasively and repeatedly collected with ease.
Unfortunately, the immense complexity, wide dynamic range, and dilution of specific disease
markers make it extremely difficult to discover protein biomarkers in this fluid. To facilitate this
goal, extensive fractionation strategies and affinity approaches that remove some of the most
abundant blood constituents can be utilized, but introduces additional sample preparation steps.
1.3.3.1 Tissue proximal fluids
Tissue proximal fluids are a class of biofluids that are in direct contact with the disease site. It is
rationalized that local bathing of the site results in enrichment of protein biomarkers compared to
distal circulating fluids (120, 121). In the context of prostate cancer, prostate proximal fluids
include urine, seminal fluid, semen, and expressed prostatic secretions (EPS). Expressed
prostatic secretions can be collected directly from the prostate just prior to RP. Although this
fluid is presumably abundant in prostatic proteins, it is not possible to collect more than once.
Urine has been shown to be a source of microRNAs (122), RNAs (123), soluble proteins (124),
and exosomes (125), and can be considered as a prostate proximal fluid since the prostatic
urethra carries it through the prostate. In the context of a DRE, urine containing EPS (EPS-urine)
can be routinely collected in voided urine. Indeed, EPS-urine has previously been interrogated
for prostate cancer-related proteins and prostasomes (126, 127). A major advantage of urine over
serum or plasma, with regards to protein biomarker detection, is that its contents remain
relatively stable and do not undergo massive proteolytic degradation (128). Nevertheless,
variations in collection volume may result in varying protein concentrations, underscoring the
need for standardized collection protocols.
17
1.3.3.2 Biomarkers in biological fluids
Fluctuations in the levels of molecules in biological fluids, such as blood and tissue proximal
fluids, can serve as cancer biomarkers. As cancer progresses, tumor angiogenesis and tissue
invasion causes the destruction of the tissue architecture. This results in the release of factors into
the interstitial fluid and delivery into the circulation by way of the lymphatics. For instance,
PSA, which is normally abundantly expressed by the secretory epithelial cells of the prostate
diffuses through the basement membrane, stromal layer, and the walls of blood and lymphatic
capillaries into the circulation, and can be seen in the serum of healthy men in the range of 0.5-2
ng/ml (129) (Figure 1.5). In prostatic carcinoma, the destruction of tissue architecture through
the loss of basal cells and degradation of the basement membrane, angiogenesis, and disruption
of cell attachment, causes PSA to leak out of the glandular lumen and into circulation, reaching
serum PSA levels beyond 4 ng/ml (129).
Figure 1.5 Prostate specific antigen access to circulation. Adapted from Kulasingam and
Diamandis Nature Clinical Practice. 2008 © Macmillan Publishers Limited. Normally, small
amounts of PSA enter the circulation by way of the lymphatics. Under diseased states, the
architectural integrity of the prostate is compromised. This leads to PSA leakage out of the gland
and subsequent elevation in circulating PSA levels.
18
Changes in protein abundances may also be explained by aberrant secretion or shedding of
secreted or membrane-bound proteins. Indeed, mutations in the signal peptide (SP) domains of
proteins causes impairment of the SP function and cellular localization of human proteins (130).
Furthermore, a loss of epithelial cell polarity in cancer may result in irregular trafficking of
plasma membrane proteins to apical and basolateral domains causing misdirectional release of
proteins into the extracellular domain (131). Quantitative changes in molecules may also be
explained by fluctuations in gene expression. Alterations in gene or chromosome copy numbers
and changes in transcriptional activity have been described for prostate cancer and are
highlighted in the following sections.
1.3.4 Adjunct prostate cancer biomarkers
In broad terms, current and proposed alternative or adjunct markers for prostate cancer can be
divided into clinical-pathological features and molecular factors. These include the classic
Gleason grading and more recently discovered molecular features that might offer insight into
disease progression and prognosis. Various methodologies can be utilized to identify these
factors (Figure 1.6), and some of them are highlighted in the following sections.
19
Figure 1.6 Various strategies for novel molecular biomarker identification. Adapted from
from Kulasingam and Diamandis, Nature Clinical Practice. 2008 © Macmillan Publishers
Limited. Advancing technologies have paved the way for disease biomarker discovery. The
integration of multiple discovery modalities holds promise for novel molecular biomarkers.
1.3.4.1 Cellular markers
Ki-67 is a nuclear protein that is associated with cellular proliferation (132). Its
immunohistochemical staining index has been correlated with outcome in treated patients (133-
136). Heterogeneous immunohistochemical staining for α-methylacyl-coenzyme A racemase
(AMACR) has been correlated with Gleason score (137), and low AMACR gene expression in
localized prostate cancer has been linked to recurrence and metastasis (138). Prostate specific
membrane antigen (PSMA) is a transmembrane protein expressed in all types of prostatic tissue
that is used in the diagnosis of prostate cancer (139). Its over-expression is associated with
higher tumor grade, stage, PSA recurrence, and metastatic disease (140, 141).
20
1.3.4.2 Genetic aberrations as prognostic biomarkers
Focusing on a specific pathway or a group of interrelated genes involved in fundamental tumor
biology has also proven useful. Cuzick et al. (142) focused on genes involved in cell cycle
progression and measured mRNA expression of 126 genes in formalin-fixed paraffin-embedded
prostate cancer tissues. A 31-gene signature was generated on the basis of their correlation with
the mean expression of the entire panel of 126 genes. When used to retrospectively score patients
who underwent prostatectomy and patients with localized disease, this signature was shown to
predict recurrence after surgery and risk of death in conservatively managed patients,
independently of Gleason score and other clinical factors. Using comparative transcriptomic
analyses, Ding et al. (143) identified the robust activation of the Tgfβ/Bmp-Smad4 signaling
pathway in indolent Pten-null mouse prostate tumors. Deletion of Smad4 in the Pten-null mouse
prostate led to highly proliferative, invasive, metastatic and lethal tumors. Combined with key
molecular players, cyclin D1 and osteopontin, their four-gene signature (PTEN, SMAD4, cyclin
D1 and osteopontin) could predict BCR and supplement the Gleason score in predicting lethal
metastasis of prostate cancer in patients.
Genomic instability, caused by variations such as CNAs, may also be useful prognosticators of
prostate cancer. Recently, Lalonde et al., identified a 100-loci DNA signature that could predict
5-year biochemical relapse in patients treated with radiotherapy or RP; its effect augmented by
tumor hypoxia (144). In a comprehensive genomic analysis of prostate cancer, Taylor and
colleagues analyzed CNAs in primary prostate tumors and found distinct patient clusters with
varying degrees of relapse, with no association with Gleason score (76). Penney and colleagues
(145) constructed a 157-gene signature based on the comparison of Gleason ≤6 and Gleason ≥8
patients. When applied to patients with Gleason 7 scores, their signature improved the prediction
of lethality, compared to Gleason score alone. DNA methylation patterns in prostate cancer may
also provide insight into prostate cancer outcome. Cottrell et al. (146) performed a genome-wide
scan in patients with early recurrence, high Gleason score or advanced stage and identified 25
methylation markers that were significantly different between low and high Gleason score
patients. Furthermore, the methylations of 3 markers (GPR7, ABHD9, Chr3-EST) were found to
be significantly increased in patients with recurrence, as measured by elevated post-
prostatectomy PSA levels.
21
1.3.4.3 MicroRNAs
MicroRNAs (miRNAs) are a class of small, non-coding RNA molecules, involved in the
negative regulation of gene expression. Porkka and colleagues (147) demonstrated distinct
miRNA expression profiles of BPH, untreated prostate cancers, and hormone-refractory prostate
cancers, suggesting a potential prognostic role for miRNAs. Mitchell et al. (148) demonstrated
that tumor-derived miRNAs are present in plasma and could show that miR-141 was
significantly elevated in the sera of prostate cancer patients, demonstrating the utility of miRNAs
as blood-based cancer biomarkers. Khan et al. (149) analyzed localized prostate tumors and
adjacent normal tissues, as well as samples from advanced cases using isobaric tags for relative
and absolute quantification (iTRAQ) followed by mass spectrometry (MS). Integrating their
findings with a cancer microarray database, the authors identified differentially expressed
proteins that are targets of miR-128, which was further supported by in vitro experiments
demonstrating a role for miR-128 in prostate cancer invasion.
1.3.4.4 Circulating biomarkers
Urokinase plasminogen activator (uPA) and its inhibitor, PAI-1, have been associated with
aggressive prostate cancer exhibiting extraprostatic extension and seminal vesicle invasion, and
post-prostatectomy recurrence in patients with aggressive disease (150). Preoperative plasma
levels of transforming growth factor beta 1 (TGF-β1) has been shown to be a predictor of BCR
(151) and, in conjunction with preoperative plasma levels of interleukin 6 receptor (IL-6sR), has
been associated with metastasis and progression (152).
Disseminated tumor cells in the bone marrow, a common site of prostate cancer metastasis, have
been shown to have an association with metastatic disease and high Gleason score (153, 154).
Although disseminated tumor cells may be a prognostic marker of unfavorable outcome in
patients with localized disease at diagnosis, attention has shifted to tumor cells that have entered
the peripheral blood as these are more easily accessible. The number of circulating tumor cells
can be determined at the time of diagnosis and elevated numbers, as indicated by reverse
transcriptase polymerase chain reaction for PSA, have been associated with advanced stage and
increased Gleason score (155). Goodman et al. (156) determined that prior to treatment, a cut-off
22
value of 4 circulating tumor cells per 7.5 ml of blood or more was negatively correlated with
survival and could predict metastasis.
1.3.4.5 Urinary biomarkers
Prostate cancer antigen 3 (PCA3) is a prostate specific non-coding RNA that was first identified
in a comparative transcriptomics study in tumor and adjacent normal tissues (157). Subsequently,
a RT-PCR based test was developed for its detection in urinary EPS (158). A ratio of the
PCA3:PSA RNA, known as the PCA3 score, is used in combination with other clinical
information to guide decisions for repeat biopsy in men who are 50 years of age or older and
who previously have had at least one negative prostate biopsies. Interestingly, Nakanishi et al.
(159) reported the mean PCA3 score to be significantly lower in patients with low volume and
low-grade prostate tumors, compared to patients with advanced tumors. However, the ability of
the PCA3 test to predict aggressive prostate cancers is under debate (159-161). Tomlins et al.
(77) first reported the occurrence of a recurrent TMPRSS2:ERG (transmembrane protease serine
2 gene fusion with E twenty-six (ETS) transcription factors) fusion transcript. These fusions
were detectable in 42% of urinary EPS samples from men with prostate cancer (162), although
their presence in urinary sediment was not correlated with biopsy Gleason scores (163).
Telomerase is a ribonucleoprotein involved in telomere synthesis and repair (164). Its activity,
which can be measured in urinary EPS using the telomeric repeat amplification protocol assay
(165, 166), was found to be increased in prostate cancer and has been shown to be associated
with Gleason score (165). Urinary annexin A3 and various matrix metalloproteinases have also
been shown to have diagnostic/prognostic potential for prostate cancer (167-170).
Approximately 3% of the total urinary protein content is comprised of exosomal proteins (171),
thus representing a sub-fraction for discovery of prostate cancer biomarkers (172, 173).
Exosomes are small vesicles (40-100 nm) that are secreted by various normal and tumor cells
(173), containing protein, RNA and lipids (174). Wang et al. (175) used shotgun proteomics to
generate the largest catalogue of urinary exosome proteins to date. In their study, over 3000
unique proteins were identified from samples derived from nine healthy individuals. Exosome
secretion is elevated in biofluids of cancer patients, including prostate cancer (176) and
exosomes have been shown to be enriched in tumor cell-specific transcripts (177, 178). MiRNA
and mRNA are transferable between cells via exosomes and have been shown to be functional in
23
their new location (179). Nilsson et al. (173), in a proof-of-concept study, showed that urinary
exosomes derived from prostate cancer patients contain two known biomarkers (PCA3 and
TMPRSS2:ERG) and thus may be used as disease biomarkers.
1.4 Clinical proteomics for biomarker discovery
Proteins are the effector molecules of the cell and are likely to be the most affected in disease
(118). Proteomes are dynamic and dictated by cellular mechanisms that regulate gene expression,
post-transcriptional events, and post-translational modifications. Furthermore, significant
changes occur in proteomes in response to physiological and environmental stimuli. Hence,
cataloguing the proteome – the entirety of proteins expressed by a particular cell, tissue, or
organism under specific conditions (180) – may provide important indicators of disease.
Clinical proteomics is defined as the analysis of the identity, quality and the quantity of proteins
in patient samples (181). Mass spectrometry-based proteomics is the most widely applied tool in
clinical proteomics and holds promise to identify new disease biomarkers, drug targets, and
aberrant molecular pathways in disease. A typical cancer biomarker discovery workflow consists
of a discovery phase, whereby a comprehensive comparative catalogue of proteins under
different disease states is generated. This is followed by verification of putative biomarkers using
targeted methods of quantification, and finally, validation and clinical assay development (118).
1.4.1 Discovery proteomics using shotgun, bottom-up proteomics
Proteomic analysis of intact proteins (known as top-down proteomics) largely remains limited by
sensitivity and throughput (182). Therefore, bottom-up proteomics – based on the analysis of
proteolysed peptides emerging from proteins (183) – has been widely adopted in proteomic
investigations. Shotgun proteomics refers to the application of bottom-up proteomics to protein
mixtures (184-186).
A mass spectrometer consists of an ion source that ionizes analytes, a mass analyzer that
measures the mass-to-charge ratio (m/z) of the ionized analytes, and a detector that measures the
number of ions of each m/z value (187). Most commonly, proteins and peptides are volatized and
ionized using the soft ionization approaches: electrospray ionization (ESI) (188) or Matrix-
Assisted Laser Desorption Ionization (MALDI) (189). Because ESI ionizes analytes out of
24
solution, it is amenable to liquid-based separation such as liquid chromatography (LC), and is the
preferred method for the analysis of complex samples (187).
As ionized peptides enter the mass spectrometer, they are guided and manipulated by electric or
magnetic fields in a vacuum system. The basic types of mass analyzers used for proteomics are:
time-of-flight (TOF), quadrupole, ion trap, Fourier transform ion cyclotron resonance (FT-ICR),
and more recently, Orbitrap. These can also exist as hybrids, such as triple quadrupoles, TOF-
TOF, and quadrupole TOF. Time-of-flight instruments determine the m/z of an ion based on the
amount of time it takes to travel through a flight tube and hit a detector (190-194); lighter ions
fly faster than heavier ions, and thus reach the detector sooner. Quadrupoles are essentially mass
filters, allowing the selective passage of ions of a defined m/z by applying a particular potential
that stabilizes their trajectory down the quadrupole, while filtering out ions of other m/z values
(195, 196). A mass spectrum is generated by the step-wise application of different potentials and
detecting the ions that pass through at each m/z value over the m/z range (196). Ion traps capture
and accumulate incoming ions in a dynamic electric field; subsequently ejecting them to the
detector according to their m/z. Orbitrap mass analyzers trap ions around a central spindle
electrode (197-199). The oscillation frequency of trapped ions is mass dependent and can be
converted from the recorded time-domain signal into a m/z spectrum using the Fourier
transformation algorithm (200). Similarly, FT-ICR mass analyzers resolve the masses of ions by
recording the rotation frequency of the ions in a magnetic field (201). Detectors (e.g. electron
multiplier) detect ions that pass through the mass analyzer and generate an electric current that is
proportional to their abundance.
The workflow of a typical shotgun experiment begins with the enzymatic digestion (usually with
the protease trypsin) of a mixture of proteins in a gel or in solution into peptides, followed by LC
separation. The generation of high-quality spectra heavily depends on the sufficient fractionation
of immensely complex samples (for instance, a typical serum digest contains 400,000 – 600,000
peptides (202)). First reported by Washburn and colleagues (184, 203), multidimensional protein
identification technology (MudPIT) is a technique of fractionation, yielding comprehensive
proteome coverage (204) (Figure 1.7). Here, peptides are loaded on a biphasic LC column
consisting of a strong cationic exchange (SCX) and a C18 reverse phase (RP). The protein
mixture is separated by means of SCX using iterative steps of increasing salt concentration (“salt
25
bumps”) followed by RP chromatography. Peptides are thus separated in two orthogonal
dimensions prior to MS analysis.
Figure 1.7 Schematic of multidimensional protein identification technology (MudPIT). In
this setup, a peptide mixture sample (S) is automatically loaded onto a biphasic pre-column
consisting of a strong cation exchange (SCX) and a reverse phase (RP) resin, coupled to an
analytical RP column. The mobile phase is water with increasing concentrations of an organic
solvent (e.g., acetonitrile). The eluting peptides are subsequently introduced to the mass
spectrometer by way of ESI. A series of increasing concentrations of ammonium acetate steps
(pink to red circles) further separate increasingly basic (charged) peptides.
In shotgun proteomics, intact peptide ions (known as precursor/parent ions) travel through the
mass spectrometer, where their m/z values are recorded to generate a mass spectrum referred to
as a MS or survey scan (196, 206). Following each survey scan, precursor ions are isolated based
on abundance in a data-dependent manner and subsequently fragmented. This is known as a
MS/MS or product ion scan and generates tandem mass spectra. The MS spectrum provides
information about the exact mass and charge of a peptide and the MS/MS spectrum provides
information about the amino acid sequence of that peptide. This is compared to theoretical, in
26
silico tandem mass spectra in a protein database and peptides are assigned to protein sequences
in a process known as protein inference.
The process of data dependent acquisition collects MS/MS data for only 3-10 of the most
abundant precusor ions that exceed a predetermined intensity threshold (207). In highly complex
samples, the number of co-eluting ions can significantly exceed the number of MS/MS spectra
the mass spectrometer can record (202). Therefore, high abundance peptides are more likely to
be selected for fragmentation, while low abundance species will be sampled by chance. This
“random sampling” of peptides results in the masking of low abundance peptides by co-eluting
peptides of higher abundance, and is affected by sample complexity, chromatographic resolution,
ionization efficiency, ion suppression, and MS cycling time (202).
The dynamic concentration range of proteins, especially in clinical samples, remains a significant
challenge in proteome analyses by current LC-MS systems. For instance, the plasma proteome
concentration range spans up to 12 orders of magnitude (Figure 1.8). The issue is further
amplified by the presence of highly abundant proteins such as serum albumin (35-30 mg/ml) and
immunoglobulins (5-18 mg/ml), which mask lower abundance proteins below one microgram
per milliliter (208, 209). To circumvent this, fractionation of the proteome can be performed in
order to reduce sample complexity and increase sensitivity of detection. However, this approach
is significantly more time consuming, labour intensive, and introduces another level of
variability due to the introduction of more steps in sample processing (208). Alternatively, high
abundance proteins, such as albumin in plasma, can be selectively removed by immunodepletion,
thereby enriching for less abundant proteins (210). Yet another approach exploits the principles
of Michaelis-Menten kinetics of protein digestion in order to remove high-abundance proteins
(211). In this method, a limited amount of protease, trypsin, is introduced to a lysate, whereby
high-abundance proteins are digested before lower-abundance proteins, following Michaelis-
Menten kinetics. The resulting peptides are then removed via a molecular weight cut-off spin
filter and the remaining proteome is digested under standard digestion conditions and analyzed
by LC-MS/MS. This method has been shown to improve the identification and sequence
coverage of low-abundance proteins by 3-fold. Other avenues of tackling the wide dynamic
range and the immense heterogeneity of complex proteomes is to enrich for sub-proteomes (for
example, N-glycoproteins account for the majority of the current clinically used biomarkers and
27
drug targets, as they are often secreted by cells or found at the cell surface (212)) and tissue-
proximal fluids that may be more abundant in relevant biomarker candidates.
Figure 1.8 Dynamic concentration range of human plasma proteins. Adapted from Schiess
et al., Molecular Oncology. 2008 © Federation of European Biochemical Societies. The
dotted line represents the concentration range of various plasma proteins. Classic plasma
proteins, such as albumin, are in excess of 5 orders of magnitude higher in abundance than tissue
leakage products and other potentially disease-related proteins.
1.4.2 Protein quantification using mass spectrometry
Mass spectrometry-based quantification strategies can be categorized as label-free or labeled
(Table 1.3). Label-free quantification includes spectral counting and extracted ion
chromatograms. Spectral counting (202, 203, 213) is an estimate of the protein abundance based
on the frequency of MS/MS spectra (spectral counts) generated from a given protein divided by
the number of theoretically observable peptides for that protein (known as the protein abundance
index) (214). It is based on the assumption that higher abundance proteins produce more tryptic
peptides, resulting in higher spectral counts. Indeed, a linear relationship between spectral counts
and abundance has been demonstrated over at least two orders of magnitude (202, 215);
however, this method biases against low abundance species.
28
The intensity-based approach uses the measured peptide chromatographic peak areas of peptide
precursor ions that belong to a given protein (202, 213, 216, 217). The key to this approach is the
alignment of chromatographic peaks in order to achieve accuracy in comparative quantitative
data. High quality peaks have aligned retention times, matching precursor ions, charge state, and
fragment ions (218). The AUC of the aligned peptides from each sample are derived and
compared for estimating relative protein abundances. The AUC has been shown to directly
correlate with the concentration of peptides in the range of 10 fmol – 100 pmol (216, 219). One
obvious factor that must be carefully taken into consideration when using this approach is the
presence of co-eluting peptides and shifts in retention time. It has been demonstrated that both
spectral counting and extracted ion chromatograms correlate with protein abundances in complex
samples (217, 220-223).
Isotope labels can be resolved either at the MS or MS/MS level, depending on the type of
labeling method adopted. These include stable isotope labeling with amino acids in cell culture
(SILAC) (224), using mass tags such as isotope-coded affinity tags (ICAT) (225), isobaric tags
for relative and absolute quantification (iTRAQ) (226) and tandem mass tags (TMT) (227).
Although labeling methods allow for more accurate quantification, additional preparation steps
and costs should be considered.
Absolute quantification of proteins by MS is based on stable isotope dilution (SID) (228), which
relies on the introduction of predetermined amounts of stable heavy isotope labeled standards
into the sample. Isotope dilution has been used in the quantification of small molecules for
decades (229), and has been adopted for the absolute quantification of peptides in biological
material in recent years. Peptide labeling methods include chemical synthesis (AQUA) (230),
biologically synthesized artificial quantification concatemer (QconCAT) peptides (231), and
intact isotope labeled protein standard absolute quantification (232-234).
Absolute quantification (AQUA) peptides are highly purified, chemically synthesized peptides
with 13C and 15N isotope labels at lysine and arginine residues. These standards have identical
sequences and thus the same physicochemical properties and chromatographic elution times to
endogenous peptides, but they have a known mass difference that is resolvable by MS.
Quantification using this approach is very straightforward – known amounts of AQUA internal
standards are spiked into the sample prior to the LC-MS step and the resulting recorded
29
abundance of the standard is compared to the endogenous peptide. Although a highly attractive
method, some caveats exist: due to limitations of chemical synthesis, certain peptides such as
those above 15 amino acids in length and those containing chemically reactive residues (e.g.,
oxidation of tryptophan, methionine, cysteine, etc.) or sequences (such as aspartate-glycine, N-
terminal glutamine, etc.) may not be compatible. AQUA peptides are very expensive, due to the
labour intensive process of production, purification, and quantification. Furthermore, since
AQUA peptides are usually fully tryptic peptides, complete tryptic digestion of the target
peptides must be ensured in order to prevent quantification of only the digested fraction (235).
Table 1.3 Summary of common labeled and label-free protein quantification techniques
using MS.
1.4.3 Targeted proteomics
Discovery proteomics does not require a priori knowledge about the contents of the sample being
analyzed, but the stochastic nature of shotgun proteomics limits the reproducible detection and
quantification of proteins between different experiments. Targeted proteomics approaches
selectively analyze only a subset of predefined peptides within a proteome, and hence, offer
30
much higher sensitivity, selectivity, and accuracy as compared to quantification by discovery
methods.
1.4.3.1 Selected reaction monitoring mass spectrometry
Selected reaction monitoring mass spectrometry (SRM-MS) is the most widely used targeted MS
technique and has been used for reliably quantifying small molecules (236). In more recent
years, it has been adopted as a robust, sensitive, reproducible and specific assay for protein
quantification (237-239) and validation of cancer biomarkers from various sources (240-244).
Selected reaction monitoring mass spectrometry allows for the detection and quantification of
predefined sets of peptide targets by employing the use of triple quadrupole (QqQ) mass
spectrometers that allow for two-stage mass filtering. In SRM-MS mode, a precursor ion of a
specific m/z is selected in a front-end mass analyzer (Q1) (Figure 1.9). The ion is then
fragmented by collision-induced dissociation (CID) in a collision cell (Q2), and its resulting co-
eluting fragment ions (transitions) (206) are selectively measured as a function of time in a
second mass analyzer (Q3).
Figure 1.9 Selected reaction monitoring mass spectrometry. By using two stages of mass
filtering in a triple quadrupole mass spectrometer, a specific peptide can be isolated, fragmented,
and subsequently analyzed. This method offers a high level of sensitivity and specificity and can
be coupled to stable isotope dilution for accurate quantification of proteins in complex mixtures.
In order to develop a SRM-MS assay, a priori selection of proteotypic peptides (peptides that are
unique to the proteins of interest and are reliably detected by MS) and the determination of their
MS coordinates (peptide elution time, most intense fragments, and relative intensities of
fragments) is required. Only a small fraction of the millions of peptides resulting from the
enzymatic digestion of a cellular proteome are proteotypic (245), and thus, the ability to target
31
these greatly facilitate peptide detection and alleviate redundant peptide sampling and analysis
time (206). Computational tools to guide the selection of proteotypic peptides have been
developed based on physicochemical properties of frequently observed peptides from large-scale
shotgun experiments (245). Tools such as SRM-MS Collider (246) can identify redundant
transitions of a peptide within a proteome background of choice that can help determine
potentially interfering transitions. Spectral libraries from previous experiments (either available
through online repositories such as the Global Proteome Machine, SRMAtlas, or through own
experimental data) can also be used to help select frequently observed proteotypic peptides.
Alternatively, recombinant proteins of candidates can be digested and analyzed by MS to
generate a spectral library. The acquired spectra can provide information for the peptide SRM-
MS assay: m/z of the precursor and its fragments, the preferred ionization charge state, the
relative intensity of the fragment ions signals, and the chromatographic retention time (206).
To achieve sufficient sensitivity and precise quantification, the SRM-MS cycle time and dwell
time should also be considered. The dwell time refers to the amount of time the instrument
spends monitoring a transition. A higher dwell time, that is, the longer the instrument spends
measuring a given transition, the higher the sensitivity of the assay for that transition. The cycle
time is the sum of all the dwell times of each transition to be monitored in the assay. A longer
cycle time results in fewer data points being measured over the chromatographic elution time of
each peptide, which can affect both the detection and quantification of peptides. Under
conditions where too many transitions are monitored in a single run, the dwell time may be
reduced to sufficiently compromise sensitivity and quantitative accuracy. In such cases a
scheduled approach, whereby the instrument scans for transitions only during a small window of
time centered around a peptide’s expected elution time, can significantly augment the number of
monitored transitions in a single experiment. For less abundant peptides, the dwell time can also
be increased during the elution window in order to increase sensitivity.
1.4.3.2 Application of selected reaction monitoring mass spectrometry to
biomarker research
Validation of proposed biomarker candidates in a major bottleneck in protein biomarker
development. The gold standard for clinical protein biomarker quantification is enzyme linked
immunosorbent assay (ELISA), which relies on the use of specific antibodies. Although ELISAs
32
have excellent specificity and sensitivity, the limited number of antibodies and the cost of
developing a clinically validated ELISA assay (which runs up to $1 million (247, 248)) severely
limit their use for biomarker validation. Protein quantification by SRM-MS has been shown to
have good correlation with ELISAs (249, 250), although its sensitivity has been shown to range
from 34-60% lower depending on the analyte (251). Considering the vast numbers of proposed
biomarkers, the capability of SRM-MS to simultaneously quantify a multitude of analytes in a
matter of hours compared to the usually single analyte analysis of ELISA, significantly facilitates
the timely appraisal of putative candidates. Selected reaction monitoring mass spectrometry
assays can be used to rapidly evaluate putative biomarker candidates, of which the most
promising candidates would be carried forward for ELISA assay development. Therefore, SRM-
MS can serve as an attractive complementary technique to ELISA in the biomarker validation
pipeline.
Selected reaction monitoring mass spectrometry application to the quantification of proteins in
complex biological systems has made considerable headway in recent years. Picotti et. al (252)
demonstrated the application of SRM-MS to target proteins in a complex biological system (total
S. cerevisiae extract) and its ability to cover a dynamic range of over 4-5 orders of magnitude. In
their work, low-abundance proteins that were previously unidentifiable by MS were sucessfully
detected and quantified. Hüttenhain et al. (253) developed a high-throughput workflow for the
quantification of cancer-associated proteins in human urine and plasma. Their study, which
utilized SRM-MS, tracked 408 urinary proteins detectable by SRM-MS. Interestingly, 169 of
these were previously undetected in the datasets from Human Protein Atlas and a urinary
proteome dataset from Adachi et al. (124). Furthermore, using SRM-MS assays in ovarian cancer
and benign ovarian tumor patient plasma, the authors demonstrate the reproducible differential
expression of a number of candidates. In another study, Cima and colleagues (240) focused their
analyses on the N-glycoproteome of Pten-null mouse serum and prostate. Label-free comparative
analysis of the Pten-null animals and age-matched wild-type mice revealed 111 and 12
candidates that were significantly differentially expressed in tissue and sera, respectively. Next,
the authors utilized SRM-MS assays to reliably quantify the 39 protein orthologs (selected based
on consistent quantification) in sera of prostate cancer patients and controls, and used the
resulting profiles to build predictive regression models for diagnosis and grading of prostate
cancer. In our own study (described in Chapter 3), I identified a number of putative prognostic
33
markers of prostate cancer and verified the differential expression of these in prostatic fluids
using SRM-MS (244).
1.5 Rationale and objectives
1.5.1 Rationale
There exists a wide discrepancy between the number of men with prostate cancer and those
dying from it. Although treatment efficacy has improved, the discrepancy is, at least in part,
attributable to the detection of cancers that are non-life threatening within the patient’s lifetime
(254). Considering the potentially serious hazards associated with prostate biopsies, and even
more so with treatment procedures, the current modalities of diagnosis and prognosis are failing
to accurately classify patients based on the significance of their disease and subjecting them to
unnecessary risks. Establishing non-invasive molecular features capable of distinguishing
insignificant prostate tumors from significant ones would greatly alleviate such burdens. In the
current thesis, I have utilized proteomics methodologies to investigate the molecular factors
present in prostatic fluids in order to delineate differential protein signatures of prostate cancer
subtypes.
1.5.2 Hypotheses and aims
(1) Discovery proteomic exploration of EPS and identification of prostate-enriched proteins
I hypothesized that the EPS proteome is a rich source of prostatic proteins. The following aims
were carried out to test the hypothesis:
a) Catalogue the direct-EPS and EPS-urine proteomes using discovery proteomics
techniques;
b) Integrate the proteomics data with other available resources to characterize the EPS
proteomes;
c) Comparatively analyze the proteomes of pre- and post-DRE urines for the identification
of prostate-enriched proteins;
34
d) Verify the expression of a number of enriched proteins.
(2) Comparative proteomic analyses of organ-confined and extracapsular EPS fluids
I hypothesized that comparative analyses of the proteomes of EPS from patients with EC and OC
prostate cancers would reveal putative prognostic biomarkers for prostate cancer. The following
aims were carried out to test the hypothesis:
a) Characterize the proteomes of EPS derived from prostate cancer patients with EC and OC
tumors;
b) Implement a data mining strategy to prioritize proteins based on their biomarker potential
utilizing pathway analysis, publicly available datasets, and literature review;
c) Verify the differential expression of candidates using immunoassays;
d) Evaluate the application of targeted proteomics using SRM-MS in EPS-urine;
e) Develop SRM-MS assays and quantify candidates in EPS-urine.
(3) Systematic large-scale development of targeted proteomics assays for biomarker verification
in prostate-proximal fluids
I hypothesized that protein signatures of prostate cancer could be generated and accurately
quantified in EPS-urine using a multiplexed proteomics assay. The following aims were carried
out to test the hypothesis:
a) Pinpoint putative protein biomarkers and their proteotypic peptides based on discovery
proteomics data from Aim 2;
b) Design SRM-MS assays for peptides using synthetic peptide standards in a urine background;
c) Apply SRM-MS assays to a cohort of EPS-urines from patients with EC, OC prostate cancer,
as well as noncancer individuals for relative quantification of candidates;
35
d) Develop a single SRM-MS assay capable of quantifying multiple biomarker candidates;
e) Apply SRM-MS assay to an independent cohort of EPS-urines for absolute quantification and
verification;
f) Utilize bioinformatics and statistics to identify a protein signature to distinguish different
patient groups.
36
Chapter 2
In-depth proteomic analyses of prostate proximal tissue fluids
This chapter has been reformatted from the following original publications:
Drake, R.R., Elschenbroich, S., Lopez-Perez, O., Kim, Y., Ignatchenko, V., Ignatchenko, A.,
Nyalwidhe, J.O., Basu, G., Wilkins, C.E., Gjurich, B., Lance, R.S., Semmes, O.J., Medin, J.A.,
and Kisilnger, T. In-depth proteomic analyses of direct expressed prostatic secretions. Journal of
Proteome Research 2010; 9:2109. © 2010 American Chemical Society
Principe, S.*, Kim, Y.*, Fontana, S.*, Ignatchenko, V., Nyalwidhe, J.O., Lance, R.S., Troyer,
D.A., Alessandro, R., Semmes, O.J., Kislinger, T., Drake, R.R., and Medin, J.A. Identification of
prostate-enriched proteins by in-depth proteomic analyses of expressed prostatic secretions in
urine. Journal of Proteome Research 2012; 11:2386. © 2012 American Chemical Society *These
authors contributed equally to this work.
2 In-depth proteomic analyses of prostate proximal
fluids
2.1 Abstract
It is expected that clinically obtainable fluids that are proximal to organs contain a repertoire of
secreted proteins and shed cells reflective of the physiological state of that tissue, and thus
represent potential sources for biomarker discovery, investigation of tissue-specific biology, and
assay development. The prostate gland secretes many proteins into a prostatic fluid that
combines with seminal vesicle fluids to promote sperm activation and function. Proximal fluids
of the prostate that can be collected clinically are seminal plasma and expressed prostatic
secretions (EPS). In this chapter, I present two separate proteomic investigations into direct-EPS
and EPS-urine. In the first study, multidimensional protein identification technology (MudPIT)-
based proteomics was applied to direct-EPS obtained from nine men with prostate cancer,
resulting in the confident identification of 916 unique proteins. Bioinformatics analyses using
publicly available microarray data of 21 human tissues (Human Gene Atlas), the Human Protein
37
Atlas database, and other published proteomics data of shed/secreted proteins were performed to
systematically characterize this comprehensive proteome.
In a second study, 1022 unique proteins in a heterogeneous cohort of 11 EPS-urines derived from
biopsy-negative noncancer diagnoses, benign prostatic hyperplasia (BPH), and low grade
prostate cancer were identified. Furthermore, MudPIT was used to generate and compare the
differential proteome from a subset of pooled urines and EPS-urines (pre- and post-digital rectal
examination urines, DRE, respectively) from noncancer and prostate cancer patients. The direct
proteomic comparison of these highly controlled patient sample pools enabled the generation of
a list of prostate-enriched proteins detectable in EPS-urine that is distinguishable from the
complex urine protein background. A combinatorial analysis of both proteomics datasets and the
systematic integration with publicly available proteomics data of related body fluids, human
tissue transcriptomic information, and immunohistochemistry images from the Human Protein
Atlas database allowed the demarcation a robust panel of 49 prostate-derived proteins in EPS-
urine. Finally, the expression of seven of these proteins was validated using Western blotting,
supporting the likelihood that they originate from the prostate.
2.2 Introduction
Proximal fluids are found adjacent to a given tissue or organ and contain a repertoire of secreted
proteins and shed cells reflective of the physiological state of that tissue (255). Hence, tissue-
proximal fluids are rapidly emerging as discovery sources of proteins, metabolites, and genetic
biomarkers for cancers. Seminal plasma and EPS are prostate-proximal fluids that can be
collected in the clinic. Seminal plasma consists of fluids originating not only from the prostate,
but also from several other male accessory glands such as the seminal vesicles, epididymis, and
Cowper’s gland. In aging men with prostatic disease, obtaining seminal fluids for diagnostic
assays and screening purposes is highly dependent on patient age and disease status, and thus not
an attractive assay medium.
The exocrine compartment of the prostate is composed of differentiated epithelial cells, that
actively secrete proteins such as prostate specific antigen (PSA), prostatic acid phosphatase
(PAP), and prostaglandins into the glandular lumen (256), and hence, these proteins can be
accessed by the expulsion of prostatic fluids. Expressed prostatic secretions can be readily
38
obtained in the clinic by forced (or expressed) ejection of prostatic fluids into the urethra
following prostate massage. For evaluation of some types of prostatitis, a vigorous massage is
done to force fluid through the urethra to collect a few drops from the tip of the penis for
subsequent use in microorganism cultures. If a more gentle massage is performed in the context
of a DRE of the prostate, the EPS is then immediately collected in voided urine post-exam (EPS-
urine). Currently, a commercial genetic assay for prostate cancer detection, based on the
presence of a non-coding RNA (PCA3), uses post-DRE EPS-urines as a source of prostate-
derived genetic material (257-259).
In the context of prostate cancer, undiluted EPS (direct-EPS) can be collected while the patient is
under anesthesia, just prior to prostatectomy. In comparison to the clinical DRE massages, a
more vigorous digital rectal massage can be done to force prostatic fluids into the urethra with
subsequent collection from the penis. For prostate cancer biomarker discovery purposes, direct-
EPS has several ideal clinical features beyond its nature as a proximal fluid. Since direct-EPS is
collected just prior to prostatectomy, clinical information that led to the decision to perform a
prostatectomy is readily available (255). Post-prostatectomy pathology staging and grading can
also be obtained for comparison with the pre-surgical assessments. When linked with clinical
outcome data, these direct-EPS fluids thus have the potential to be used for discovery of both
diagnostic and prognostic biomarkers for management of prostate cancer.
A MudPIT-based proteomics approach was employed to obtain a detailed inventory of proteins
present in direct-EPS and EPS-urine samples. By using a high mass accuracy Orbitrap mass
spectrometer and rigorous identification criteria, a high quality account these fluids was obtained
to a degree not previously reported. To select for proteins likely secreted from the prostate in the
complex EPS-urine background, a highly controlled quantitative comparison of pooled samples
of urine collected prior to DRE and EPS-urine was performed. Importantly, these pre- and post-
samples were collected from the same patients, enabling direct quantitative intra-patient
comparisons to identify proteins that are upregulated in the post-DRE samples. The combination
of these data and their integration with publicly available proteomic (124, 260), microarray data
from the BioGPS portal (http://biogps.org/) (261), and immunohistochemistry images from the
Human Protein Atlas (HPA; http://www.proteinatlas.org/) (262, 263) allowed the generation of a
panel of 49 proteins likely secreted by the prostate and detectable in EPS-urine. Additionally, the
39
entire proteomics data was deposited to the Tranche server (www.proteomecommons.com) for
others to utilize.
2.3 Results
2.3.1 In-depth proteomic analyses of direct expressed prostatic
secretions
2.3.1.1 MudPIT profiling of direct expressed prostatic secretions from
prostate cancer patients
Nine individual direct-EPS samples from men with prostate cancer (Gleason 6-7, stages pT1c-
pT2b) were selected for MudPIT-based proteomic analysis (Table 2.1). To minimize variability
introduced by sample handling, direct-EPS samples were directly digested in-solution and
analyzed by a 9-step MudPIT on a LTQ-Orbitrap XL mass spectrometer.
Table 2.1 Clinical information of the direct-EPS fluids analyzed by MudPIT proteomics. a
Serum PSA values, ng/ml. b Gleason score determined by post-prostatectomy pathology. c
Determined by post-prostatectomy pathology. d Determined pre-prostatectomy by transurethral
ultrasound measurements.
Rigorous protein identification criteria resulted in the identification of 916 unique proteins with
high confidence (Figure 2.1A and B), with 230 to 550 unique proteins identified per individual
cancer patient sample (Figure 2.1B). The list of 916 proteins (Appendix Table 2.1) obtained from
40
the direct-EPS cancer samples were compared to those reported in a recent publication by Li and
colleagues (264) who collected several EPS fluids using a more vigorous prostate massage
procedure (Figure 2.1C). Only twenty proteins were unique to the study of Li et al. (264),
consisting primarily of blood-derived proteins. The vast majority of proteins of the 916 proteins
were uniquely identified in the current study with 816 unique protein clusters and 94 protein
clusters that were shared with Li et al. Based on the very low levels of hemoglobin and
seminogelin proteins detected in the analysis, it can be concluded that direct-EPS samples have
minimal contamination with blood and seminal vesicle derived proteins. In relation to known
prostate derived proteins, the next most abundant proteins accounting for an additional 25% total
abundance were lactoferrin, PSA, PAP, zinc-α2 glycoprotein and aminopeptidase N. Functional
annotation of the direct-EPS proteome using Gene Ontology, predicted transmembrane domains
(TMD; TMAP algorithm) and signaling peptides (SP; PrediSi algorithm) revealed a wide variety
of functional categories (Figure 2.2A-C). A significant number of detected proteins were
membrane proteins and members of the extracellular region, including 34 CD molecules
(Appendix Table 2.1). This was further supported by the fact that ~ 40% of the detected proteins
had at least one predicted TMD and contained a predicted SP, indicative of targeting to the
secretory pathway (Figure 2.2D and E). A smaller number of proteins could be assigned to other
intracellular organelles.
41
Figure 2.1 Proteomics analyses of direct expressed prostatic secretions. (A) EPS from nine
men with confirmed prostate cancer were in solution digested and individually analyzed by
online MudPIT on an LTQ-Orbitrap XL mass spectrometer. Rigorous peptide and protein
identification criteria resulted in the identification of 916 unique proteins. (B) Number of unique
proteins detected in each of the analyzed EPS samples. (C) Comparison of our EPS proteome to
a previously published EPS dataset.
42
Figure 2.2 Functional enrichment of proteins detected in EPS. (A-C) Shown are selected
Gene Ontology (GO-Slim) categories of the EPS proteome. (D) Distribution of predicted
transmembrane domain containing proteins (TMAP algorithm). (E) Distribution of predicted
signal peptides (PrediSi algorithm).
43
2.3.2 Integrative data mining and comparison to other public databases
One goal of the current study was to provide the research community with a high quality, well-
annotated resource of proteins expressed in EPS. This could improve our understanding of
general prostate biology and guide the discovery of novel prostate cancer biomarkers. To
accomplish this goal, the data was mapped to two publicly available resources: the Human Gene
Atlas (265), which is based on the mRNA microarray expression profile of >50 human tissues,
and the Human Protein Atlas (263, 266), a large resource of immunohistochemistry images and
available human antibodies (Figure 2.3). The rationale for these comparisons was to identify
proteins detected in EPS fluids that are also enriched in prostate tissue relative to other human
tissues included in the microarray database. Also, integration of EPS proteins with the Human
Protein Atlas database can catalogue the availability of validated antibodies, which could provide
researchers with an important tool to further study the biological function of selected proteins.
Using only mRNAs with ≥3-fold above median expression in prostate tissue compared to 20
other human tissue types in the Human Gene Atlas, 279 direct-EPS gene products were
identified, implying a potential function specifically related to prostate biology. In the Human
Protein Atlas, a total of 513 proteins detected in the direct-EPS proteome currently have
antibodies with varying staining intensities in human prostate tissue. The identities of the EPS
proteins mapped to the two public databases are available in Appendix Tables 2.2 and 2.3.
44
Figure 2.3 Integrative data mining and comparison to other databases. (A) The EPS
proteome was mapped against the Human Gene Atlas (mRNA microarray) and Human Protein
Atlas (antibodies and immunohistochemistry images). (B) 279 unique gene products detected in
our EPS proteome showed ≥3-fold above median expression in normal prostate tissue on the the
Human Gene Atlas microarray (detected by 317 probe IDs), as compared to 20 other tissues.
45
Mapping against available Human Protein Atlas (HPA) antibodies and CD molecules is
provided.
2.3.2.1 Comparison of the direct-EPS proteome to other published
proteomic datasets
Next, the direct-EPS proteome was integrated with three other published proteomic datasets
(124, 267, 268). This included a recent publication of proteins secreted into the cell culture
media by three established prostate cancer cell lines (PC3, LNCaP, and 22Rv1) (268) and two
human body fluid datasets (urine and seminal plasma) directly related to EPS (267, 268). The
rationale for the comparison to the cell secretions was to distinguish between proteins readily
secreted by prostate cancer cells from proteins being secreted by other cell types of the prostate.
This is not an ideal comparison, however, as the cell lines are more reflective of metastatic
prostate cancers and the direct-EPS samples were from lower grade Gleason 6 and 7 cancers. An
additional caveat is that these cells have numerous genomic alterations and have been grown
under culture conditions for many passages distant from the primary isolates, and might not truly
reflect an in vivo setting. Approximately 300 proteins (Figure 2.4) were shared between the EPS
proteome and each of the three prostate cancer cell lines (see Appendix Tables 2.4-2.6 for
detailed mappings). Likewise, approximately 500 proteins were shared between the EPS
proteome and normal human urine and seminal plasma (see Appendix Tables 2.7 and 2.8).
Several well-known prostate markers (i.e. PSA, PAP) were shared among the direct-EPS
proteome and the described prostate cancer cell lines. The data provided in this study could
therefore be used to mechanistically investigate proteins identified in vivo (EPS) using well-
established prostate cancer cell lines. Interestingly, several well known prostate markers (i.e.
MSMB, TMPRSS2) were only identified in the EPS proteome and the other two available body
fluids (see below), strengthening the rationale of using in vivo obtainable fluids to study
prostate/prostate cancer biology. It is likely that proteins exclusively detected in the EPS
proteome, as compared to proteins detected in healthy human urine and seminal plasma, are
specifically enriched in secretions of the prostate under pathological conditions, and were
therefore absent or at a very low concentration, in the other body fluids.
46
Figure 2.4 Comparison to other published proteomes. The EPS proteome was compared to
proteins detected in two other published, related body fluid proteomes Pilch et al. seminal plasma
and Adachi et al. urine) and proteins secreted into the condition medium by three established
prostate cancer cell lines (Sardana et al.).
2.3.2.2 Multi-dataset integration
To fully utilize the power of multi dataset comparison, all publicly available data resources were
integrated with the current direct-EPS proteome (Figure 2.5), as an example of a potential data
mining strategy. Gene products strongly enriched in normal prostate tissue using the described
microarray dataset (10-fold above median expression) that were also detected in the direct-EPS
proteome and had antibodies through the Human Protein Atlas database were focused on. This
integration resulted in 54 unique gene products satisfying all described integration criteria
47
(Figure 2.5 and Appendix Table 2.9). A heat map display of this integration is presented in
Figure 2.5, including available mappings to other public proteome datasets. Although all gene
products are strongly expressed in prostate tissue, most showed additional expression in several
other tissues. Known prostate markers (vertical line), such as PSA, PAP, and
microseminoprotein-β (MSMB) were readily identified. Comparison to other published
proteomics datasets is also indicated in the presented heat map, and this data could be used to
further guide potential follow up analyses. As all 54 proteins displayed in the heat map have
available immunohistochemistry images at the Human Protein Atlas, the available images were
systematically screened for both normal prostate and prostate cancer tissues. Several proteins
with intriguing staining patters are discussed below, demonstrating an example of how the
current data could be used by others.
48
Figure 2.5 Integration of multiple data resources with the EPS proteome. Heat map of 54
unique gene products detected in the EPS proteome with available antibodies in the Human
49
Proteome Atlas database. Selected gene products were enriched in normal prostate tissue as
compared to 20 other human tissues using the Human Gene Atlas microarray database (≥10-fold
above median expression in prostate). Mappings to other available proteome datasets are
indicated. Well known prostate markers are highlighted by a vertical line.
Several proteins were identified which displayed a trend towards differential expression in
normal prostate compared to prostate cancer tissue using available IHC images, although patient
variability and image quality differences were also observed (Figure 2.6). Neprilysin (MME,
CD10) showed a trend towards decreased expression in prostate cancer tissue. Alternatively,
GDF15 and FASN showed a trend towards increased expression in prostate cancer tissues
(Figure 2.6). As the entire dataset has been linked to available mRNA microarray expression,
Human Protein Atlas antibody staining, and other published, related proteomics datasets, this
data will likely provide an important resource for the prostate biology/cancer community to
guide the investigation of other proteins in hypothesis driven experiments.
50
Figure 2.6 Immunohistochemistry images as obtained from the Human Protein Atlas.
Selected examples of proteins found in the Human Protein Atlas database that showed trends
towards differential expression in normal compared to cancer prostate tissues.
2.3.3 Identification of prostate-enriched proteins by in-depth proteomic
analyses of expressed prostatic secretions in urine
2.3.3.1 MudPIT profiling of expressed prostatic secretions in urine
The first aim of this study was to provide the first in-depth proteome catalogue of this clinically
useful proximal tissue fluid (Figure 2.7, left panels). In particular, individual EPS-urine samples
from men diagnosed with low grade prostate cancer (n = 5; Gleason score total 6-7) and biopsy
negative benign conditions (n = 6, BPH) were selected for MudPIT-based proteomic analyses
(Table 2.2).
51
Table 2.2 Clinical information for the prostate cancer (5 samples) and benign prostatic
hyperplasia (6 samples) EPS-urines analyzed by MudPIT. Serum PSA values (reported in
ng/ml) are from the time of initial diagnosis. The treatment column indicates the clinical course
followed for each cancer patient (DVP – DaVinci prostatectomy; Cryo, cryoablation therapy;
AS, active surveillance.
The proteomic characterization of the EPS-urine was performed on this heterogeneous group of
11 samples as a representation of the different types of patient origin, reflective of the most
common benign and prostate cancer conditions presenting in urology clinics. For each sample,
150 µg of total protein was directly digested in solution and analyzed in triplicate by a 9-step
MudPIT on a LTQ-Ion Trap mass spectrometer, as previously described (269, 270). A total of
1022 unique proteins were identified in the EPS-urines (Appendix Table 2.10) by at least two
unique peptides (Figure 2.7, left panels), ranging from 178 to 667 unique proteins determined per
individual MudPIT run (Figure 2.8A). Although the same amount of total protein was digested
and each sample was analyzed in triplicate with a relatively low overall standard deviation
(average standard deviation of 27), there was still large variation in total proteins detected for
each sample. This high variability is reflective of the biological inter-sample variations among
human specimens, which complicates data analyses. Furthermore, dynamic metabolic changes
within each individual are manifested in urine protein content (271), and highlight some of the
52
general problems of proteomic analyses of proximal body fluids (272). These results could also
reflect variation in sample collection, as the DRE procedure required to “express” the prostatic
fluids will be different for each individual (255). Other known limitations are related to the DRE
collection procedure and can be attributed to the physician (e.g. size of hand, ability to reach to
the prostate, etc.) or to some patient physical parameters (e.g. orientation on table during
examination, overweight status, etc.). Therefore, standard collection protocols as well as internal
standards are required to ensure proper collection and to circumvent the introduction of sample
variability resulting from the collection procedure (255).
53
Figure 2.7 Study workflow. Proteomic analysis of EPS-urine and urine samples. The EPS-urine
proteome was defined by MudPIT analyses of 11 individual heterogeneous samples (prostate
cancer, PCa and benign prostatic hyperplasia, BPH) of EPS-urine. A similar analysis was
performed on pooled urine and EPS-urine samples (PCa and noncancer, NC). The comparison of
the two datasets using bioinformatics data mining lead to the identification of some putative
prostate-enriched candidates within the complex EPS-urine proteome.
54
Figure 2.8 Characterization of the EPS-urine proteome. Number of unique proteins detected
per MudPIT analyses in the (A) individual EPS-urine samples and in the (B) pooled urine and
EPS-urines. The black dots in (A) (from EPS-U1 to EPS-U5) represent the EPS-urines from
individual prostate cancer patients and the white dots in (from EPS-U6 to EPS-U11) represent
the EPS-urines from individual benign prostatic hyperplasia patients. Every sample was analyzed
in triplicate. (EPS-U: EPS-urine; U: urine; PCa: prostate cancer; NC: noncancer).
To obtain a systematic overview of the functional categories of proteins expressed in EPS-urine,
Gene Ontology (GO) (273) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways
analyses (274) on the 1022 identified proteins (Figure 2.9) were performed. The comparison was
limited to several representative GO terms in order to obtain a high-level functional overview. In
the biological processes category, a large proportion of identified proteins had functional roles
55
involved in proteolytic activity, cellular adhesion and motion, and immune responses (Figure
2.9A). In the cellular component category (Figure 2.9B), GO terms with an annotation to the
extracellular region were overrepresented; this was further supported by the evidence that ~ 50%
of the detected proteins were classified in the SwissProt database as proteins with signal peptide
sequences, and more than 30% were classified as secreted proteins (Figure 2.9E). Notably, 12%
of the EPS-urine proteins contained the annotation of vesicle localization by GO (Figure 2.9B);
this could explain why the lysosomal pathway is the most highly over-represented among the
KEGG pathways in the EPS-urine dataset (Figure 2.9D). In the molecular function category, a
large proportion of detected proteins had calcium ion binding activity and were involved in the
regulation of peptidase activity (Figure 2.9C). Based on the global GO term analyses, EPS-urine
proteins belong to a large variety of functional categories and, as expected from a secreted fluid
fraction, a large number of these proteins are extracellular and contain a signal peptide sequence.
To further characterize the EPS-urine proteome in the context of other prostate-related fluids, the
current dataset was integrated with normal human urine (124) and direct-EPS (260), which
showed that 455 proteins are shared between these fluids by proteomics analyses (Figure 2.9F).
Present among these shared proteins were several previously proposed prostate cancer
biomarkers (i.e. PSA, PAP, MSMB, PSMA, TMPRSS2), strengthening the rationale of using in
vivo obtainable fluids to study the prostate and prostate cancer biology. On the other hand, these
candidates were detected in all three fluids, which demonstrates the dilemma in detecting
prostate-enriched proteins in a complex protein background of general urine proteins.
56
Figure 2.9 Functional enrichment of proteins detected in EPS-urine and comparison of the
EPS-urine proteome to other related body fluids. (A-E) The list of the 1022 unique proteins
identified in the EPS-urine samples was compared to the human UniProt Database. The reported
graphs show significantly over-represented (p < 0.001) Gene Ontology (GO) terms (A-C),
KEGG pathways (D) and SwissProt entries (E) in the EPS-urine dataset (several representative
annotation terms are shown). (F) The current EPS-urine dataset was compared to previously
published urine (124) and direct-EPS (from Section 2.3.1) (260) proteomic datasets. Proteins are
clustered in homogeneous groups based on 98% similarity (i.e. cluster anchors).
57
Focusing on prostatic secretions, 181 proteins were shared between EPS-urine and direct-EPS
(Figure 2.9F), suggesting that these proteins are specifically enriched in prostatic secretions and
absent or at a very low concentration in urine. This could represent a useful protein dataset for
selecting new prostate specific biomarkers with diagnostic and prognostic capacities for prostatic
diseases. Interestingly, 113 proteins were unique to EPS-urine (absent from direct-EPS and
urine). This could be explained by different mass spectrometry (MS) conditions used between
our group and the study by Adachi et al (124), as well the phenomenon of random sampling
(202) and biological variability, since neither study was likely successful in detecting the entire
body fluid proteome.
This study provides a detailed description of EPS-urine and extensively expands our knowledge
of the EPS-urine proteome. The dataset of 1022 proteins (Appendix Table 2.10) can be used to
implement diagnostic test platforms and improve current screening procedures for prostatic
diseases.
2.3.3.2 Identification of prostate-enriched proteins
A second aim of this study was to provide the first direct comparison of urine samples pre- and
post-DRE using pooled samples from both normal and prostate cancer patients (Figure 2.7, right
panels). These valuable samples enabled the identification of proteins likely released by the
prostate as a result of the DRE. For this purpose, five patients with prostate cancer and five
noncancer individuals were screened once prior to DRE and once after DRE, in order to obtain
internally controlled urines and EPS-urines, respectively (Table 2.3). Each sample pool was
analyzed by a 5-step MudPIT on a LTQ-Orbitrap XL, leading to the identification of 444 unique
proteins (Figure 2.7, right panels and Appendix Table 2.10), with a range of 141 to 258 unique
proteins determined per individual MudPIT run (Figure 2.8B).
58
Table 2.3 Clinical information for the pooled urine and EPS-urine samples analyzed by
MudPIT. Urine and EPS-urine sample pairs were collected from the same patient and pooled as
described in Materials and Methods (serum PSA reported in ng/ml).
In order to highlight signatures of proteins enriched in the EPS-urine, a semi-quantitative
comparative analysis of the EPS-urine and urine data was performed based on the QSpec
algorithm (275). Proteins were considered to be prostate-enriched if they had a ≥2-fold change in
spectral abundance factors (202, 275) with a false discovery rate ≤0.05 by QSpec analysis.
Applying these criteria to the entire list of 444 proteins, a panel of 49 significantly enriched
proteins was generated (Appendix Table 2.11). Interestingly, direct comparison of both
proteomic datasets (1,022 versus 444) showed an overlap of 406 proteins that included the 49
protein signature of “prostate-enriched proteins” (Figure 2.10).
59
Figure 2.10 Comparison between the two proteomics data of the current study. The Venn
diagram shows a comparison between the EPS-urine dataset (1022 proteins) and the dataset
containing proteins identified in both pooled urine and EPS-urine samples (444 proteins). The
overlapping area of 406 proteins includes a short list of proteins enriched in the EPS-urine
samples compared to the urine samples (termed: prostate-enriched proteins).
The prostate tissue selectivity of the 49 identified protein signature was also evaluated at the
transcriptome level (Figure 2.11A). The proteins were mapped in the BioGPS database, and the
gene expression profiles for normal human prostate tissue were compared to 24 other major
normal human tissues, under the assumption that transcript levels measured by microarrays can
be correlated with protein abundance (270). Although it is well known that expression levels of
both biomolecules are not always concordant (269, 276, 277), our own experience suggests
reasonable correlation if the data is locked on the protein level (i.e. using only transcripts with
available proteins evidence) (269). This type of comparison can provide additional evidence for
prostate-enriched expression levels, similar to previous investigations (260). Genes with prostate
expression levels 2-fold above the median across all 25 tissues were considered to be prostate-
enriched. The distribution of this analysis is shown in Figure 2.11A and represents the expression
level of each gene coding for the 49 selected proteins in prostate tissue. Among the 49 selected
gene products identified several well-known prostate biomarkers, such as PSA (KLK3), MSMB
60
and PAP (ACPP) (highlighted in green), supporting the hypothesis that direct comparison of
urine and EPS-urine by semi-quantitative proteomics results in the identification of prostate-
enriched proteins suitable for further investigation in biomarker studies. According to this
analysis, 32 of the 49 informative genes are enriched in the prostatic tissue. The genes indicated
with red bars in the graph correspond to the seven proteins for which antibodies were obtained
and used to verify the MS-based proteomics data.
Next, a random sampling of the dataset was examined to address the question of chance selection
in arriving at the 49-protein candidate list. Proteins were randomly selected 10,000 times from
the entire list of 1022 proteins detected in EPS-urine in order to evaluate if a random selection
from a larger dataset of EPS-urine proteins yielded the same probability of selecting prostate-
enriched proteins. The boxplot shown in Figure 2.11B reports the comparison between the two
datasets. As expected, the shortlist of 49 selected proteins is enriched in prostate-specific
proteins when compared with a random assortment of 10,000 selections from the same EPS-
urine database. The median values of the two distributions tend to be lower in the random
sampling analysis (median = 0.6) compared to the selection (median = 2.2) with a highly
significant P-value ≤0.001.
61
62
Figure 2.11 Data mining to identify prostate-enriched proteins. (A) Comparison of the 49
prostate-enriched proteins to published human tissue transcriptomic data from the BioGPS gene
expression portal (261). On the X-axis gene names are shown and the Y-axis represents a scale
indicating the gene expression level in prostate as compared to 24 additional human tissues. The
Y-values were obtained as log2 [fold change] ratio of prostate gene expression level versus the
median value calculated for 25 major organs (listed in Materials and Methods). Green bars
highlight protein biomarkers already suggested as candidates for prostate cancer prognosis and
diagnosis (used as internal positive controls). Red bars highlight the 7 proteins that have been
validated in this study. The red line demarcates the 2-fold increased expression level in normal
prostate as compared to all other tissues. (B) Box plot shows prostate-enriched proteins in the 49
protein candidate list versus a random sampling of the entire 1022 EPS-urine protein dataset. The
Y-axis has the same log2 scale reported in (A), rectangles are bounded by the lower and upper
quartiles, the solid lines in the rectangles are the medians, the box whiskers extend to the
minimum to the maximum data point of the rectangle, and the circles represent outliers beyond
this range. *** p-value < 0.001.
2.3.3.3 Characterization of prostate-enriched proteins
To better characterize the list of prostate-enriched proteins, information available at the Human
Protein Atlas (262, 263) and UniProt databases (278) were used. As shown in Figure 2.12A, 38
proteins (78%) of all 49 have at least one antibody ID available at the Human Protein Atlas (262,
263) (Appendix Table 2.11), which was validated and used to generate tissue expression profiles.
In Figure 2.12A, the protein expression patterns based on the immunohistochemistry images
available in the database are indicated. The evaluation of these images, based on staining
intensity and protein distribution in the human normal prostate tissue, suggested that 45% of the
proteins positivity was found in the glandular epithelium, only 13% in the stromal cells and 29%
showed equal staining between the two compartments.
63
Figure 2.12 Additional annotations for prostate-enriched proteins. (A) Availability of
antibodies reported in the Human Protein Atlas database, for the 49 selected proteins. The
immunohistochemistry images were manually screened and protein distribution was annotated
(glandular cells or stroma). (B) Subcellular location of the 49 prostate-enriched proteins
according to the UniProt database.
Together, protein tissue distribution and cellular localization are parameters that can provide
important insights into the function of a protein. As expected, the respective subcellular patterns
reported from the UniProt database (278) were correlated (Figure 2.12B) with additional
information collected from other available resources (i.e. ProteinCenter, signal peptide
annotations) (Appendix Table 2.11). The majority (33%) of the proteins in the analyzed EPS-
urine samples were secreted (without overlap with the other categories) (Figure 2.12B). This
localization is consistent with the biological expectation that urine contains, by definition, many
extracellular proteins. Another significant percentage of proteins (22.5%) were localized to the
64
membrane compartment. Interestingly, although the EPS-urine is not enriched in intracellular
proteins (18.4%), almost half (44%) of this category consist of lysosomal proteins. This probably
points out that exosome formation is the dominant excretion pathway in urine and reflects the
biological and physiological role of these proximal fluids, through the presence of specific
transport pathways for lysosomal proteins.
2.3.3.4 Generation of candidate shortlist
The analyses suggest an enrichment of prostate-specific proteins in the EPS-urines compared to
the urine samples. In particular, the list of 49 proteins (Appendix Table 2.11) encompasses a
number of prostate specific markers that are currently used or have been previously considered
as potential candidates for prostate cancer screening; among these are: MSMB (279), PSA (280,
281), PAP (reported in 1938 as the first serum biomarker for prostate cancer (282, 283),
TMPRSS2 (284), and PSMA (285).
A small verification set, aimed at validating the findings from the proteomic investigations, was
selected out of the 49 differentially expressed proteins. Selection was largely based on antibody
availability; but other unbiased considerations were tissue expression patterns of candidates
across prostate cancer and normal prostate tissues available through the Human Protein Atlas
(262, 263), tissue specificity based on mRNA microarray data (see above), subcellular
localization by GO analysis, and in-house proteomic datasets generated from urines and prostatic
fluids from various conditions. This led to a shortlist of 7 proteins that were assayed in the
verification stage.
2.3.3.5 Verification of proteomic data
In order to verify the proteomic data, Western blot analyses were carried out for each of the 7
candidates in pooled EPS-urines and urines from prostate cancer and noncancer patients. The
panel in Figure 2.13A shows that each of the shortlisted candidates are more abundant in EPS-
urine samples compared to the matching urines, supporting results from the MudPIT analyses. In
particular, comparing noncancer and prostate cancer EPS-urines showed the downregulation of
TGM4, LTF, ANPEP, MME and TIMP1, and upregulation of PARK7 and 14-3-3σ (Figure
2.13A), which correlated with the proteomic data analyses. These preliminary results also follow
a similar trend when looking at immunohistochemical staining patterns in normal and neoplastic
65
prostatic tissues via Human Protein Atlas (262, 263), but will require further verification in large
unrelated EPS-urine cohorts in the future (Figure 2.13B).
Figure 2.13 Validation of mass spectrometry data for seven selected proteins. (A) Western
blot analyses of the seven candidate proteins confirmed the proteomic data (based on SpC)
showing an enrichment of each selected protein in the EPS-urine compared to the urine
samples. (B) Immunohistochemistry images obtained from Human Protein Atlas database
showing the differential expression in normal and cancerous prostate glandular tissue of the
seven selected proteins. The same trend was obtained by MS and Western blot analysis. (U_NC:
66
urine non-cancer; EPS-U_NC: EPS-urine non-cancer; U_Ca: urine cancer; EPS-U_Ca: EPS-
urine cancer).
2.4 Discussion
The cumulative list of identified proteins in the EPS samples currently represents the largest
compilation resource of proteins present in prostatic fluid secretions from men with confirmed
prostate cancers. In comparison to other proteomics studies aimed at discovering prostate
function or prostate cancer biomarkers using cell secretions (268), plasma (250, 286), or tissue
(287, 288), the studies presented in this chapter used a relevant organ-proximal body fluid for the
detection of secreted proteins in vivo. One goal of the studies presented in this chapter was to
provide a high-quality, well-annotated resource of proteins present in EPS. This could improve
our understanding of general prostate biology and guide the discovery of novel prostate cancer
biomarkers. While commercial genetic assays have been developed using EPS-urines as a source
of prostate-derived genetic material (257, 289, 290), an in-depth proteomic analysis of this fluid
has not been reported. In contrast to direct-EPS (260), which is likely to contain prostate-secreted
proteins at a higher concentration, EPS-urine is highly diluted by a dynamic and variable urine
background. Nevertheless, EPS-urine samples are clinically more relevant since they can be
obtained by routine DRE examination, and can be collected repeatedly for longitudinal sample
collection relevant to active surveillance monitoring of men with prostate cancer. By directly
comparing pre- and post-DRE samples, a number of proteins that are enriched in EPS-urines
compared pre-DRE urines were identified.
Current early screening for prostate cancer relies on PSA detection in serum (291). Over time,
this screening has dramatically reduced the presentation of high grade disease, yet other studies
have indicated that the majority of early detected prostate cancers are not lethal and may be
considered clinically insignificant (292-294). Thus, current methods to establish the risk of
progression and prognosis of prostate cancers remain suboptimal, and a large number of patients
are over treated with a significant negative impacts on patients and on health care (292, 293).
Thus, new diagnostics are needed that could predict disease course and discriminate indolent
tumors from aggressive tumors, allowing for more appropriate treatment options to be offered to
newly diagnosed prostate cancer patients. Targeting identification of potential new biomarkers in
67
proximal fluids of the prostate like direct-EPS and post-DRE EPS-urines could address these
clinical needs.
With the availability of modern mass spectrometers and high performance peptide separations
like MudPIT (203, 205), we are now able to routinely identify hundreds to thousands of proteins
in complex biological samples. This can result in a significant challenge for the standard
biomarker validation strategy, ELISA, which is time-consuming and costly to develop. However,
MS-based assays, such as selected reaction monitoring mass spectrometry (SRM-MS), can
facilitate the rapid verification of candidates prior to ELISA development. An additional
challenge is the selection of the most likely candidate biomarkers from the large list of identified
proteins. To guide the selection criteria of promising candidate proteins involved in prostate
function and prostate cancer biomarker discovery, the EPS proteome was integrated with a large
variety of publicly available datasets. To further streamline SRM-MS assay development, the
entire MS data has been deposited to the Tranche database for others to mine, as previously
reported (269, 295, 296). Cumulatively, this EPS proteome dataset should facilitate the selection
and confirmation of individual biomarker candidates in follow up studies.
2.5 Materials and methods
2.5.1 Materials
Ultrapure-grade urea, ammonium acetate, ammonium bicarbonate, Tris, and calcium chloride
were from BioShop Canada Inc. (Burlington, ON, Canada). Ultrapure-grade iodoacetamide,
DTT and formic acid were obtained from Sigma-Aldrich. HPLC-grade solvents (methanol,
acetonitrile and water) were obtained from Thermo Fisher Scientific (San Jose, CA).
Trifluoroacetic acid was from J.T. Baker (Phillipsburg, NJ, USA). Mass spectrometry-grade
trypsin gold was from Promega (Madison, WI, USA). Solid-phase extraction C18 MacroSpin™
Columns were from The Nest Group, Inc. (Southboro, MA, USA).
2.5.2 Sample collection and properties
All samples were collected from patients following informed consent and use of Institutional
Review Board approved protocols at Urology of Virginia, Sentara Medical Group and Eastern
Virginia Medical School between 2007 and 2009, along with the Research Ethics Board of the
68
University Health Network (Table 2.1). Tumor grades, staging, prostate volumes and tumor
volumes (297) were determined using standard pathological procedures. Patient information was
recorded, including demographics, medical history, pathology results, and risk factors, and stored
in a Caisis database system. All personal information or identifiers beyond diagnosis and lab
results were not available to the laboratory investigators.
Direct-EPS fluids were obtained by vigorous prostate massage from subjects under anaesthesia
just prior to prostatectomy as a result of a prostate cancer diagnosis. Each sample (0.2-1 ml) was
collected from the tip of the penis in a 10 ml tube, diluted with saline to 5 ml, and stored on ice
until transport to the biorepository for processing (within 1 hour of collection). For processing,
any particulate material was removed by low-speed centrifugation. Aliquots (1 ml) of the
supernatant were stored at -80oC.
EPS-urine samples were collected by performing a gentle massage of the prostate gland during
DRE prior to biopsy, as previously described (260). The massage consisted of three strokes on
each side of the median sulcus of the prostate and the expressed fluid from the glandular network
of the prostate was subsequently voided in urine.
To generate sample pools from noncancer and cancer patients, 10-20 ml of urine and EPS-urine
were collected from the same individual an hour before the DRE massage, herein denoted as
urine, and after DRE, herein denoted as EPS-urine. Samples were pooled in order to generate
sufficient material for analyses. Urine and EPS-urine from a group of 5 patients with prostate
cancer and 5 biopsy negative, noncancer individuals (Table 2.3), were pooled together to
generate a sample panel comprising 4 different conditions: U_NC: urine noncancer; EPS-U_NC:
EPS-urine noncancer; U_Ca: urine cancer; EPS-U_Ca: EPS-urine cancer. After collection,
samples were stored on ice for no longer than 1 hour. Each sample was aliquoted and stored at -
80°C until use.
Individual EPS-urines were obtained from an independent cohort of 11 different patients: 5 with
low-grade prostate cancer and 6 with biopsy negative benign conditions (BPH) (Table 2.2).
Following collection, 9 ml of EPS-urine was centrifuged at 14,000 g to remove the cell
pellet/sediment. The supernatant was recovered and concentrated using an Amicon Ultra-15
Centrifugal Filter (3 kDa cutoff; Millipore, Billerica, MA, USA) according to the manufacturer’s
69
instructions. Approximately 500 μl of each concentrated EPS-urine sample was recovered from
the filter device and stored at −80°C until use.
2.5.3 Protein digestion and sample preparation
Direct-EPS fluid corresponding to 100 µg of total protein (as measured by Bradford assay) were
concentrated to ~ 50 µl using an Amicon Ultra spin filter (3kDa cut-off; Millipore) followed by
digestion as stated below. Briefly, concentrated EPS corresponding to 100 µg of total protein
was diluted with an equal volume of 8M urea, 4mM DTT, 50mM Tris-HCl, pH 8.5 and
incubated at 37ºC for 1 hour, followed by carbamidomethylation with 10mM iodoacetamide for
1 hour at 37ºC in the dark. Samples were diluted with 50mM ammonium bicarbonate, pH 8.5 to
~ 1.5M urea. Calcium chloride was added to a final concentration of 1mM and the protein
mixture digested with a 1:30 molar ratio of recombinant, proteomics-grade trypsin at 37ºC
overnight. The resulting peptide mixtures were solid phase-extracted with C18 MacroSpin
Columns from The Nest Group, Inc. (Southboro, MA, USA). Samples were concentrated by
speedvac and stored at -80ºC until used for MudPIT analysis.
Urine and EPS-urine were quantified using a NanoDrop™ 2000 spectrophotometer (Thermo
Fisher Scientific, San Jose, CA) and volumes corresponding to 100 μg of total protein for the
pools and 150 μg of total protein for the individual EPS-urines were resuspended in 50 μl of 8 M
urea, 2mM DTT, 50 mM Tris-HCl, pH 8.5, and incubated at 37 °C with constant shaking for 30
min. Carbamidomethylation was performed by incubating samples with 8 mM of iodoacetamide
for 30 min at 37 °C in the dark. Samples were then diluted to approximately 1.5 M urea using
100 mM ammonium bicarbonate, pH 8.5. Calcium chloride was added to a final concentration of
2 mM and the protein mixture was digested with trypsin (1:40 trypsin to protein ratio) at 37 °C
overnight. The digested peptide mixture was purified with C18 MacroSpin™ columns and
concentrated by vacuum centrifugation and reconstituted to a volume of 40 μl with 0.1 % formic
acid. Samples were stored at -80 °C until used for MudPIT analysis.
2.5.4 MudPIT analyses
Direct-EPS samples
70
A fully automated 9-cycle two-dimensional chromatography sequence was set up as previously
described (205). Peptides were loaded on a 7cm pre-column (150 µm i.d.) containing a Kasil frit
packed with 3.5 cm 5µ Magic C18 100 Å reversed phase material (Michrom Bioresources)
followed by 3.5 cm Luna® 5µ SCX 100Å strong cation-exchange resin (Phenomenex, Torrance,
CA). Samples were automatically loaded from a 96-well microplate autosampler using the
EASY-nLC system (Proxeon Biosystems, Odense, Denmark) at 3 µl/minute. The pre-column
was connected to an 8cm fused silica analytical column (75 µm i.d.; 5μ Magic C18 100 Å
reversed phase material (Michrom Bioresources)) via a microsplitter tee (Proxeon Biosystems) to
which a distal 2.3 kV spray voltage was applied. The analytical column was pulled to a fine
electrospray emitter using a laser puller. For peptide separation on the analytical column, a
water/acetonitrile gradient was applied at an effective flow rate of 400 nl/minute, controlled by
the EASY-nLC system. Ammonium acetate salt bumps (8µl) were applied at 0%, 5%, 10%,
15%, 20%, 25%, 30%, 40%, and 100% using the 96-well microplate autosampler at a flow-rate
of 3 µl/minute in a vented-column set-up.
All direct-EPS samples were analyzed on a LTQ-Orbitrap XL. The instrument method consisted
of one MS full scan (400-1800 m/z) in the Orbitrap mass analyzer, an AGC target of 500,000
with a maximum ion injection of 500 ms, 1 μscan and a resolution of 60,000 and using the
preview scan option. Five data-dependent MS/MS scans were performed in the linear ion trap
using the five most intense ions at 35% normalized collision energy. The MS and MS/MS scans
were obtained in parallel. AGC targets were 10,000 with a maximum ion injection time of 100
ms. A minimum ion intensity of 1,000 was required to trigger a MS/MS spectrum. The dynamic
exclusion was applied using a maximum exclusion list of 500 with one repeat count with a repeat
duration of 30 seconds and exclusion duration of 45 seconds.
Urine and EPS-urine samples
Individual EPS-urines (5 prostate cancer and 6 BPH) were analyzed in triplicate using the same
9-cycle MudPIT procedure as described above. A quaternary HPLC pump was interfaced with a
linear ion-trap mass spectrometer (LTQ, Thermo Fisher Scientific, San Jose, CA) equipped with
a nanoelectrospray source (Proxeon Biosystems, Odense, Denmark). The pooled urine and EPS-
urine samples were analyzed in triplicate on a LTQ Orbitrap XL, using a modified 5-cycle
MudPIT, as previously described (205).
71
2.5.5 Protein identification and data analyses
Direct-EPS samples
Raw data was converted to m/z XML using ReAdW and searched by X!Tandem against a locally
installed version of the human UniProt complete human proteome protein sequence database
(version 2009_11; 20,323 sequences). The search was performed with a fragment ion mass
tolerance of 0.4 Da, a parent ion mass tolerance of ±10 ppm. Complete tryptic digest was
assumed. Carbamidomethylation of cysteine was specified as fixed, and oxidation of methionine
as variable modification. A target/decoy search was performed to experimentally estimate the
false positive rate and only proteins identified with two unique high quality peptide
identifications were considered as previously reported (297) (1 decoy protein identified; FDR
<0.5%). An in-house protein grouping algorithm was applied to satisfy the principles of
parsimony.
Urine and EPS-urine samples
The following parameters were applied according to the instrument used:
LTQ analyses of individual EPS-urines
The search was performed with a fragment ion mass tolerance of 0.4 Da and a parent ion mass
tolerance of 4 Da. Complete tryptic digest was assumed. Carbamidomethylation of cysteine was
specified as fixed and oxidation of methionine as a variable modification. Only proteins
identified with two unique high-quality peptide identifications per triplicate were considered, as
previously reported (269, 296, 298) (11 decoy proteins identified; FDR ~1%). Each sample (n =
11) was examined by 3 technical replicates (33 total MudPIT analyses).
LTQ-Orbitrap XL analyses of pooled urines and EPS-urines
The search was performed with a fragment ion mass tolerance of 0.4 Da and a parent ion mass
tolerance of ±10 ppm. Complete tryptic digest was assumed. Carbamidomethylation of cysteine
was specified as fixed and oxidation of methionine as variable modification. Only proteins
identified with two unique high quality peptide identifications per analyzed sample were
72
considered, as previously reported (205, 296) (2 decoy proteins identified; FDR ~0.5%). Each
sample pool was analyzed by ≥3 technical replicates (13 total MudPIT analyses).
Protein relative abundance was calculated using the QSpec algorithm (269, 296, 298). Proteins
were considered to be up-regulated in the pooled EPS-urine samples versus the urine samples if
they complied with the following parameters: false discovery rate (FDR) <0.05 and fold change
(FC) ≥ 2, based on the QSpec algorithm (241, 295).
2.5.5.1 Gene Ontology annotation and data comparisons
Direct-EPS
Annotation mappings (i.e. GO, predicted transmembrane domains, signal peptides etc.) were
performed using the ProteinCenter bioinformatics software (Proxeon Biosystems, Odense,
Denmark). Comparison of direct-EPS detected proteins against other proteomic datasets
(prostatic secretions (275), seminal plasma (275), urine (264), condition media (267)) was
accomplished by ProteinCenter. Proteins were sequence aligned (i.e. BLASTed) against each
other and only proteins with at least 95% sequence identity were considered to match (i.e.
protein clusters). Comparison to the Human Gene Atlas (mRNA microarray) (124) and the
Human Protein Atlas (268) was accomplished via gene mapping using an in-house relational
database. For the comparison to data in the Human Protein Atlas, only proteins with available
IHC staining in normal or prostate cancer tissues were included. The staining criteria used for
antibody selection were directly adapted from the Human Protein Atlas website and are based on
rigorous multi-step antibody quality control criteria established by this consortium. Only gene
products with a medium to high IHC scoring intensity in either normal prostate or prostate
cancer tissue were used. Detailed information regarding the quality assurance can be found at
(http://www.proteinatlas.org/qc.php). Cluster analyses (Euclidean) were performed using the
open-source tool Cluster 3.0 (http://bonsai.ims.u-
tokyo.ac.jp/mdehoon/software/cluster/software.htm) and heat maps were displayed using
TreeView (http://jtreeview.sourceforge.net).
Urine and EPS-urine
73
Functional annotations (Gene Ontology terms, KEGG pathways, and Swiss Prot entries) were
assigned using the Database for Annotation, Visualization and Integrated Discovery (DAVID,
bioinformatics resources v6.7; http://david.abcc.ncifcrf.gov/) (265). Unique proteins detected in
the EPS-urine dataset were compared to the UniProt database and the top five significantly over-
represented categories were reported (p-value <0.001). Comparisons of the present EPS-urine
dataset to urine (266) and direct-EPS (299) datasets was accomplished using ProteinCenter
(Proxeon Biosystems, Odense, Denmark). Proteins were sequence-aligned against each other and
only proteins with at least 95% sequence identity were considered a match (i.e. protein clusters).
2.5.6 Prostate-enriched proteins characterization
The BioGPS portal (http://biogps.org/) (124) was used to map identified proteins against
available mRNA microarray datasets. Twenty-five major organ systems among those available in
BioGPS were selected and linked to proteins via gene accessions. The expression level for each
gene was based on averaged probe intensities, and the significant enrichment in prostate tissue
(>2 fold change) was calculated as a log2 ratio compared to the other selected tissues. The
random sampling analysis was carried out using the unpaired one-tailed Student`s t-test. A p-
value ≤0.05 was considered statistically significant. The alphabetical order of the selected organs
is as follows: bone marrow, colon, heart, hypothalamus, kidney, liver, lung, lymph node, ovary,
pancreas, placenta, prostate, salivary gland, skeletal muscle, skin, small intestine, smooth
muscle, spinal cord, testis, thalamus, thymus, thyroid, uterus, whole blood, and whole brain.
The UniProt database (260) was used to assign subcellular localization to the 49 proteins
enriched in EPS-urine pooled samples. The annotations were manually reported and the 49
proteins were grouped into three main categories: secreted, membrane, and intracellular (which
includes cytoplasmic, nuclear, and lysosomal). Identified proteins were also screened against the
Human Protein Atlas database (260) for availability of antibodies and to examine their prostate
tissue expression patterns.
74
2.5.6.1 SDS-PAGE and Western blot analysis on urine and EPS-urine
pools
For Western blotting, 40 μg of total proteins were separated on 8 or 10% SDS-PAGE gels and
blotted on PVDF membranes (0.2 μm; Bio-Rad Laboratories, Hercules, CA). Membranes were
blocked with 5% milk in TBS-Tween (0.2%) for 1 hour at room temperature and subsequently
incubated overnight at 4°C with the following primary antibodies: anti-Lactoferrin (1:1000
#ab10110; Abcam, Cambridge, UK), anti–CD10 (MME 1:1000 #ab951; Abcam, Cambridge,
UK), anti–TIMP1 (1:2000 #RP1-TIMP1; Triple Point Biologics, Forest Grove, OR), anti–CD13
(ANPEP 1:500 #ab7417; Abcam, Cambridge, UK), anti–TGM4 (1:500 #sc55791; Santa Cruz
Biotechnology, Santa Cruz, CA), anti–14-3-3σ (1:250 #ab14123; Abcam, Cambridge, UK), and
anti–PARK7 (1:1000 #ab11251; Abcam, Cambridge, UK). After three 10-minute washes with
TBS-Tween (0.2%), membranes were incubated with anti-mouse/anti-rabbit/anti-goat IgG-HRP
secondary antibody (Invitrogen, Carlsbad, CA) at a dilution of 1:25,000 for 1 hour at room
temperature, washed and visualized with the SuperSignal West Pico chemiluminescent substrate
(Thermo Fisher Scientific, San Jose, CA).
75
Chapter 3
Identification of candidate biomarkers of aggressive prostate
cancer
Portions of this chapter have been reformatted from the original publication: Kim, Y.,
Ignatchenko, I., Yao, C.Q., Kalatskaya, I., Nyalwidhe, J.O., Lance, R.S., Gramolini, A.O.,
Troyer, D.A., Stein, L.D., Boutros, P.C., Medin, J.A., Semmes, O.J., Drake, R.R., Kislinger, T.
(2012). Identification of differentially expressed proteins in direct expressed prostatic secretions
of men with organ-confined versus extracapsular prostate cancer. Molecular Cellular Proteomics
11:1870.
3 Identification of candidate biomarkers of aggressive
prostate cancer
3.1 Abstract
Current protocols for the screening of prostate cancer cannot accurately discriminate clinically
indolent tumors from more aggressive ones. One reliable indicator of outcome has been the
determination of organ-confined (OC) versus non-OC disease but even this determination is
often only made following prostatectomy. This underscores the need to explore alternate avenues
to enhance outcome prediction of prostate cancer patients. Fluids that are proximal to the
prostate, such as expressed prostatic secretions (EPS), are attractive sources of potential prostate
cancer biomarkers as these fluids likely bathe the tumor and accumulate disease related factors.
Direct-EPS samples from 16 individuals with extracapsular (EC, n = 8) or OC (n =8) prostate
cancer were used as a discovery cohort, and analyzed in duplicate by 9-step multidimensional
protein identification technology (MudPIT) on a LTQ-Orbitrap XL mass spectrometer. A total of
624 unique proteins were identified by at least two unique peptides with a 0.2% false discovery
rate (FDR). A semi-quantitative spectral counting algorithm identified 133 significantly
differentially expressed proteins in the discovery cohort. Integrative data mining prioritized 14
candidates, including two known prostate cancer biomarkers: prostate specific antigen (PSA) and
prostatic acid phosphatase (PAP), which were significantly elevated in the direct-EPS from the
76
OC cancer group. These and five other candidates (SFN, MME, PARK7, TIMP1, and TGM4)
were verified by Western blotting in an independent set of direct-EPS from patients with
biochemical recurrence (BCR) (n = 5) versus patients with no evidence of recurrence upon
follow-up (n =10). Lastly, we performed proof-of-concept selected reaction monitoring mass
spectrometry (SRM-MS)-based relative quantification of the five candidates using unpurified
heavy isotope-labeled synthetic peptides spiked into pools of EPS-urines from men with EC and
OC prostate tumors. This study represents the first efforts to define the direct-EPS proteome
from two major subclasses of prostate cancer using shotgun proteomics and verification in EPS-
urine by SRM-MS.
3.2 Introduction
Prostate cancer is the most common malignancy to affect men in the Western world, but only
15–20% of these men will present with aggressive, lethal disease (15, 300) whereas the majority
of patients will die of other causes. Although the implementation of large-scale screening for
prostate cancer using serum PSA has dramatically improved early detection of disease,
unnecessary biopsies and patient overtreatment are becoming increasingly evident (16, 300).
Consequently, there has been a shift in emphasis away from detection of prostate cancer and
toward identification of lethal disease. Currently, Gleason grading is considered to be one of the
best outcome predictors; however, patients with Gleason 7 tumors are in the clinical “gray zone,”
whereby the predictive ability of Gleason grading is mixed (72, 145). A recent study constructed
a 157-gene signature based on the comparison of Gleason score <6 and 8 patients, and could
show that their panel could predict lethality in the cohort of Gleason 7 patients (145).
Nonetheless, the development and large-scale implementation of prognostic markers of prostate
cancer has been hampered by numerous factors owing, in part, to the heterogeneous and
multifocal nature of the disease (2). Although the widely used Gleason grading system attempts
to control for heterogeneity of the glands and multifocality of cancerous lesions by summing the
2–3 most commonly observed histological patterns via inspection of multiple (typically 8 –12)
core biopsies, cancerous foci are still often missed (2, 300) providing only partial information
leading to imprecise diagnoses and prognoses. Pathologic staging remains the gold standard for
disease staging and risk assessment (301, 302); however, this process lacks timeliness in
discriminating OC from EC disease. Indeed, one-third of individuals with non-OC disease are
77
identified only after surgery (303). Furthermore, 35% of men treated with radical prostatectomy
(RP) with curative intent subsequently develop BCR (304-307) and the mean time from surgery
to recurrence is 3.5 years (72). Significant risk factors for time to prostate specific mortality
following BCR after RP are PSA doubling time, pathological Gleason score, and time from
surgery to BCR (72). Estimates place the percent of lethal cases at 20–25% of all patients that
show BCR, suggesting that nearly 75– 80% of patients in this group may be over treated (69).
There is an emerging trend toward recruitment of men with perceived low-risk disease to an
“active surveillance” monitoring approach. This is based on the supposition that most prostate
cancers are slow growing, and that the more aggressive forms can be identified during a period
of observation with little increased risk of death. Although a consensus may not exist for
defining the disease stage where active surveillance is warranted, there is considerable agreement
that men who have a PSA level less than 10 ng/ml, impalpable disease (clinical stage T1c) and
only 1 biopsy core out of 12 or more that show Gleason 6 cancer are most likely to harbor
indolent disease (308). Even so, these candidates for active surveillance will still contain
individuals who will have disease progression and die from their cancer. Thus, despite efforts to
recruit individuals to active surveillance protocols, overtreatment of prostate cancer is fueled by
the lack of reliable means to accurately discriminate between men with clinically indolent
prostate cancer from those with more aggressive disease (309, 310). This inability to accurately
predict prostate cancer aggressiveness based solely on standard clinicopathologic features clearly
underscores the need to explore the ability of additional biomarkers to enhance outcome
prediction for men with prostate cancer. Furthermore, it is important to acknowledge that a single
biomarker alone is unlikely to have sufficient prognostic power; rather, the integration of a panel
of biomarkers hold the promise for improved prostate cancer detection and prognosis (300).
Fluids that are proximal to the prostate are attractive sources of potential prostate cancer
biomarkers (255, 300), as they house secreted proteins and sloughed cells that provide a
presumably more comprehensive assessment of the organ and extent of disease. Further, fluids
such as urine are clinically favorable for their ease of collection, the volume and frequency at
which they can be obtained, and their adaptability to routine clinical assays. Prostate-proximal
fluids include seminal fluid, semen, and EPS. Here, we focus on the analysis of EPS as our
biological specimen, using direct-EPS samples for the discovery of candidate prognostic
biomarkers and both direct-EPS and pooled EPS-urines derived from independent sets of patients
78
for candidate biomarker verification. Direct-EPS is a prostatic fluid that is collected from
patients undergoing prostatectomy by massaging the organ and expelling 0.5–1 ml of the fluid
just prior to surgical removal. It was chosen as our discovery fluid as it is expected to house
prostate-secreted proteins at a higher concentration and purity, and we have developed a
workflow for the in-depth proteomic analysis of this fluid (260). Following discovery proteomics
in 16 clinically stratified direct-EPS samples, verification studies were performed using
independent sample sets of direct-EPS. Next, we focused our attention on the verification and
quantitative analysis of candidate proteins in pooled EPS-urines. Before EPS-urine collection,
men undergo digital rectal examination (DRE), often as part of a routine procedure, which causes
direct-EPS to be expelled from the prostate and subsequently voided in urine. Because EPS-urine
can be collected with substantial ease and in greater volumes and frequencies than direct-EPS,
much attention has been paid to this fluid as a valuable resource of prostate cancer biomarkers
amenable to routine clinical analysis. Following the recent FDA approval of the EPS-urine assay
for prostate cancer gene 3 (PCA3), standardized clinical collection protocols will be widely
implemented and easier access to this fluid is expected. Moreover, we have recently identified a
number of prostate-enriched proteins in EPS-urine by comparing its proteome to a urine
background (127).
The present study used MudPIT coupled with bioinformatics to catalog and comparatively
analyze the direct-EPS proteomes from a small cohort of patients with EC versus OC prostate
cancers. A semi-quantitative algorithm based on spectral counts (QSpec) (275) and an integrative
data mining strategy led to the selection of a number of putative biomarkers that were verified by
Western blotting in direct-EPS. Lastly, to demonstrate accurate quantitative measurements of
verified candidates in EPS-urine, a pilot study utilizing SRM-MS was undertaken as a proof-of-
concept.
3.3 Results
3.3.1 Proteomic characterization of expressed prostatic secretions from
organ-confined and extracapsular prostate cancers
An overview of the study approach is illustrated in Figure 2.1. Direct-EPS derived from 16
individuals who were clinically stratified as having EC (n = 8) or OC (n = 8) prostate tumors
79
(Table 3.1) were digested in-solution and analyzed by a 9-step MudPIT on a LTQ- Orbitrap XL
mass spectrometer, in duplicate (32 MudPITs).
Figure 3.1 Workflow of study design from the discovery of differentially expressed proteins
to verification of candidates in prostatic fluids.
80
Table 3.1 Clinical information of patients used for discovery proteomics.
A total of 624 unique proteins were identified by at least two unique peptides with a 0.2% FDR
(1 total reverse proteins) (Figure 3.2, 3.3, Appendix Tables 3.1 and 3.2). Among these, 78
proteins (13%) were only identified in the EC group and 216 (34%) were only identified in the
OC group, while the majority (330 proteins, 53%) were shared by both (Figure 3.4). A
substantial overlap of our data set with other published prostate-proximal and urine proteomic
data sets was observed (Figure 3.5). Furthermore, a large proportion of proteins had at least one
predicted transmembrane domain (TMD) and signaling peptide (SP) sequences (Figure 3.6).
Proteins in this fluid spanned a wide range of functional roles, as determined via Gene Ontology
(GO) enrichment (Figure 3.7). A number of Gene Ontology terms were significantly enriched in
both tumor groups. They were implicated in proteolysis, regulation of programmed cell death,
and cellular proliferation in the biological process category, as well as peptidase activity and
cytoskeletal protein binding in the molecular function category. Notably, many proteins were
localized in the extracellular region, as indicated by their over- representation in Gene Ontology.
81
Figure 3.2 Average number of unique proteins identified in the duplicate MudPIT runs for
discovery direct-EPS samples.
Figure 3.3 Total number of peptides (A) and total number of spectra (B) recorded for each
individual direct-EPS analyzed in the discovery phase.
82
Figure 3.4 Venn diagram depicting the total number of proteins exclusively expressed in
the EC or OC groups.
Figure 3.5 The current proteomics dataset compared to prostate-related proteomics data
from various reports.
83
Figure 3.6 Proportion of proteins with predicted SPs and TMDs.
Figure 3.7 Gene Ontology enrichment of the EC and OC direct-EPS.
84
3.3.2 Semi-quantitative comparison of expressed prostatic secretions
from extracapsular and organ-confined prostate cancers
To identify differentially expressed proteins in both patient groups we employed the QSpec
algorithm (275). Only proteins with a FDR of < 0.05 and at least a two-fold change in
normalized spectral abundance factor between the EC and OC groups for each protein were
considered. This analysis resulted in 133 differentially expressed proteins for further
consideration (Figure 3.8), and 100 (77%) of these were present in a functional interaction (FI)
network.
Figure 3.8 Distribution of all 624 proteins identified in the current direct-EPS data set. Blue
and red dots represent the 133 differentially expressed proteins based on a fold change of at least
2 and a FDR of ≤0.05 by QSpec analysis (red dots are proteins up-regulated in the EC tumor
85
group, blue dots are significantly down-regulated in the EC tumor group, with the 14 shortlisted
candidates indicated by their names).
Following extraction of these proteins from the FI network and the addition of linker genes, a
proteomics-associated prostate cancer network consisting of 161 genes, 61 of which are linkers,
was obtained. To decompose this network into smaller independent pieces and run pathway
annotation for every cluster separately, we used network community analysis to automatically
identify network modules that contain genes and their co-regulators that are involved in common
biological processes with high probability. We identified five modules of at least 10 proteins and
pathway enrichment analysis was done for each cluster separately. The first module was
significantly enriched in the “Glycolysis/ gluconeogenesis” pathway; the second one was
mapped to the cell cycle-related process “G2/M transition”; the functional annotation of the third
and fifth modules showed that they were significantly enriched in hemostasis processes like
“Complement and coagulation cascades” and “Further platelet releasate” and others (Figure 3.9,
2.10, and Appendix Table 3.3). The fourth module was not significantly associated with any
known processes or pathways.
86
Figure 3.9 Two significantly enriched pathways within the 133 differentially expressed
proteins are involved in hemostatic and immunological processes. Pink circles represent
proteins up-regulated in the EC tumor group; blue circles represent proteins down-regulated in
the EC group; orange triangles represent linkers.
87
Figure 3.10 Additional modules enriched by pathway analysis.
3.3.3 Generation of candidate shortlist
To select proteins for verification in a small, independent cohort of direct-EPS samples, we
performed a systematic data mining strategy using in-house and publicly available resources to
minimize the panel of 133 proteins to a refined set (Figure 3.11).
88
Figure 3.11 Protein prioritization scheme for the selection of candidates for verification.
Proteins were ranked initially based on six features by assigning a +1 value to a given protein
with the following annotations (0 if the term could not be assigned): 1) present by MS in six out
of eight samples corresponding to the risk group that the candidate was found to be higher in by
spectral counting; 2) overexpression of candidate genes in the prostate as demonstrated by at
least a threefold above median mRNA expression in normal prostate tissue compared with 22
different normal human tissues; 3) presence of predicted TMDs; 4) presence of predicted SP
sequences; 5) cellular localization assignment to “cell surface” (GO:0009986) and
“extracellular” (GO:0005576) by Gene Ontology annotation; and 6) cancer-associated protein as
available from the Human Protein Atlas database. The rationale for each of the key features is as
follows: as individual biological specimens can have a high level of variation, we set the criteria
89
that a candidate protein should have been detected in at least six out of eight samples in its
associated risk group. This would minimize the inclusion of proteins that are over represented by
large numbers of spectra in just a small number of samples. The integration of mRNA
microarray profiles from normal prostate and other human tissues with our proteomic data set
was performed with the rationale that the most promising biomarker candidates would be
preferentially expressed in the prostate, and thus may have an important functional role in the
organ that contributes to the outcome of the patient. Microarray data was therefore solely used to
select proteins with a likely tissue-selective expression pattern. We also took into consideration
the presence of predicted TMDs and SPs, as well as Gene Ontology annotations linked to the cell
surface and extracellular region as they are characteristics of secreted and plasma membrane
proteins, found in many currently known biomarkers. Sixty proteins had at least three out of six
features; 32 of which are cancer- associated proteins (253). Because of limitations in antibody
availability, a small subset of proteins (14 candidates, Table 3.2) were chosen for verification.
Also included in this list were two well-defined prostate cancer biomarkers, PSA and PAP, their
levels were also evaluated by ELISA in a different set of 14 direct-EPS samples. As shown in
Figure 3.12, the PSA and PAP concentrations determined by ELISA and Western blotting
yielded a similar distribution as the MS analysis, with an elevated level of expression being
observed in the OC group compared with the EC group. Further verification of the expression of
a subset of the short-listed candidate proteins was carried out in new sets of EPS fluids.
Table 3.2 Fourteen candidates selected by data mining scheme.
90
Figure 3.12 Measurement of PSA and PAP in direct-EPS. Samples were obtained from
patients with EC and OC prostate tumors in the discovery cohort by MS (reported as normalized
spectral abundance factors, NSAF) and in a new set of samples by ELISA (reported as μg/ml;
**p-value ≤0.01; *p-value ≤0.05).
3.3.4 Verification of differentially expressed candidates in direct-EPS
For the verification of the shortlisted candidates, we chose a small cohort of direct-EPS samples
that could be linked with clinical outcome information related to whether the patient had BCR or
91
no evidence of recurrence within a two-year period post-prostatectomy (Table 3.3). Using these
samples, the expression levels of candidates the verification set were evaluated by Western
blotting.
Table 3.3 Clinical information of patient samples used for Western blotting verification.
Five out of the 14 the candidates were successfully verified to correlate with the discovery data
(Figure 3.13, 3.14). These are: stratifin (SFN), membrane metalloendopeptidase (MME),
Parkinson protein 7 (PARK7), tissue inhibitor of metalloproteinase 1 (TIMP1), and
transglutaminase 4 (TGM4). For SFN, QSpec analysis of the discovery data in direct-EPS
revealed that it was three-fold higher in the OC group with a FDR of <0.002. In the BCR and
nonrecurrent samples, the level of SFN expression was significantly higher in the nonrecurrent
group (p = 0.03). For MME, it was found to be two-fold higher in the OC proteome with a FDR
of <0.002. In the comparison between BCR and nonrecurrent samples, MME expression was
found to be significantly higher in expression in the nonrecurrent group compared with BCR
patients (p = 0.05). In addition, we analyzed EC (n = 23) and OC (n = 19) direct-EPS samples for
the expression of MME using a commercially available ELISA, and found a similar trend in this
new set of samples (p = 0.07) (Figure 3.14B). PARK7 had a threefold difference by QSpec with
a FDR of <0.002, with higher expression being observed in the OC group. Similarly, this protein
was found to be significantly higher in the nonrecurrent group compared with the BCR group (p
= 0.008). Both proteins TIMP1 and TGM4 were three-fold different in expression in the
92
discovery sample set, whereby their expression was higher in the OC group with a FDR of
<0.002. Upon subsequent analysis in BCR and nonrecurrent patient direct-EPS samples, a
similar trend was observed with heightened expression in the nonrecurrent group for TIMP 1 (p
= 0.099) but not for TGM4 (p = 0.80).
Figure 3.13 Differential expression of verified candidates. Analysis was completed by MS in
EC and OC direct-EPS and Western blotting in BCR (R) and nonrecurrent (NR) patient direct-
EPS (reported as densitometry values; **p-value ≤0.01; *p-value ≤0.05; x, p-value ≤0.1; NS
denotes an insignificant p-value.
93
Figure 3.14 Differential expression of candidates measured by immunoassays. (A) Western
blots and stained membranes used to probe for candidates in direct-EPS from BCR and
nonrecurrent groups of patients. (B) MME ELISA in direct-EPS (x denotes p-value ≤0.1).
3.3.5 Verification of differentially expressed candidates in EPS-urines
A major aim of the current study was directed toward the detection of our candidates in a
clinically accessible fluid that can be collected with minimal invasiveness. EPS-urine samples
obtained during a DRE meet these criteria. Therefore, EPS-urine samples were obtained from
patients about to undergo a prostate biopsy procedure who subsequently were diagnosed with
pathology-confirmed EC and OC prostate tumors. Pools of EPS-urines from both groups were
prepared (Table 3.4) and used to assess the presence of the candidate proteins in this biofluid by
Western blot and a multiplexed SRM-MS approach. All candidates were quantifiable in the EPS-
urine with at least one proteotypic peptide by SRM-MS (Figure 3.15, Appendix Figure 3.1,
Appendix Tables 3.4, 3.5). The overall expression levels of MME, PARK7, and TIMP1
coincided with the Western blot analyses in the same pools (Figure 3.16). For SFN, one of the
two peptides representing SFN (NLLSVAYK) was found to be lower in the OC tumor group,
94
whereas the other (VLSSIEQK) was not quantifiable. Similarly, both TGM4 peptides were
down-regulated in the OC tumor group, which is not in line with the Western blot analyses.
Prostate specific antigen expression was found to be similar between the OC and EC tumor
groups as measured by two peptides: peptide HSQPWQVLVASR was higher in the OC group by
an average of 1.7-fold and LSEPAELTDAVK was by an average of 1.3-fold. This trend was
consistent with Western blot analysis of these same pools. ELISA was also performed on the
individual EPS-urines that comprise the pools, whereby a notable difference between the two
groups were not apparent (Figure 3.17).
95
Table 3.4 Clinical information of patient samples used to generate EPS-urine pools.
96
97
Figure 3.15 Standard curves generated for each peptide used for SRM-MS measurements.
Six concentration points were evaluated by 3 technical replicates in EPS-urine digests.
98
Figure 3.16 Verification of differential expression of candidates in EPS-urines pooled from
patients with EC or OC prostate tumors. Five candidates were measured by Western blot and
SRM-MS using heavy isotope-labeled peptide standards. Relative quantitative values are shown
for each technical replicate and as the average ratio of light/heavy peptide in the OC tumor group
divided by the light/heavy peptide in the EC tumor group ± standard deviation for each peptide.
99
Figure 3.17 PSA measurements in individual EPS-urines from patients with EC and OC
prostate tumors by ELISA (NS denotes insignificant p-value). The same patient samples were
pooled and assayed by Western blot for PSA as well as SRM-MS.
3.4 Discussion
Emerging advances in proteomics and genomics has led to the identification of a vast number of
biomarker candidates for a plethora of malignancies; however, few see clinical application (209,
311). The potential of serum as a source of protein biomarkers is hampered by its immense
complexity (118); thus we and others have looked to organ-proximal fluids for the discovery and
targeted verification of putative biomarkers of various conditions (260, 264, 267, 312-315).
Here, we interrogated a prostate-proximal fluid, EPS, for the identification of candidate
biomarkers of aggressive prostate cancer by analyzing a cohort of patients with EC versus OC
disease phenotypes. We followed up with verification of these biomarkers to identify aggressive
disease through additional analyses of a cohort of BCR and nonrecurrent patient direct-EPS.
Efforts to adapt urine as a source of biomarkers for prostate cancer have been made using various
approaches. For instance, DNA markers such as GSTP1 hypermethylation (316), RNA markers
such as PCA3 (317), and protein markers such as ANXA3 (167), matrix metalloproteinases
(169), and urinary versus serum PSA (318, 319). In light of the clinical relevance of urine
samples, we accessed EPS-urine for verification and potential future assay development of our
candidate biomarkers.
100
Our 624 EPS protein data set contained a number of previously proposed urinary protein prostate
cancer biomarkers, including PSA, ANXA3, and matrix metalloproteinases (MMP7 and 9), all of
which demonstrated differential expression between the EC and OC groups. Furthermore, a large
proportion of the proteome of EPS was predicted to have TMDs and SP sequences that are
characteristic of plasma membrane and secreted proteins, including our five candidates. A cell-
surface glycoprotein, MME, has been proposed to have an inhibitory effect on prostate cancer
cell growth and migration through its association with PTEN, PI-3 kinase (320) and its loss has
been shown to be associated with PSA relapse (321). The observation in the current study was
that this protein is significantly higher in expression in the OC and nonrecurrent groups
compared with the EC and BCR groups in direct-EPS. In general, there was a trend of decreased
expression of most proteins detected in the EC groups relative to the OC groups. However, at the
protein network level, proteins in the complement and coagulation cascade were significantly
higher in expression in the EC tumor group. There are clearly a high abundance of immune
system-related proteins identified in the EPS fluids, which is consistent with the chronic
inflammation associated with development and progression of prostate cancer (322).
Lymphocytes and macrophages present in the prostate are most often associated with this
chronic inflammation, and less frequently plasma cells, eosinophils, and neutrophils (323, 324).
The presence of tumor-associated macrophages have been reported as potential biomarkers of
poor prognosis in prostate cancers (324, 325), and we have evidence that these macrophages can
be detected by flow cytometry of the low speed cell pellets of EPS urine samples in a subset of
prostate cancer patients (data not shown). A better understanding of the role of the many
detectable biomarkers of immune system activity, like the members of the complement path-
way overexpressed in non-OC EPS samples, is critical for improving diagnostic targeting of
aggressive prostate cancers.
The complexity of the EPS proteome could be reduced using depletion columns, specifically
targeted for high-abundance proteins such as albumin, before digestion. This method may
facilitate the quantification of lower-abundance proteins although sample recovery may be a
caveat if starting concentrations are low. Advances in nanoparticle technology are also showing
promise in their application in proteomics to deepen proteome coverage and enhance recovery of
low- abundance proteins. Nanoparticles, functionalized using high-affinity baits to selectively
capture desired classes of proteins, have been shown to enrich otherwise, low-abundance
101
proteins, by orders of magnitude (326-328). Yet other strategies focus on the analysis of
subproteomes by extraction of desired organelles or exploiting post-translational modifications to
selectively enrich for desired classes of proteins. For instance, glycosylation is the most common
post-translational modification and N-linked glycosylation is particularly common in secreted or
plasma membrane proteins (329). Enrichment of this class of proteins can be achieved by the
selective capture of N-linked glycosites to a solid support via hydrazide chemistry followed by
enzymatic release by peptide N-glycosidase F (330, 331), or various lectin affinity approaches
(332-334). Quantification can be achieved by stable isotope labeling and tandem MS. Zhao and
colleagues have extended the technique of stable isotope labeling by amino acids in cell culture,
by generating stable isotope labeled secretome standards for the quantification of soluble
proteins in direct-EPS (315). Alternatively, more elegant techniques such as SISCAPA® (335,
336) can selectively enrich for target peptides using anti-peptide antibodies and coupled to SRM-
MS for targeted quantification, albeit the pipeline from generation of monoclonal antibodies and
their validation may be a significant limiting factor. Analyzing urinary sediments, which
accounts for 47.7% of the total protein content by weight (171), may offer interesting markers of
prostate cancer that may be missed by solely investigating the soluble fraction. Mataija-Botelho
et al. identified 60 proteins in the sediment of urine, many of which were also identified in the
soluble fraction, suggesting that simply discarding the sediment may alter abundance ratios of
different proteins (337). EPS fluids are also an abundant source of exosomes, a sub-fraction that
is also being targeted for discovery of prostate cancer biomarkers (172, 173, 338). Initial pilot
proteomic studies of EPS-urine exosome fractions have indicated readily detectable and
abundant proteins by MS (126). This exosome fraction is a subcomponent of the proteins
analyzed in the present study, and we are currently pursuing more detailed analyses of these
EPS-derived exosomes from samples representative of the disease spectrum of prostate cancers.
Our pipeline attempts to bridge discovery proteomics in direct-EPS with validation in a more
clinically obtainable fluid, EPS-urine. As such, we observed some discrepancies that can be
attributed to differences in direct-EPS and EPS-urine samples, sample size, patient heterogeneity,
the translation of shotgun proteomics to targeted SRM-MS, and inherent challenges in SRM-MS
quantification. In the current study, discrepancies in SFN and TGM4 by Western blotting and
SRM-MS are reported. Although a moderate-to-good correlation between antibody-based assays
(namely, ELISA) and targeted proteomics approaches have been reported (250, 339, 340),
102
discordance between the two methods can be conceptualized by the differences in the targeting
approaches. Therefore, it may be possible that the epitope for the SFN and TGM4 antibodies
used in the current study may be modified thus resulting in inaccurate measurements, or the
antibodies may be also targeting other isoforms of the proteins as both are members of larger 14-
3-3 and transglutaminase protein families. Furthermore, proteoforms – that is the different
molecular forms in which the protein product of a single gene can be found, including changes
due to genetic variations, alternatively spiced RNA transcripts, and post-translational
modification (341) − may confound accurate quantification. The quantification of SFN and
TGM4 thus remains inconclusive at this point of time.
Recently, Hüttenhain and colleagues demonstrated a high-throughput workflow for the
development of SRM-MS assays for a large number of cancer-associated proteins in human
urine and plasma (253). In their study, 408 proteins were detectable by SRM-MS in urine, of
which 169 were previously undetected in the data sets from Human Protein Atlas and Adachi et
al. (124). In another study, the serum and prostate glycoproteome of Pten-null and wild-type
mice were comparatively assessed using label-free quantitative proteomics followed by SRM-
MS of 39 protein orthologs in sera of prostate cancer patients and controls (240), demonstrating a
useful platform for biomarker discovery. Similarly, the current study demonstrates the adaptation
of a discovery-driven global proteomics experiment in a prostate-proximal fluid to a targeted
quantitative assay in EPS-urine. Although a large proportion of the SRM-MS optimization
process for peptides can be achieved in a simpler matrix, such as BSA, yeast, or cell lysate
background, finer optimization in the highly complex clinical specimen is an important
requirement. General caveats associated with protein biomarker studies in urine are low protein
abundance, the presence of cellular material, high concentrations of salts and interfering
substances such as urea, urobilin and other metabolites, and intra- and inter-individual variability
(271). In our hands, initial optimization studies were performed in a BSA background, whereby
lower concentrations of heavy isotope labeled peptide standards were easily identifiable by at
least three transitions. However, once introduced to the EPS-urine background, we observed
large amounts of ion suppression and interference of the heavy isotope-labeled peptide standards
that were easily identifiable in BSA. Most peptides showed good linearity within 1–500 fmol on-
column range but equimolar concentrations of peptides showed variable responses. Therefore, in
the end we used a heavy peptide standard concentration of 500 fmol on-column, which allowed
103
all peptides to be readily detectable in EPS-urine. Using heavy isotope labeled peptide standards,
we quantified PSA, SFN, MME, PARK7, TIMP1, and TGM4 in pooled EPS-urines of EC and
organ-confined prostate cancers by 1 or 2 peptides. The SRM-MS experiments presented in this
proof-of-concept study primarily serves to demonstrate the applicability of EPS-urines to SRM-
MS assay platforms and to support its value as an important clinical biofluid. By no means do
pooled EPS- urines represent the heterogeneity of the patient population. The only way to verify
the relevance of our proposed candidates would be to quantitatively measure them in large
cohorts of individual EPS-urines and thus future work will be dedicated to extending the
verification phase of the current study.
3.5 Materials and methods
3.5.1 EPS Sample Properties
Direct-EPS samples were obtained from patients following informed consent and use of
Institutional Review Board approved protocols at Urology of Virginia and Eastern Virginia
Medical School between 2007 and 2009 and the Research Ethics Review Board at the University
Health Network. Sample collection and handling has been previously described (260). Briefly,
an aggressive prostate massage was performed in confirmed prostate cancer patients under
anesthesia prior to undergoing prostatectomy to collect 0.2–1 ml of fluid. Samples were diluted
with saline to 5 ml and stored on ice for a maximum of 1 h. Particulates were sedimented by low-
speed centrifugation and the resulting supernatants were aliquoted to 1 ml and stored at -80 °C.
The clinical data linked to the patients enrolled in the study are outlined in Tables 2.1 and 2.3.
Organ-confined disease is defined as the condition of patients with tumors not beyond stage T2c
that are margin, seminal vesicle and lymph node negative. Extracapsular disease is differentiated
from OC by evidence of margin involvement, extraprostatic extension, or positive for tumor in
seminal vesicles or lymph nodes. BCR is defined as patients with rising serum PSA values over
0.2 ng/ml at 3 months or longer post-prostatectomy. EPS-urines were collected from men
reporting to the clinic for a prostate biopsy via prostate massage with three strokes on each side
of the median sulcus of the prostate followed by collection of voided urine (10–20 ml), as
previously described (255). Pools of EPS-urines were derived from 17 patients classified as
having OC prostate cancer and 17 from non-OC, EC cancers using 1 ml per patient sample.
Tumor grades, staging, prostate volumes and percent of cancer positive needle biopsy cores were
104
determined using standard pathological procedures. Patient information was recorded, including
demographics, medical history, pathology results, and risk factors, and stored in a Caisis
database system. All personal information or identifiers beyond diagnosis and laboratory results
were not available to the laboratory investigators.
3.5.2 Protein digestion and sample preparation for mass spectrometry
Ultrapure-grade urea, ammonium acetate, and calcium chloride were from BioShop Canada, Inc.
(Burlington, ON, Canada). Ultrapure-grade iodoacetamide, and dithiotreitol (DTT) were from
Sigma-Aldrich. HPLC-grade solvents (methanol, acetonitrile, and water) and formic acid were
from Fisher Scientific. Mass spectrometry-grade trypsin was from Promega (Madison, WI).
Solid-phase extraction C18 MacrospinTM Columns were from The Nest Group, Inc. (Southboro,
MA). Amicon spin filter columns, 0.5 ml, 3 kDa MWCO, were from Millipore.
Protein concentrations of the direct-EPS samples used for discovery-based proteomics were
quantified by Bradford assay and a volume corresponding to 100 μg of total protein were
concentrated to 20 μl using a spin filter column. Concentrated direct-EPS was diluted with 50 μl
of 8 M urea, 2 mM DTT, 50 mM Tris-HCl, pH 8.5, and incubated at 37 °C for 1 h.
Carbamidomethylation was performed by incubating samples with 8 mM of iodoacetamide for 1
h at 37 °C in the dark. Samples were then diluted to 1.5 M urea using 100 mM ammonium
bicarbonate, pH 8.5. Calcium chloride was added to a final concentration of 2 mM and the
protein mixture was digested with 4 μg of trypsin at 37 °C overnight. The digested peptide
mixture was purified with C18 MacrospinTM columns and concentrated by vacuum
centrifugation and reconstituted to a volume of 40 μl with 0.1% formic acid. Samples were
stored at -80 °C until used for MudPIT analysis. A 9-step MudPIT sequence was used as
previously described (203, 204, 260).
3.5.3 Protein identification and data analysis
Raw data was converted to m/z XML using ReAdW (version 1.1) and searched by X!Tandem
(version 06.06.01) against a locally installed version of the human UniProt complete human
proteome protein sequence database (release date 2009, 20,323 sequences). The search was
performed with a fragment ion mass tolerance of 0.4 Da and a parent ion mass tolerance of 10
ppm, and two miscleavages were allowed. Carbamidomethylation of cysteine was specified as
105
fixed and oxidation of methionine as a variable modification. A target/decoy search was
performed to experimentally estimate the false positive rate and only proteins identified with two
unique high quality peptide identifications were considered as previously reported (269, 296,
298) (1 decoy protein identified; FDR ≤0.2%). An in-house protein grouping algorithm was
applied to satisfy the principles of parsimony (296). A total of 624 protein groups were identified
(Appendix Tables 3.1, 3.2). Proteins with significant differential expression levels were
determined based on a hierarchical Bayesian statistical algorithm known as QSpec (275).
Proteins were considered to be differentially expressed if normalized spectral abundance factors
(342) derived from QSpec were at least two-fold different between EC and OC groups with a
FDR ≤0.05.
3.5.4 Gene Ontology annotation, pathway enrichment and network-
based analysis of EPS proteins
Analysis of differentially expressed proteins by Gene Ontology assignment was performed using
DAVID v6.7 (299, 343). Proteins were uploaded into the DAVID functional annotation tool and
compared with the human genome background. Enriched pathways and Gene Ontology FAT
terms with a minimum of two-fold enrichment and a Fisher’s Exact test p-value ≤0.05 were
considered and the top Gene Ontology terms that are shared between the EC and OC tumor
groups are reported as the proportion of genes involved relative to the total number of genes
involved in the whole human genome background.
To understand the relationship among differentially expressed proteins further, we performed a
network-based analysis using a protein FI network (344). From the FI network, which contains
close to 50% of the human proteome and more than 200,000 FIs, we extracted the subnetworks
that involve the differentially expressed proteins. Using a minimum spanning tree algorithm, the
minimum number of additional “linker” genes from the full FI network were added in order to
form a single fully-connected subnetwork (345). The subnetwork was clustered using the edge-
betweenness algorithm (346) in order to identify functionally related network modules. Clusters
with size ≥10 were kept for further analysis. All network diagrams were drawn using Cytoscape
v. 2.7 (347). For pathway enrichment analysis of network modules, the bionomial test was used
and the FDR was calculated with 1000 permutations across all genes in the FI network
(Appendix Table 3.3).
106
3.5.5 Integrative data mining
Differentially expressed proteins were compared with transcriptomic profiles based on gene
mapping using an in-house relational database with mRNA microarrays for 22 normal tissues,
including the prostate, available through Human Gene Atlas (261). The median mRNA
expression level of each gene product was calculated across all 22 tissues and compared with the
level of expression in the prostate alone, as detected by a total of 218 probe IDs. In this way,
genes that demonstrate a propensity toward elevated expression in the prostate compared with
other tissues at the transcriptomic level were derived. To assess whether the differentially
expressed proteins had predicted TMDs and SPs, their assigned UniProt IDs were uploaded onto
the Protein Center bioinformatics software (Thermo Fisher Scientific), which uses TMAP
(http://emboss.sourceforge.net/apps/release/6.0/emboss/apps/tmap.html) and PrediSi
(http://www.predisi. de/) tools for predictions. The Human Protein Atlas (348) was used to
determine whether a given candidate was previously proposed as a cancer biomarker (Cb; from
the Plasma Proteome Institute) and to assess the immunoreactivity of antibodies directed against
candidate proteins in normal (non-neoplastic and morphologically normal) and cancerous
prostate tissue sections. Comparisons across various proteomics data sets (direct-EPS, seminal
plasma, urine, cell lines) (124, 260, 264, 267, 268, 312) were performed using the Protein Center
bioinformatics software.
3.5.6 Immunoassays in prostatic fluids
The Human Neprilysin (MME) DuoSet® ELISA Development System (R&D Systems,
Minneapolis, MN) was used to measure MME in direct-EPS. Total protein concentration
corresponding to 140 μg was obtained from each sample. Samples and standard were diluted in
1% BSA in PBS, pH 7.4 and assayed in duplicate following the protocol outlined by the
manufacturer. PSA and PAP ELISAs were performed as previously described (255, 349).
For Western blotting, anti-SFN (ab14123), anti-MME (ab951), anti- PARK7 (ab11251), anti-
PSA (ab46976) were obtained from Abcam (Cambridge, MA), anti-TGM4 (sc55791) was from
Santa Cruz Bio- technology (Santa Cruz, CA), anti-TIMP1 (RP1T1) antibody was from Triple
Point Biologics (Forest Grove, OR). Direct-EPS from patients with BCR and no evidence of
recurrence were quantified by Bradford assay and a corresponding 30 μg of total protein were
107
resolved on Criterion 10 –20% Tris-HCl SDS-PAGE gels (Bio-Rad) and transferred to PVDF
membranes (Bio-Rad). The membranes were blocked with 5% milk in TBS-T for 1 h at room
temperature. Primary antibodies were diluted in blocking solution and incubated at 4 °C
overnight, washed with TBS-T, and incubated with secondary antibody diluted in blocking
solution for 1 h at room temperature. Signals were developed using SuperSignal West Pico or
Femto chemiluminescent substrate (Thermo Fisher Scientific). Pooled EPS-urines were
concentrated from 15 ml to 500 μl by using a spin column (Amicon Ultra Spin Filter, 3 kDa
MWCO, Millipore) and quantified using Bio-Rad DC assay. Concentrations corresponding to 50
μg of total protein were resolved on 10% SDS-PAGE, transferred to PVDF membranes, and
assayed as described above.
3.5.7 Statistical analysis
Following candidate protein selection, we verified their differential expression by Western blot
densitometry and/or ELISA. To determine if the protein levels differ between OC tumor groups
and EC tumor groups, we applied the Mann-Whitney U test to assess if the difference is
statistically significant. A p-value ≤0.05 was considered significant.
3.5.8 SRM-MS development
A spectral library was built using the Skyline software tool (version 0.5 and higher) (350) from
the LTQ-Orbitrap XL data obtained from the 16 direct-EPS samples. All spectra had scores that
passed a stringent peptide score as determined by X!Tandem target decoy search criteria (i.e.
0.5% decoy spectral matches). In addition, two publicly available consensus libraries were used:
NIST_human_QTOF (v 3.0 05/24/2011) and GPM human_ipi_cmp_20. Protein sequences were
converted to FASTA format and uploaded into Skyline for the prediction of signature peptides.
Peptides were chosen based on previously reported specifications (241): predicted tryptic digests
of 7–30 amino acids in length and containing zero possible missed cleavage sites or cysteine and
methionine residues, and matching the spectral libraries. Four to 6 of the most intense y-ions
with a precursor charge of +2 and a fragment ion charge of +1 were selected from the libraries,
resulting in several possible peptides for each of the candidates. The GPM (http://www.gpm.org)
was used to narrow down the list of peptides to those that have been observed experimentally
with a high number of +2 charge states. Furthermore, no possible missed cleavage sites within 3
108
amino acids before or after the peptide, no acidic residues on cleavage sites, and uniqueness to
the gene (241) were additional requirements. Other favorable conditions included no possible
post-translational modification sites, no N-terminal Q, E, no W, and SSR hydrophobicity 5–30
(241, 351, 352). Based on these criteria, 21 signature peptides for six candidates (SFN, MME,
PARK7, TIMP1, TGM4, and PSA) were chosen for SRM-MS experimentation (Appendix Table
3.4).
Lyophilized, unpurified 13C- and 15N-labeled arginine and lysine peptides (SpikeTides_L™,
JPT Peptide Technologies, Berlin, Germany) were obtained in a 96-well format and solubilized
in 80% 0.1 M ammonium bicarbonate and 20% acetonitrile. A heavy isotope labeled peptide
standard stock containing all 21 SpikeTides_L™ at equal concentrations was made for
subsequent experimentation.
For optimization studies, the heavy peptide standard stock corresponding to 500 fmol on-column
was added to BSA digests and automatically loaded from a 96-well microplate autosampler
using the EASY-nLC system. Analysis was performed on a TSQ Vantage triple quadrupole mass
spectrometer (Thermo Fisher Scientific, San Jose, CA) interfaced with the EASY-nLC system
(Proxeon Biosystems, Odense, Denmark), as described previously (204, 260). A 40 min HPLC
gradient running at a flow rate of 400 nl/min was used. The gradient conditions were as follows:
starting with 100% buffer A (water/0.1% formic acid) followed by an increase to 60% buffer B
(acetonitrile/ 0.1% formic acid) for 36 min, followed by a steep increase to 95% buffer B for 2
min, and finally to 100% buffer B for 2 min. For compound optimization, the Thermo TSQ Tune
software (Thermo Fisher Scientific, San Jose, CA) was used. The XCalibur data system (Thermo
Fisher Scientific, San Jose, CA) was used to input the following SRM-MS method parameters:
one scan event over a 40 min method time at a fixed scan width of 0.02 m/z, scan time of 0.010
s, 0.2 fwhm Q1 and 0.7 fwhm Q3 resolution, and data collection in profile mode. For each
peptide, 3– 4 most intense transitions for each eluting peptide were selected and their accurate
retention times were recorded using Skyline (350). These transitions were grouped into two
separate methods to maintain a cycle time of 1 s.
109
3.5.9 Relative quantification by SRM-MS in EPS-urines
For quantification by SRM-MS, EPS-urine pools were prepared as follows: concentrated EPS-
urines were precipitated in 100% methanol (1:20 v/v ratio) overnight at -20 °C. Protein pellets
were recovered by centrifuging at 16,000 g for 30 min at 4 °C. Two additional washes with
methanol were performed to obtain “clean” protein pellets. The pellets were subsequently
resolubilized in 50 μl of 8 M urea, 2 mM DTT, 50 mM Tris-HCl, pH 8.5, and incubated at 37 °C
for 1 h. Carbamidomethylation was performed by incubating samples with 8 mM of
iodoacetamide for 1 h at 37 °C in the dark. Samples were then diluted to 1.5 M urea using 100
mM ammonium bicarbonate, pH 8.5. Calcium chloride was added to a final concentration of 2
mM and the protein mixture was digested with 4 μg of trypsin at 37 °C overnight. The digested
peptide mixture was purified with C18 MacrospinTM columns and concentrated by vacuum
centrifugation and reconstituted to a volume of 100 μl with 5% acetonitrile, 0.1% formic acid.
Samples were incubated at 37 °C for 30 min to allow peptides to go into solution and
subsequently centrifuged at 16,000 g for 30 min at 4 °C to remove any remaining debris.
For quantification in EPS-urine pools, a total of 1 μg of peptides on-column (as determined by
the micro BCA assay) were spiked with 500 fmol on-column of the heavy isotope-labeled
peptide standard stock and assayed following the same conditions as above. Each sample was
assayed in four technical replicates (Appendix Table 3.5). Total peak area values and area ratios
(light/heavy peptide) were obtained from Skyline (350). Relative quantitative values are shown
for each technical replicate and as the average ratio of light/heavy peptide in the OC tumor group
divided by the light/heavy peptide in the EC tumor group ± standard deviation for each peptide.
110
Chapter 4
Systematic development of a quantitative assay for putative
prostate cancer biomarkers in expressed prostatic secretions
Kim, Y., Mejia, S.*, Jeon, J.H.*, Ignatchenko, V., Yao, C.Q., Nyalwidhe, J.O., Lance, R.S.,
Gramolini, A.O., Troyer, D.A., Drake, R.R., Boutros, P.C., Semmes, O.J., Kislinger, T.
Systematic development of a multiplexed selected reaction monitoring mass spectrometry assay
for prostate cancer biomarkers in expressed prostatic secretions. Manuscript in preparation. *
Equal contribution
4 Systematic large-scale development of targeted
proteomics assays for biomarker verification in
prostate-proximal fluids
4.1 Abstract
Current prostate cancer prognostic factors stratify patients into risk groups, but are inaccurate in
predicting outcome, resulting in overtreatment and repeat biopsies of many men with indolent
disease. Therefore, a pressing need in prostate cancer management is improved prognostic
factors that enable follow-up of men with low risk disease without repeat needle biopsies. To
address this, I systematically explored non-invasively collected prostate proximal fluids, known
as expressed prostatic secretions (EPS), as a source of prostate biomarkers. Analysis of direct-
EPS from extracapsular (EC) and organ-confined (OC) prostate cancers revealed differentially
expressed proteins based on spectral counts (244). Selection criteria led to 147 proteotypic
peptides suitable for verification using selected reaction monitoring mass spectrometry (SRM-
MS). For each candidate, a corresponding crude heavy isotope labeled peptide was synthesized
and used for SRM-MS method development. Next, relative quantification of endogenous peptide
concentrations in defined subgroups of prostate cancer patients and control EPS-urines was
performed, revealing 34 candidate peptides with high potential as prostate cancer biomarkers. A
single scheduled SRM-MS assay was developed and used to verify the differential expression of
111
these candidate peptides by absolute quantification in an independent cohort of stratified EPS-
urines.
4.2 Introduction
The worldwide incidence of prostate cancer has been steadily rising but many patients exhibit
tumors of an indolent nature; that is, these tumors grow slowly and pose minimal threat to the
life of the patient, in the absence of treatment. Once cancer has been confirmed, the optimal
course of action is tailored on a patient-to-patient basis, ranging from active surveillance
programs to surgery and radiotherapy. The goal of individualized patient management is to spare
patients with indolent disease from unnecessary procedures, while identifying and treating those
who would benefit from treatment intensification. Gleason grading is a strong prognosticator of
prostate cancer, but requires surgical or biopsy specimens. The number of patients admitted to
the hospital as a result of post-biopsy complications is sizable, posing a significant burden on
health care and risking serious complications. Most men diagnosed with prostate cancer have
localized disease with excellent prognosis (localized cancers have 100% 5 year survival, SEER
Cancer Statistics http://seer.cancer.gov), but nearly 70% receive early intervention in the form of
surgery or radiotherapy (353-357). Furthermore, even active surveillance relies on repeat
prostate specific antigen (PSA) testing, digital rectal examination (DRE), and multiple prostate
biopsies.
Liquid biopsies, such as circulating tumor cells and free DNA have been proposed as non-
invasive prostate cancer biomarkers, but their detection and enrichment remains technically
challenging. Cataloguing the secreted and soluble factors released into the interstitial fluid that
bathes the organ of interest may provide a novel inventory of putative disease biomarkers.
Indeed, I have interrogated the collection of proteins comprising a prostate-proximal fluid,
extraprostatic secretions (EPS) (126, 127, 244, 260). Reproducible detection and quantification
of a protein in the matrix of interest is a requirement of any potentially valuable disease
biomarker, but verification of these candidates is a major bottleneck in the pipeline from
discovery to clinical implementation.
Traditionally, immunoaffinity based assays, namely enzyme-linked immunosorbent assays
(ELISAs), are used to validate a biomarker; but this approach is time-consuming, costly, and
112
relies on the existence of validated antibodies for a given target protein. Targeted mass
spectrometry (MS) offers an alternative approach to the rapid verification of candidate
biomarkers. Selected reaction monitoring mass spectrometry (SRM-MS) is the leading method
for the targeted quantification of proteins using MS. It offers excellent selectivity, sensitivity,
and throughput, without the need for validated antibodies. In the current chapter, I will describe
my efforts that extend the previously published investigations into the EPS proteome (126, 127,
244, 260). Here, I systematically generated a multiplexed quantitative SRM-MS assay targeting
candidates with prostate cancer prognostic and diagnostic potential using a clinically relevant
prostate proximal fluid.
4.3 Results
4.3.1 Candidate selection for targeted quantification by selected reaction
monitoring mass spectrometry
Utilizing a previously published shotgun proteomics dataset derived from direct-EPS comparing
the proteomes of EC and OC prostatic tumors (described in Chapter 3), 232 proteotypic peptides
(Phase 1 peptides, Appendix Table 4.1) were shortlisted for targeted quantification by SRM-MS
in EPS-urines (Figure 4.1). In order to abrogate potential confounding influences due to the
differing modes of data acquisition, instruments, and samples, Phase 1 peptides were first
evaluated for reproducible detection by SRM-MS. For this, crude heavy isotope labeled synthetic
versions of the target peptides (JPT Peptide Technologies GmbH) were spiked into a pool of
EPS-urine, and the light (endogenous) and heavy (synthetic) peptides were monitored. The data
was manually inspected in order to select peptides that had at least 3 fragments ions that aligned
at the expected peptide elution time, had co-eluting light and heavy peptides, had minimal
interference, and were reproducible. Out of the 232 peptides monitored, 147 (63%) peptides
(Phase 2 peptides, Appendix Table 4.1) met these criteria, and were thus taken forward to an
independent cohort of EPS-urine samples.
113
Figure 4.1 Study workflow outlining the three major phases of the current project.
Discovery data from direct-EPS derived from patients with EC or OC prostatic tumors was used
to identify candidates demonstrating differential expression (Chapter 3). Proteotypic peptides
from these candidates were carefully selected and evaluated by SRM-MS for performance in an
EPS-urine background. Next, peptides were analyzed in EPS-urines from individuals with
benign prostatic hyperplasia (BPH) or no evidence of disease and compared to EC and OC EPS-
urines. Peptide quantification by SRM-MS was performed and 34 promising candidates with
diagnostic and prognostic potential were identified based on estimated abundance changes. The
34 candidates were accurately quantified in a different cohort of EPS-urines for verification.
Finally, ongoing statistical analyses are being performed to generate a peptide signature with a
high potential for prostate cancer diagnosis and prognosis.
114
4.3.2 Targeted relative quantification of candidates in a cohort of EPS-
urines
For targeted quantification of Phase 2 peptides, a cohort of EPS-urines derived from 74 clinically
stratified prostate cancer patients and noncancer controls (BPH and normal individuals) was
utilized (Cohort A, Table 4.1, Appendix Table 4.4). The overlap of these 147 putative candidates
with a previously published dataset composed of 661 cancer-associated proteins in urine (253)
was determined. Of the 147 Phase 2 peptides, only 10 overlapped with urine (Figure 4.2A),
suggesting that post-DRE urines are enriched in prostate-derived proteins. Indeed, in a previous
study (Chapter 2), myself and colleagues have demonstrated the significant enrichment of
proteins in EPS-urine (post-DRE urine) when compared to urine (matched pre-DRE urine) (127).
Furthermore, the estimated average concentration of Phase 2 peptides measured in this cohort of
EPS-urine demonstrated a dynamic concentration range of peptides spanning ~5 orders of
magnitude, with the exception of the two KLK3 (PSA) peptides HSQPWQVLVASR and
LSEPAELTDAVK (Figure 4.2B).
Table 4.1 Overview of clinical information from patients enrolled in Cohort A of EPS-
urines.
115
Figure 4.2 Peptides in EPS-urine. (A) Venn diagram depicting the number of Phase 2 peptides
that were also detected in the urines of a previous publication by Hüttenhain et al. (253). (B)
Estimated concentrations of Phase 2 peptides based on the spike-in concentration of the crude
heavy isotope labeled peptide standards. Red dots represent Phase 3 peptides that were
subsequently quantified by using AQUA peptides.
116
4.3.3 Selection of peptide candidates for verification
To derive a small set of peptide candidates for verification, a student’s T-test was performed to
compare the ratios of peptide abundance between cancer and noncancer groups, as well as EC
and OC groups. The first criteria used to select candidates as potential diagnostic and prognostic
biomarkers were p-value cutoffs of 0.05 and 0.1, respectively. Next, refinements to the list of
candidates were made by adding peptides representing the proteins IGJ and ANXA1 because
they were the only candidates that that were upregulated in the EC tumor group, although they
did not meet the p-value cut-off (p-value = 0.25). Furthermore, KLK3 peptides were added to
monitor PSA levels in EPS-urine. Finally, each of the peptides that met these criteria were
manually inspected for quality of SRM-MS traces. Overall, 34 candidates (Phase 3 peptides)
demonstrated a potential for classifying individuals based on prostatic disease status (Figure 4.1,
Figure 4.3, Appendix Table 4.2, Appendix Table 4.6). Peptides with diagnostic potential were
considered to be those that could discriminate prostate cancer from noncancer conditions.
Twenty-four such peptides were selected, of which the majority were upregulated in cancer
(Figure 4.3A and B). These candidates are grossly involved in functions associated with response
to oxidation when characterized by Gene Ontology enrichment (Figure 4.3C). Of the 14
candidates that demonstrated a prognostic potential – that is, they could discriminate EC tumors
from OC ones – only two candidates were upregulated in the EC tumor group (Figure 4.3A and
B). Candidates that were upregulated in the OC group are largely associated with the immune
response by Gene Ontology enrichment analysis (Figure 4.3C).
In order to generate a SRM-MS assay for accurate targeted quantification of Phase 3 peptides,
each peptide was synthesized as a highly purified (>97% purity) heavy isotope labeled standard
(AQUA peptides, Thermo Fisher Scientific) and calibration curves were generated for all
candidates in a urine background (Appendix Figure 4.1). All peptides demonstrated excellent
linearity (R2 ranging from 0.90 – 0.99) and reproducibility (coefficient of variation ranging from
0.4 – 19%) (Table 4.2, Appendix Table 4.3). The limit of detection ranged from approximately
0.006 to 8 fmol/μg of total urine peptides; the lower limit of quantification ranged from
approximately 0.02 to 25 fmol/μg of total urine peptides.
117
Figure 4.3 Phase 3 peptides. (A) Venn diagram depicting the number of peptides with
diagnostic potential and the number of peptides with prognostic potential. (B) The number of
peptides that the up- or down-regulated in the cancer group compared to the noncancer group
(left graph), and the EC tumor group compared to the OC tumor group (right graph). (C) Gene
Ontology terms that are enriched among the diagnostic and prognostic peptides.
118
Table 4.2. Limits of detection and quantification for 34 peptides determined by response
curves generated using AQUA peptides spiked into urine.
119
4.3.4 Absolute quantification for the verification of candidates in EPS-
urines
Next, the 34 Phase 3 peptides were quantified for the verification of their differential expression
in an independent cohort of 207 EPS-urines. Prostate cancer EPS-urines were further stratified
into pathological stage pT3 and pT2 tumors. Noncancer specimens were derived from patients
with BPH and normal individuals (Cohort B, Table 4.3, Appendix Table 4.5).
Table 4.3 Overview of clinical information from patients enrolled in Cohort B of EPS-
urines.
Since sufficient sample was available, each EPS-urine was measured in two technical replicates,
using a single scheduled SRM-MS method. The duplicate measurements for each sample in
Cohort B demonstrated strong reproducibility (Figure 4.4A). A Pearson correlation analysis
between the duplicate SRM-MS runs revealed that the majority of peptides have Pearson
correlation coefficients of 0.8 or above (Figure 4.4B). Peptides that have a Pearson correlation
coefficient of 0.8 or lower were found to be generally low-abundance peptides. The analysis also
demonstrated good chromatographic reproducibility (coefficient of variation <1.5%) with most
peptides eluting between 20 and 30 minutes over the 40-minute chromatographic gradient
(Figures 4.4C and D).
120
Figure 4.4 Cohort B SRM-MS data quality. (A) Peptide abundance correlation between both
replicates analyzed by SRM-MS. (B) The fraction of peptides that fall into different Pearson
correlation coefficient cut-offs in the subgroups of EPS-urine samples. (C) The average retention
121
time and coefficient of variation for each of the 34 peptides. (D) The recorded retention times of
the 34 candidates; boxplots represent the interquartile range; whiskers represent the minimum
and maximum recorded retention time.
Next, the absolute concentration of each peptide across all 207 samples in Cohort B was
determined based on the amount of spike in heavy AQUA peptides (Figure 4.5). The 34 peptides
measured in this set of EPS-urines spanned approximately 5 orders of magnitude, from the low
fmol to nmol range per µg total protein. As expected, the two peptides representing KLK3
(HSQPWQVLVASR and LSEPAELTDAVK) were found to be the most abundantly expressed
peptides in EPS-urine. The remaining peptides spanned approximately 3 orders of magnitude,
with the peptide representing DOPD being the highest and one of the peptides representing
PEDF being the lowest in this set of EPS-urines. All proteins that were represented by two
peptides were found to be within approximately 1 order of magnitude in abundance, except for
the peptides representing PEDF, which were approximately 2 orders of magnitude different in
abundance.
122
Figure 4.5 Absolute quantification of Phase 3 peptides in EPS-urine. Box plots represent the
median and interquartile range. Whiskers represent the 1-99 percentile. Outliers are represented
by red dots and the mean is represented by ‘+’.
4.3.5 Comparative analysis between subgroups of patients
In order to determine candidates with diagnostic and prognostic potential, comparative analyses
between subgroups of Cohort B patients were carried out using the SRMStats software tool
(version 1.6) (358). A number of candidates were significantly differentially expressed between
cancer and normal individuals, with all candidates being significantly upregulated in the cancer
123
group (Figure 4.6A). The proteins that were represented in this comparison were IDHC,
ANXA3, ANXA1, KLK3, KCRB, SERA, and CNDP2. Interestingly, IDHC, ANXA3, and
KLK3 were represented by the significant upregulation of two peptides for each protein in the
cancer group. When comparing the expression of candidates between the cancer and BPH
groups, ANXA3, SERA, ANXA1, and KLK3 were found to be significantly upregulated in the
cancer group (Figure 4.6B). The protein ANXA3 was represented by two peptides that were
significantly upregulated in cancer. Next, a comparison of pathological stage pT3 and pT2
samples revealed 10 peptides representing HEXB, 6PGL, DOPD, KLK3, SERA, PEDF, PRDX6,
and BTD. Two proteins – KLK3 and PRDX6 were represented by two peptides each (Figure
4.6C). When comparing cancer and noncancer (BPH and normal individuals), ANXA3, IDHC,
SERA, ANXA1, KCRB, and KLK3 were significantly upregulated in cancer (Figure 4.6D).
ANXA3 and KLK3 were represented by two peptides each. Lastly, a comparison between BPH
and normal samples revealed very little alterations, with IDHC being the only protein that was
significantly upregulated in BPH (Figure 4.6E). The relative expression of Phase 3 proteins and
peptides in EPS fluids (127, 244) using different quantitative methods are summarized in Table
4.4.
124
125
Figure 4.6 Comparative quantification of candidates in different subgroups of EPS.
Volcano plots representing the fold-changes of Phase 3 peptides in various subgroups of EPS-
urines from Cohort B. Red dots represent peptides that are upregulated; blue dots represent
peptides that are downregulated; dotted vertical lines represent the boundaries of a 1.5 fold
change; dotted horizontal line represents a p-value of 0.05.
126
Table 4.4 Relative expression of Phase 3 proteins and peptides in EPS fluids using various
quantitative methods.
127
4.4 Discussion and future direction
The timely verification of exhaustive lists of candidate disease biomarkers is becoming ever
more relevant as high-throughput technologies enable in-depth analyses of complex biological
systems exposing a plethora of putative candidates. Despite this, the implementation of
biomarkers into clinical practice is largely lagging behind. This is, in part, attributed to the lack
of validated methods of candidate testing and further evaluation. Targeted proteomics,
specifically SRM-MS, has placed itself at the forefront of candidate protein verification, due to
its relatively low cost and high-throughput assay development workflow. Therefore, the aim of
the current study was to systematically develop a targeted assay and apply it for the quantitative
verification of prostate cancer biomarkers in a sample that is routinely collected in the clinic.
Here, I developed a single multiplexed SRM-MS assay for 34 promising prostate cancer
diagnostic and prognostic candidates in EPS-urine. This work is an extension of previously
published shotgun proteomics data demonstrating the differential expression of a large number of
protein candidates in direct-EPS from patients with EC and OC tumors (described in Chapter 3)
(244). The feasibility of obtaining coordinates (peptide elution times, most intense transitions)
from different modes of data acquisition and instrumentation was demonstrated by extracting
relevant information from discovery data on a LTQ-Orbitrap in order to lay the foundation for
targeted assay development using a triple quadrupole mass spectrometer.
By utilizing semi-quantitative data (spectral counting) and spectral libraries generated from
Chapter 3, the extracted coordinates and most abundant transitions were identified and
systematically evaluated to develop a refined SRM-MS assay for differentially expressed
proteins. By performing multiple rounds of assay refinement (Phases 2 and 3), I was able to
narrow the list of promising candidates from over 200 candidates to 34 peptides with high
diagnostic and prognostic potential. By using heavy isotope-labeled peptides standards, I was
able to estimate and later absolutely quantify the candidates in the EPS-urine background.
Interestingly, a very small number of peptides in EPS-urine overlapped with those that were
previously explored in urine by Hüttenhain and colleagues (253). The cancer associated proteins
that were evaluated in their study were gathered from reports based on protein and nucleic acid
changes in human plasma and tissue (253, 359). In the current study, the evidence was directly
128
derived from comparing EC and OC tumor groups and from prostate-proximal fluids. I have
demonstrated in Chapter 2 that in pre- and post-DRE urines from the same patients, post-DRE
urines (EPS-urines) indeed contain a unique subset of proteins that are exclusive to this proteome
(127).
Nonetheless, there were some potential caveats that were faced in the design of the current study.
The discovery and subsequent Cohort A of samples were obtained from patients with either EC
or OC tumors based on pathology. This is in contrast to the samples used in the verification
phase (Cohort B) where patients were stratified based solely on their pathological stage (pT3
versus pT2). There are slight differences in their designations – the major difference being that
the EC and OC designations were primarily determined on the basis of pathological stage and
margin status (for instance, a pT2c tumor with positive margins and a pT3 tumor without
positive margins would be considered as EC cases). In order to evaluate the effects of these
differences, the expression profiles of candidate peptides in Cohorts A and B of samples were
evaluated (Appendix Figure 4.2). For the majority of peptides, Cohorts A and B were highly
concordant in terms of differential expression between pT3 versus pT2, and EC versus OC, with
the exception of a few proteins (IDHC, ANXA1, K2C8, PEDF, FIBG, and CFAB). When Cohort
A samples were converted to pT3 and pT2 designations, only GELS was affected. It is unclear
whether this effect is due to the difference in patient stratification, patient-to-patient variability,
or technical issues. However, the current evidence suggests that there is no major confounding
effect of patient stratification based on EC versus OC or pT3 versus pT2.
The overall relative expression profiles of the Phase 3 candidates across different EPS datasets
generated in our lab is summarized in Table 4.4. The first dataset was described in Chapter 3 and
in Kim et al., (244), and was the basis for candidate selection in the current study (Figure 4.1,
Discovery stage). Here, the semi-quantitative method of spectral counting was used to estimate
the relative abundance of proteins in direct-EPS from EC and OC tumors. In the same study,
immunoassays (i.e. Western blots) targeting a small number of proteins were performed in pools
of EPS-urine from EC and OC tumors. This data was compared to the expression changes of the
peptide candidates in Cohorts A and B of EPS-urines analyzed in the current study.
Encouragingly, the majority of candidates showed similar changes in direct-EPS and EPS-urine,
supporting the workflow that was used for candidate selection. Nonetheless, some candidates in
the EPS-urine did not reflect the trends observed in direct-EPS. This could be a result of patient
129
heterogeneity, instrumentation, and quantitative methods. Even when comparing the expression
levels of the candidates between EPS-urines from Cohorts A and B, there are some differences.
Interestingly, some pairs of peptides that represent a single protein demonstrate opposing trends.
For instance, the protein ANXA3 has one peptide (LTFDEYR) that is upregulated in the EC
group of EPS-urines in Cohort A, while the other peptide (SEIDLLDIR) is downregulated in the
EC group. This could be due to inaccurate quantification due to the use of crude peptide
standards of varying purities, to the presence of proteoforms (341) and modifications, or
variations in proteolytic digestion efficiencies of endogenous proteins (235, 360).
An interesting finding from the work presented in this chapter is the trend of lower abundance of
the majority of candidates with advancing disease. For instance, elevated serum levels of PSA
are indicative of prostate cancer; however, EPS levels of PSA have been consistently lower in
the EC group compared to the OC group, and in the pT3 group compared to the pT2 group (244).
Similarly, a trend of decreased PSA in EPS-urine from cancer patients compared to noncancer
individuals has also been noted by Drake et al. (255). This may be indicative of PSA leakage out
of the prostate gland and into the circulation, as described in Chapter 1 (129). Similarly, it can be
hypothesized that other proteins are leaking from the prostate and into the circulation with the
deteriorating structural integrity of advanced prostate cancers. It would be beneficial to measure
both EPS and serum levels of such proteins from the same patient, in order to test this
hypothesis.
In the comparison between BPH and normal individuals from Cohort B, only one protein
(IDHC) was found to demonstrate a significant difference in abundance. The remaining peptides
were found to show almost no differential expression. This may be explained by the fact that the
discovery cohort from Chapter 3 (244) lacked a noncancer group consisting of BPH and normal
individuals. Furthermore, the patients in Cohort A were not stratified on the basis of BPH versus
normal designations, and hence candidates that were selected for verification were not
specifically aimed at verifying their differential expression between BPH and normal EPS-
urines.
Currently, the analyses of the data presented in this chapter are ongoing. In collaboration with
Dr. Paul Boutros (OICR), a multi-feature diagnostic and prognostic signature is currently being
generated using machine learning approaches. The features that are being taken into
130
consideration include each of the 34 peptides that were measured in the 207 heterogeneous
verification sample cohort, as well as the clinical parameters that are available. For instance,
serum PSA levels that were taken at the time of sample draw may be combined with a protein
signature to provide enhanced prognostic performance of the signature. Although not obtained
for the current study, additional parameters such a tumor microenvironment parameters may also
provide information about patient outcome. Indeed, in a recent study by Lalonde et al., intra-
prostatic hypoxia was combined with DNA indices (copy number alterations) to predict 5-year
biochemical recurrence (BCR) (144). Such multi-feature signatures may provide a
comprehensive, powerful predictor of patient outcome. Patients enrolled in Cohort B of this
study have clinical information regarding BCR. In the pT2 group, 11 individuals out of 61
developed BCR at least 2 years post-RP; in the pT3 group, 10 individuals out of 29 developed
BCR (Table 4.3). Although not yet explored in this study, one approach to the analysis of the
data may be to comparatively analyze the protein expression changes between individuals who
develop recurrence and those that did not.
4.5 Materials and Methods
4.5.1 Sample collection and properties
Samples were obtained from patients following informed consent and use of Institutional Review
Board approved protocols at Urology of Virginia and Eastern Virginia Medical School and the
Research Ethics Review Board at the University Health Network. Twenty ml of EPS-urine was
centrifuged at 1100 g for 15 minutes at 4 °C to pellet debris. The resulting EPS-urine was
aliquoted in volumes of 3.5 ml and diluted with 2.5 ml of PBS (pH 7.4). The mixture was
vortexed, combined, and centrifuged at 1100 g for 15 minutes. The resulting supernatants were
stored at -40 °C. The clinical information for patients enrolled in the study are detailed in
Appendix Tables 4.4 and 4.5. Prostate cancer patients were selected on the basis of pathological
stage pT2 or pT3. Noncancer individuals had biopsy confirmed BPH or were considered as
individuals with no indication of prostatic disease based on biopsy results.
131
4.5.2 Sample preparation for mass spectrometry
Ultrapure-grade tetrafluoroethylene (TFE), trifluoroacetic acid (TFA), iodoacetamide (IAA), and
dithiotreitol (DTT) were from Sigma-Aldrich. HPLC-grade solvents (methanol, acetonitrile, and
water) and formic acid were from Fisher Scientific. Mass spectrometry-grade trypsin/lys-c was
from Promega (Madison, WI). Amicon spin filter columns, 0.5 ml, 3 kDa MWCO, were from
Millipore. Solid phase extraction C18 tips were from Agilent.
Four ml of urine and EPS-urine were concentrated to approximately 500 μl by using a spin
column with a molecular weight cutoff of 3 kDa, and proteins were precipitated overnight by the
addition of ice-cold 100% methanol. Protein pellets were washed twice with 100% methanol and
air dried. Protein resolubilization and extraction was performed by the addition of 50% TFE at
60 °C for 2 hours. Following reduction with DTT and alkylation with IAA, proteins were
digested overnight at 37 °C using mass trypsin/lys-c. The reaction was quenched by the addition
of TFA. Desalting was performed by solid phase extraction using C18 tips. Solvents were
removed by vacuum centrifugation and peptides were resolubilized in 5% acetonitrile, 0.1 %
formic acid. Peptide concentrations were determined by the micro BCA assay kit (Thermo Fisher
Scientific).
4.5.3 Target acquisition by selected reaction monitoring mass
spectrometry
Samples were analyzed on a TSQ Vantage triple quadrupole mass spectrometer (Thermo Fisher
Scientific) equipped with an EASY-Spray (Thermo Fisher Scientific) electrospray ion source.
Separations were performed on EASY-Spray columns (15 cm x 75 µm ID packed with 3µm C18
particles, Thermo Fisher Scientific) heated to 50 °C. Peptides were kept at 4° C and loaded onto
the column from an EASY-nLC (Thermo Fisher Scientific) autosampler. Chromatographic
conditions were as follows: 40 minute gradient at a flow rate of 300 nl/min starting with 100% A
(water), stepping up to 5% B (ACN) in 5 minutes, followed by 25% B at 35 minutes, followed
by a steep increase to 50% B at 38 minutes and 100% B at 40 minutes.
Targeted acquisition of eluting ions was performed by the mass spectrometer operated in SRM-
MS mode with Q1 and Q3 set to 0.7 m/z fwhm resolution and a cycle time of 1 second. For all
132
SRM-MS runs with the exception of the measurement of Phase 3 peptides EPS-urines from
Cohort B, unscheduled methods were used, each targeting approximately 200 transitions over a 1
second cycle time. For Cohort B, a single scheduled method was utilized (2 minute elution
window), monitoring approximately 200 transitions (3 transitions per peptide) over a 1 second
cycle time.
4.5.4 Phase 2 peptide selection
In Chapter 3, 133 differentially expressed proteins were identified when comparing the proteome
profiles of 16 direct-EPS samples from individuals with EC and OC prostatic tumors. A spectral
library was built from the resulting LTQ-Orbitrap XL data obtained using the Skyline software
tool (version 2.1.0) (350). All spectra had scores that passed a stringent peptide score as
determined by an X!Tandem target decoy search criteria of 0.5% decoy spectral matches. Protein
sequences were converted to FASTA format and uploaded into Skyline for the prediction of
proteotypic peptides. Peptides were chosen based on previously reported specifications (241). A
total of 232 proteotypic peptides (Phase 1 peptides) were selected and purchased as bulk heavy-
isotope labeled peptide standards (JPT Peptide Technologies). In order to assess the suitability of
the Phase 1 peptides for SRM-MS, 250 fmol of each heavy peptide standard was spiked into 1 ug
of EPS-urine-derived peptides, with 4-6 transitions monitored over a 40-minute chromatographic
gradient. Of the 232 Phase 1 peptides, 147 (Phase 2 peptides) were reproducibly detectable with
a minimum of three transitions in the complex EPS-urine background.
4.5.5 Phase 2 peptide quantification and Phase 3 peptide selection
A cohort of individual EPS-urines (n = 74) from a heterogeneous population of patients with EC,
OC, and control (BPH, normal) (Cohort A) was used to analyze the Phase 2 peptides. A total of 1
µg of peptide from each sample was spiked with 200 fmol of the heavy peptide standards.
Visualization and inspection of peaks was performed on Skyline. Each peptide was quantified in
a sample by integrating the quantifier ion (most intense ion) of the light peptide with its co-
eluting heavy peptide ion, in order to derive a light-to-heavy peptide ratio. The student’s T-test
was used to compare the ratios between cancer and controls, as well as EC and OC. A K-fold
cross-validation was performed to investigate the diagnostic and prognostic power of the
peptides at different p-value cutoffs. For the diagnostic and prognostic peptide candidates, p-
133
value cutoffs of 0.05 and 0.1 were used, respectively. Further refinements to the lists were made
by including peptides that did not meet the p-value cutoffs but were potentially promising. For
instance, peptides SSEDPNEDIVER from protein IGJ and TPAQFDADELR from protein
ANXA1 were added to the list of putative prognostic candidates because they were the only
candidates that were elevated in the EC tumor group (p-value = 0.25). Two KLK3 peptides
(HSQPWQVLVASR and LSEPAELTDAVK) were also added in order to monitor PSA levels in
EPS-urine. A total of 34 peptides (24 diagnostic and 14 prognostic peptides, of which 4
overlapped) were taken forward for verification. These peptides are henceforth referred to as
Phase 3 peptides.
4.5.6 Generation of response curves and determination of figures of
merit
Each of the Phase 3 peptides were obtained as highly purified, soluble light and heavy peptides
at defined concentrations (AQUA peptides, Thermo Fisher). A reversed response curve was
generated for each peptide by spiking various concentrations (0-250 fmol, 7 points) of the heavy
peptide into a background of 1 μg urine peptides and 50 fmol of the corresponding light peptide.
Each concentration point was measured in 3-5 technical replicates. The LOD and LLOQ for each
peptide were derived using the following equation: LOD or LOQ = (F x SDblank) / b, where F =
3.3 for LOD and 10 for LOQ and b = slope.
4.5.7 Multiplexed, scheduled selected reaction monitoring mass
spectrometry method
In order to increase throughput, a multiplexed SRM-MS assay was developed by scheduling all
34 candidates in a single 40-minute chromatographic gradient. A total of 3 transitions were
monitored for the light and heavy versions of each peptide for a total of 204 transitions per
analysis. A 2-minute acquisition time window was scheduled around the expected peptide
elution time.
4.5.8 Verification of Phase 3 peptides in EPS-urines
The verification a heterogeneous population of patients with pathological stage pT3 and pT2
prostate tumors, BPH, and normal individuals (n = 207) was enrolled (Cohort B). One μg of total
134
peptide from each sample was spiked with 100 fmol of heavy peptide and 10 fmol of
corresponding light peptide for all candidates with the exception of the KLK3 peptide,
HSQPWQVLVASR, which was spiked in at 500 fmol of heavy peptide. Visualization and
inspection of peaks was performed on Skyline. Each sample was analyzed in two technical
replicates using the same instrument parameters as described above.
4.5.9 Quantitative and statistical analyses
Each of the light and heavy peptides were checked for quality of data by observing co-elution of
all three transitions, alignment of light and heavy peptide elution times, and reproducibility
between technical replicates. Relative ratios of the area under the curve of the most predominant
ion (quantifier) of the light peptide versus the corresponding heavy quantifier were calculated.
For statistical analyses, the SRMstats software tool (version 1.6) (358) was utilized. Peptides
were comparatively tested for abundance differences between Cancer versus Normal; Cancer
versus BPH; Cancer versus Noncancer; BPH versus Normal; and pathological stages pT3 versus
pathological stage pT2. Peptides with a p-value ≤0.05 and a fold change ≥1.5 were considered
significant.
135
Chapter 5 Summary and future direction
5 Summary and future direction
5.1 Summary of key findings
In the current thesis, I used a proteomics platform encompassing the systematic discovery and
verification of putative protein biomarkers in prostate-proximal fluids. Overall, this body of work
provides a high-quality archive of proteins in expressed prostatic secretions (EPS). In addition,
proteins that are differentially expressed among subtypes of prostate cancer those that are
extracapsular (EC) and those that are organ-confined (OC) – were identified. By systematically
following up on these proteins, a number of peptides were verified to be differentially abundant
among these prostate cancer subtypes and a quantitative assay was developed. Ongoing
bioinformatics and statistical analyses are being used to generate a multi-feature signature. These
findings may facilitate the clinical management of prostate cancer by identifying patients who
may be spared from unnecessary medical procedures and have favourable prognoses, circumvent
the need for biopsies, and may be used for longitudinal tracking of disease progression.
Furthermore, proteomic analyses of additional EPS-urine-derived exosomes (126) were
performed during my candidacy, but were not covered in depth in this thesis. Taken together,
over 2000 proteins from prostate-proximal fluids have been comprehensively catalogued, with an
emphasis on the identification of prostate cancer biomarkers (Figure 5.1). A full database of
these datasets is available and can be further interrogated to answer various clinical questions.
5.1.1 In-depth proteomic analyses characterizing the direct and urinary expressed prostatic secretion proteomes
In the first investigation into the EPS proteome in our lab, myself and colleagues analyzed direct-
EPS from nine prostate cancer patients (260). Using multidimensional protein identification
technology (MudPIT) and spectral counting, 916 unique proteins were identified. This was
integrated with publicly available datasets, such as mRNA expression levels across various
human tissues, antibody availability, other proteomics datasets from related fluids and prostate
cancer cell secretomes, in order to characterize this proteome.
136
To describe the EPS-urine proteome, myself and colleagues identified 1022 unique proteins from
EPS-urines from prostate cancer and benign prostatic hyperplasia (BPH) patients (127).
Additionally, in order to identify prostate-enriched proteins, five patients with prostate cancer
and five noncancer individuals were screened pre- and post-digital rectal examination to obtain
urine and EPS-urine, respectively. This valuable, internally controlled sample set provided a total
of 444 unique proteins of which 49 proteins were significantly enriched in EPS-urine compared
to urine using a semi-quantitative method.
5.1.2 Discovery phase: identification of candidate biomarkers of extracapsular prostate cancer
In the discovery phase, a shotgun approach was taken to catalogue the proteomes of a prostate-
proximal fluid, direct expressed prostatic secretions (direct-EPS), from EC and OC prostate
cancers. In total, over 600 proteins were identified, of which 133 were differentially abundant in
the two groups by spectral counting. In an initial verification step, a systematic data mining
strategy was used to select a small subset of differentially expressed proteins. These 14
candidates were then analyzed by Western blotting in a cohort of direct-EPS linked with clinical
outcome information related to biochemical recurrence (BCR) within a two-year period post-
prostatectomy. Five candidates – SFN, MME, PARK7, TIMP1, TGM4 − showed a similar
pattern of differential expression between recurrent and non-recurrent patients as they did for EC
and OC, respectively. As one of the goals of my thesis was to demonstrate the detection of
potential candidates in a clinically relevant biofluid for non-invasive detection, EPS-urine was
obtained from EC and OC prostate cancers. As a proof-of-concept, selected reaction monitoring
mass spectrometry (SRM-MS) was employed for quantification of these candidates. By targeting
at least one proteotypic peptide, all candidates were quantified in the pooled EPS-urines by
SRM-MS. In conclusion, I established EPS as a promising resource of prostate cancer
biomarkers. Furthermore, I demonstrated SRM-MS as a valid method of rapid verification of
putative candidates in biological fluids.
5.1.3 Verification phase: targeted quantification of selected prostate cancer biomarker candidates
The successful application of SRM-MS to EPS-urines in the discovery phase prompted me to
utilize this high-throughput method for the accurate quantification of a larger number of
137
biomarker candidates. By revisiting the 133 differentially abundant proteins determined by
spectral counting, 147 proteotypic peptides corresponding to 82 proteins, were selected and
relatively quantified by SRM-MS in a cohort of individual EPS-urines from EC and OC patients,
as well as noncancer control individuals. From this analysis, 34 candidates were demonstrated to
have diagnostic and prognostic potential. Next, to verify these candidates, absolute quantification
was performed utilizing highly purified peptide standards and a single multiplexed SRM-MS
assay in a new cohort of EPS-urines. This high-quality information is being utilized to generate a
multi-feature diagnostic and prognostic signature with a high potential for facilitating the clinical
management of prostate cancer patients.
5.1.4 Additional proteomics analysis of prostate-proximal fluids
Although not covered in detail in this thesis, I carried out an additional proteomics investigation
into EPS-urine derived exosomes (126). In this study, myself and colleagues analyzed EPS-
urine-derived exosomes from prostate cancer patients and noncancer individuals using mass
spectrometry (126). A total of 877 unique proteins were identified. Characterization of this
proteome revealed a fraction of proteins that were not previously detected in the EPS (direct-EPS
and EPS-urine) proteome or other exosome proteomes.
Taken together, I comprehensively characterized the proteomes of various prostate-proximal
fluids. Through this, I demonstrated that these biofluids are rich sources of prostate-related
proteins and may be a valuable resource of potential disease biomarkers (Figure 5.1).
138
Figure 5.1 Overview of all proteomics datasets generated for various prostate-proximal
fluids. A total of approximately 2000 proteins are available in an in-house database. PCa,
prostate cancer; NC, noncancer; EC, extracapsular; OC, organ-confined; BPH, benign prostatic
hyperplasia.
5.2 Future directions
The data presented in this thesis provides verified proteomic information to generate a potential
non-invasive biomarker panel for the prognosis of prostate cancer. The studies fall into Phases 1
(preclinical exploratory studies) and 2 (clinical assay development) out of the 5 phases of the
biomarker development pipeline as set out by the Early Detection Research Network (112). The
results fulfill the primary aims of Phase 1 by identifying and prioritizing potential biomarkers in
a systematic manner, and providing quantitative measures of protein abundance changes between
subtypes of prostate cancers. In Phase 2, an assay is developed for a non-invasive specimen. Its
primary aim is to assess the clinical characteristics of the assay (such as its sensitivity and
specificity) and its ability to distinguish patient groups. In partial fulfillment of this, a single
multiplexed SRM-MS assay was systematically developed and utilized to generate a biomarker
signature.
139
Following the work presented in the current thesis, a number of critical steps need to be
undertaken in order to further validate the biomarkers for clinical use. First, the verified
differentially expressed peptides are currently being statistically evaluated to generate a multi-
feature signature for the non-invasive diagnosis and prognosis of prostate cancer. Following this,
a multi-institutional evaluation of the signature should be performed. For this, standardized
procedures for the harmonization of sample collection, processing, and storage need to be
instituted. The SRM-MS assay should be optimized for a clinical laboratory setting and standard
operating procedures should be drafted. Once implemented, the intra- and inter-laboratory
reproducibility of the standardized assay, and its analytical robustness must be validated.
Furthermore, the biomarkers presented in the thesis are continuous variables; thus clinically
relevant thresholds and time points that can accurately stratify patients need to be established and
validated (361). One of the major advantages of utilizing EPS-urine is the ease with which the
specimens can be drawn from patients. Therefore, the longitudinal collection of EPS-urine at
regular intervals should be performed in order to monitor changes in the levels of biomarkers and
their relationship to disease progression. Prospective and cancer-control studies should be
performed in large patient cohorts in order to determine the clinical utility of the biomarkers. The
primary aim of a prospective study is to determine the detection rate and the false referral rate of
the biomarker in a relevant population (112). In cancer-control studies, the aim is to estimate the
reduction in cancer burden in the population as a result of testing. Factors that may inhibit a
biomarker from being taken into the clinic include morbidity associated with the test, poor
compliance, and over-diagnosis of cancers that, if left undetected, would pose no threat to
patients (112).
One area that was not investigated in this thesis is the biological functions of putative candidates
and the molecular mechanisms that they are involved in. In Chapter 3, it was shown that a
number of different pathways are enriched in my dataset, including those that regulate the cell
cycle and metabolism (Figures 3.9 and 3.10). These and other candidates can be followed up by
studying the effects of silencing or over-expressing them in prostate cancer cell lines.
Characteristics such as proliferation, apoptosis, migration, and invasiveness can be evaluated
using methods such as proliferation and colony formation assays, annexin V/propidium iodide
apoptosis assay, wound healing, and Matrigel invasion assays. Furthermore, by integrating
140
genomic, transcriptomic, and epigenetic information into the analysis, a systems level view of
prostate cancer biology might be uncovered.
Our laboratory has been cataloguing the proteome of EPS for over 6 years (126, 127, 244, 260),
having identified over 2000 proteins in total (Figure 5.1). Advancing techniques are expected to
provide even deeper insight into this immensely complex, yet highly valuable class of biofluids.
Subfractionation, by extracting exosomes or glycoproteins, for instance, is an active area of
investigation for the identification of novel secreted biomarkers and molecular players in disease
progression. Strategies that enrich for exosomes (e.g., differential ultracentrifugation) and
glycoprotein capture (e.g., solid phase extraction of glycopeptides) greatly enhance the dynamic
range and limit of detection of proteins in highly complex patient specimens.
Extracellular vesicles are small (40-5000 nm diameter) membrane-bound vesicles that carry
biomolecular cargo (proteins, nucleic acids, and lipids) from their cell of origin (178, 362), and
are involved in a myriad of biological processes (179, 363-367). Cancer cells are reported to
increase the release of extracellular vesicles into the extracellular environment compared to
noncancer cells (368-370), and thus these factors may represent an important source of fluid-
derived cancer biomarkers. Indeed, studies have demonstrated the presence of “prostasomes” –
prostate-derived extracellular vesicles – in body fluids, cancer cell lines, and tissue (126, 371-
373). In our lab, myself and colleagues have characterized the proteomes of EPS-urine-derived
exosomes using shotgun proteomics (126). Almost 900 proteins were identified, of which
approximately 200 were not previously detected in direct-EPS and EPS-urines analyzed in our
lab.
Protein glycosylation, the post-translational modification of proteins by the covalent addition of
sugar moieties, plays a fundamental role in a wide range of biological processes (374, 375).
Indeed, alterations in protein glycosylation patterns is a recognized hallmark of cancer
progression (376, 377). Profiling the changes that occur in this subproteome as a result of disease
can provide novel biomarkers for diagnosis and prognosis of cancer. The majority of membrane
and secreted proteins are glycosylated (378); in fact, many of the clinical biomarkers in use
today, such as PSA in prostate cancer, CA125 in ovarian cancer, and Her2/neu in breast cancer,
are glycoproteins.
141
As a continuation of the work presented in this thesis, an in-depth proteomic profiling of such
EPS subproteomes will provide additional putative biomarker candidates that were not
previously identified. As mentioned above, by enriching for these proteins, low abundance
species may be more readily resolved.
Proteomics technologies are continuously evolving to provide even deeper proteome coverage,
better quantitation, and higher throughput. The data independent acquisition (DIA) mode of mass
spectrometry analysis has recently emerged as a powerful tool for simultaneous peptide
identification and quantification (379). In DIA mode, all peptides in a mass-to-charge (m/z)
window of a defined width and a retention time range are fragmented, and MS/MS data is
acquired from all detectable precursor ions in the isolation window. This process is repeated for
the next m/z window in an iterative fashion across the full m/z range, producing a complete and
permanent time-resolved record of all fragment ions of detectable peptide precursors in a sample
(380). With recent improvements in DIA data analysis platforms, the highly complex, composite
fragment ion spectra can be linked back to the precursors from which they originate using a
priori information contained in MS/MS spectral libraries (381). This method increases coverage
and reproducibility of the data by acquiring MS/MS data from all detectable precursor ions in a
sample, while making retrospective data analysis a possibility.
Some of the limitations of SRM-MS assays can be circumvented by a high-resolution, accurate-
mass mass spectrometer. For instance, the Thermo Scientific Q Exactive is a hybrid quadrupole-
Orbitrap mass spectrometer that can be utilized for sensitive targeted quantification of proteins in
complex biological material. In parallel reaction monitoring (PRM) mode, a target precursor ion
is isolated in a quadrupole, fragmented by high energy collisional dissociation, and all resulting
product ions are detected in an Orbitrap mass analyzer, yielding a MS/MS spectrum (382, 383).
Quantification is achieved post-acquisition, by extracting one or more of the fragment ions.
Peterson and colleagues demonstrated similar analytical performance of PRM to that of SRM-
MS in terms of dynamic range, linearity, and precision (382). The major advantage of PRM over
SRM-MS is the improved selectivity by recording all fragment ions in the MS/MS scan, over the
few transitions recorded in SRM-MS. In this way, the assay development time is also reduced
since there is no need to preselect target transitions.
142
5.3 Concluding remarks
Personalized medicine, a field of health care that utilizes each individual’s unique clinical,
genetic, genomic, and environmental information to optimize and guide clinical decision making
(384), is emerging as a feasible and promising modality of cancer management. This is largely
due to tremendous progress made in genomic medicine that encompasses information from
genomes and their cognate factors such as RNA, proteins, and metabolites. This information is
helping researchers navigate the complex landscape of tumor biology and providing a
comprehensive view, enabling more precise risk assessment, diagnosis and prognosis, and
response to therapy (385). For example, protein expression patterns, such as those that were
presented in this thesis, can definitively classify patients into different prognostic subgroups,
facilitating strategic health planning. Continued scientific efforts are expected to lead the way in
personalized medicine and shift the paradigm of healthcare from disease treatment to disease
prevention (384).
143
References
1. McNeal, J. E. (1988) Normal histology of the prostate. The American journal of surgical
pathology 12, 619-633
2. Abate-Shen, C., and Shen, M. M. (2000) Molecular genetics of prostate cancer. Genes Dev
14, 2410-2434
3. Brawer, M. K., Peehl, D. M., Stamey, T. A., and Bostwick, D. G. (1985) Keratin
immunoreactivity in the benign and neoplastic human prostate. Cancer research 45, 3663-
3667
4. Liu, A. Y., True, L. D., LaTray, L., Nelson, P. S., Ellis, W. J., Vessella, R. L., Lange, P. H.,
Hood, L., and van den Engh, G. (1997) Cell-cell interaction in prostate gene regulation and
cytodifferentiation. Proc Natl Acad Sci U S A 94, 10705-10710
5. Nagle, R. B., Ahmann, F. R., McDaniel, K. M., Paquin, M. L., Clark, V. A., and Celniker,
A. (1987) Cytokeratin characterization of human prostatic carcinoma and its derived cell
lines. Cancer research 47, 281-286
6. Sherwood, E. R., Berg, L. A., Mitchell, N. J., McNeal, J. E., Kozlowski, J. M., and Lee, C.
(1990) Differential cytokeratin expression in normal, hyperplastic and malignant epithelial
cells from human prostate. J Urol 143, 167-171
7. Verhagen, A. P., Aalders, T. W., Ramaekers, F. C., Debruyne, F. M., and Schalken, J. A.
(1988) Differential expression of keratins in the basal and luminal compartments of rat
prostatic epithelium during degeneration and regeneration. The Prostate 13, 25-38
8. Bui, M., and Reiter, R. E. (1998) Stem cell genes in androgen-independent prostate cancer.
Cancer metastasis reviews 17, 391-399
9. Abrahamsson, P. A., Cockett, A. T., and di Sant'Agnese, P. A. (1998) Prognostic
significance of neuroendocrine differentiation in clinically localized prostatic carcinoma.
The Prostate. Supplement 8, 37-42
10. di Sant'Agnese, P. A. (1998) Neuroendocrine cells of the prostate and neuroendocrine
differentiation in prostatic carcinoma: a review of morphologic aspects. Urology 51, 121-
124
11. (2014) Canadian Cancer Statistics. In: Society, C. C., ed.
12. Sakr, W. A., Haas, G. P., Cassin, B. F., Pontes, J. E., and Crissman, J. D. (1993) The
frequency of carcinoma and intraepithelial neoplasia of the prostate in young male patients.
J Urol 150, 379-385
13. (1998-2002) World Health Organization Mortality Database. World Health Organization
Mortality Database
144
14. Gronberg, H. (2003) Prostate cancer epidemiology. Lancet 361, 859-864
15. Greene, K. L., Cowan, J. E., Cooperberg, M. R., Meng, M. V., DuChane, J., and Carroll, P.
R. (2005) Who is the average patient presenting with prostate cancer? Urology 66, 76-82
16. Schroder, F. H., Hugosson, J., Roobol, M. J., Tammela, T. L., Ciatto, S., Nelen, V.,
Kwiatkowski, M., Lujan, M., Lilja, H., Zappa, M., Denis, L. J., Recker, F., Berenguer, A.,
Maattanen, L., Bangma, C. H., Aus, G., Villers, A., Rebillard, X., van der Kwast, T.,
Blijenberg, B. G., Moss, S. M., de Koning, H. J., and Auvinen, A. (2009) Screening and
prostate-cancer mortality in a randomized European study. N Engl J Med 360, 1320-1328
17. Parkin, D. M., Bray, F. I., and Devesa, S. S. (2001) Cancer burden in the year 2000. The
global picture. European journal of cancer 37 Suppl 8, S4-66
18. Potosky, A. L., Miller, B. A., Albertsen, P. C., and Kramer, B. S. (1995) The role of
increasing detection in the rising incidence of prostate cancer. JAMA 273, 548-552
19. Osegbe, D. N. (1997) Prostate cancer in Nigerians: facts and nonfacts. J Urol 157, 1340-
1343
20. Ruijter, E., van de Kaa, C., Miller, G., Ruiter, D., Debruyne, F., and Schalken, J. (1999)
Molecular genetics and epidemiology of prostate carcinoma. Endocrine reviews 20, 22-45
21. Meikle, A. W., Smith, J. A., and West, D. W. (1985) Familial factors affecting prostatic
cancer risk and plasma sex-steroid levels. The Prostate 6, 121-128
22. Isaacs, S. D., Kiemeney, L. A., Baffoe-Bonnie, A., Beaty, T. H., and Walsh, P. C. (1995)
Risk of cancer in relatives of prostate cancer probands. Journal of the National Cancer
Institute 87, 991-996
23. Carter, B. S., Bova, G. S., Beaty, T. H., Steinberg, G. D., Childs, B., Isaacs, W. B., and
Walsh, P. C. (1993) Hereditary prostate cancer: epidemiologic and clinical features. J Urol
150, 797-802
24. Gronberg, H., Isaacs, S. D., Smith, J. R., Carpten, J. D., Bova, G. S., Freije, D., Xu, J.,
Meyers, D. A., Collins, F. S., Trent, J. M., Walsh, P. C., and Isaacs, W. B. (1997)
Characteristics of prostate cancer in families potentially linked to the hereditary prostate
cancer 1 (HPC1) locus. JAMA 278, 1251-1255
25. Serrano, M., Hannon, G. J., and Beach, D. (1993) A new regulatory motif in cell-cycle
control causing specific inhibition of cyclin D/CDK4. Nature 366, 704-707
26. Morton, R. A., Jr., Watkins, J. J., Bova, G. S., Wales, M. M., Baylin, S. B., and Isaacs, W.
B. (1996) Hypermethylation of chromosome 17P locus D17S5 in human prostate tissue. J
Urol 156, 512-516
27. Wilbert, D. M., Griffin, J. E., and Wilson, J. D. (1983) Characterization of the cytosol
androgen receptor of the human prostate. The Journal of clinical endocrinology and
metabolism 56, 113-120
145
28. Taplin, M. E., Bubley, G. J., Shuster, T. D., Frantz, M. E., Spooner, A. E., Ogata, G. K.,
Keer, H. N., and Balk, S. P. (1995) Mutation of the androgen-receptor gene in metastatic
androgen-independent prostate cancer. N Engl J Med 332, 1393-1398
29. Labrie, F., Belanger, A., Dupont, A., Luu-The, V., Simard, J., and Labrie, C. (1993) Science
behind total androgen blockade: from gene to combination therapy. Clinical and
investigative medicine. Medecine clinique et experimentale 16, 475-492
30. Culig, Z., Klocker, H., Eberle, J., Kaspar, F., Hobisch, A., Cronauer, M. V., and Bartsch, G.
(1993) DNA sequence of the androgen receptor in prostatic tumor cell lines and tissue
specimens assessed by means of the polymerase chain reaction. The Prostate 22, 11-22
31. Hobisch, A., Culig, Z., Radmayr, C., Bartsch, G., Klocker, H., and Hittmair, A. (1995)
Distant metastases from prostatic carcinoma express androgen receptor protein. Cancer
research 55, 3068-3072
32. Gottlieb, B., Lehvaslaiho, H., Beitel, L. K., Lumbroso, R., Pinsky, L., and Trifiro, M. (1998)
The Androgen Receptor Gene Mutations Database. Nucleic Acids Res 26, 234-238
33. Aihara, M., Wheeler, T. M., Ohori, M., and Scardino, P. T. (1994) Heterogeneity of prostate
cancer in radical prostatectomy specimens. Urology 43, 60-66; discussion 66-67
34. Bostwick, D. G., Shan, A., Qian, J., Darson, M., Maihle, N. J., Jenkins, R. B., and Cheng, L.
(1998) Independent origin of multiple foci of prostatic intraepithelial neoplasia: comparison
with matched foci of prostate carcinoma. Cancer 83, 1995-2002
35. Clark, J., Attard, G., Jhavar, S., Flohr, P., Reid, A., De-Bono, J., Eeles, R., Scardino, P.,
Cuzick, J., Fisher, G., Parker, M. D., Foster, C. S., Berney, D., Kovacs, G., and Cooper, C.
S. (2008) Complex patterns of ETS gene alteration arise during cancer development in the
human prostate. Oncogene 27, 1993-2003
36. Macintosh, C. A., Stower, M., Reid, N., and Maitland, N. J. (1998) Precise microdissection
of human prostate cancers reveals genotypic heterogeneity. Cancer research 58, 23-28
37. Mehra, R., Han, B., Tomlins, S. A., Wang, L., Menon, A., Wasco, M. J., Shen, R., Montie,
J. E., Chinnaiyan, A. M., and Shah, R. B. (2007) Heterogeneity of TMPRSS2 gene
rearrangements in multifocal prostate adenocarcinoma: molecular evidence for an
independent group of diseases. Cancer research 67, 7991-7995
38. Liu, W., Laitinen, S., Khan, S., Vihinen, M., Kowalski, J., Yu, G., Chen, L., Ewing, C. M.,
Eisenberger, M. A., Carducci, M. A., Nelson, W. G., Yegnasubramanian, S., Luo, J., Wang,
Y., Xu, J., Isaacs, W. B., Visakorpi, T., and Bova, G. S. (2009) Copy number analysis
indicates monoclonal origin of lethal metastatic prostate cancer. Nature medicine 15, 559-
565
39. Mehra, R., Tomlins, S. A., Yu, J., Cao, X., Wang, L., Menon, A., Rubin, M. A., Pienta, K.
J., Shah, R. B., and Chinnaiyan, A. M. (2008) Characterization of TMPRSS2-ETS gene
aberrations in androgen-independent metastatic prostate cancer. Cancer research 68, 3584-
3590
146
40. Shen, M. M., and Abate-Shen, C. (2010) Molecular genetics of prostate cancer: new
prospects for old challenges. Genes Dev 24, 1967-2000
41. Sakr, W. A., Grignon, D. J., Crissman, J. D., Heilbrun, L. K., Cassin, B. J., Pontes, J. J., and
Haas, G. P. (1994) High grade prostatic intraepithelial neoplasia (HGPIN) and prostatic
adenocarcinoma between the ages of 20-69: an autopsy study of 249 cases. In vivo 8, 439-
443
42. Shiraishi, T., Watanabe, M., Matsuura, H., Kusano, I., Yatani, R., and Stemmermann, G. N.
(1994) The frequency of latent prostatic carcinoma in young males: the Japanese experience.
In vivo 8, 445-447
43. Yatani, R., Kusano, I., Shiraishi, T., Hayashi, T., and Stemmermann, G. N. (1989) Latent
prostatic carcinoma: pathological and epidemiological aspects. Japanese journal of clinical
oncology 19, 319-326
44. Bostwick, D. G., and Cheng, L. (2012) Precursors of prostate cancer. Histopathology 60, 4-
27
45. Bostwick, D. G. (1989) Prostatic intraepithelial neoplasia (PIN). Urology 34, 16-22
46. De Marzo, A. M., Meeker, A. K., Zha, S., Luo, J., Nakayama, M., Platz, E. A., Isaacs, W.
B., and Nelson, W. G. (2003) Human prostate cancer precursors and pathobiology. Urology
62, 55-62
47. Bostwick, D. G., and Brawer, M. K. (1987) Prostatic intra-epithelial neoplasia and early
invasion in prostate cancer. Cancer 59, 788-794
48. Haggman, M. J., Macoska, J. A., Wojno, K. J., and Oesterling, J. E. (1997) The relationship
between prostatic intraepithelial neoplasia and prostate cancer: critical issues. J Urol 158,
12-22
49. Haggman, M. J., Wojno, K. J., Pearsall, C. P., and Macoska, J. A. (1997) Allelic loss of 8p
sequences in prostatic intraepithelial neoplasia and carcinoma. Urology 50, 643-647
50. Nagle, R. B., Brawer, M. K., Kittelson, J., and Clark, V. (1991) Phenotypic relationships of
prostatic intraepithelial neoplasia to invasive prostatic carcinoma. The American journal of
pathology 138, 119-128
51. Qian, J., Bostwick, D. G., Takahashi, S., Borell, T. J., Herath, J. F., Lieber, M. M., and
Jenkins, R. B. (1995) Chromosomal anomalies in prostatic intraepithelial neoplasia and
carcinoma detected by fluorescence in situ hybridization. Cancer research 55, 5408-5414
52. Sakr, W. A., Macoska, J. A., Benson, P., Grignon, D. J., Wolman, S. R., Pontes, J. E., and
Crissman, J. D. (1994) Allelic loss in locally metastatic, multisampled prostate cancer.
Cancer research 54, 3273-3277
53. Vocke, C. D., Pozzatti, R. O., Bostwick, D. G., Florence, C. D., Jennings, S. B., Strup, S. E.,
Duray, P. H., Liotta, L. A., Emmert-Buck, M. R., and Linehan, W. M. (1996) Analysis of 99
147
microdissected prostate carcinomas reveals a high frequency of allelic loss on chromosome
8p12-21. Cancer research 56, 2411-2416
54. Lavery, H. J., and Droller, M. J. (2012) Do Gleason patterns 3 and 4 prostate cancer
represent separate disease states? J Urol 188, 1667-1675
55. Bubendorf, L., Schopfer, A., Wagner, U., Sauter, G., Moch, H., Willi, N., Gasser, T. C., and
Mihatsch, M. J. (2000) Metastatic patterns of prostate cancer: an autopsy study of 1,589
patients. Human pathology 31, 578-583
56. Logothetis, C. J., and Lin, S. H. (2005) Osteoblasts in prostate cancer metastasis to bone.
Nat Rev Cancer 5, 21-28
57. Attard, G., Swennenhuis, J. F., Olmos, D., Reid, A. H., Vickers, E., A'Hern, R., Levink, R.,
Coumans, F., Moreira, J., Riisnaes, R., Oommen, N. B., Hawche, G., Jameson, C.,
Thompson, E., Sipkema, R., Carden, C. P., Parker, C., Dearnaley, D., Kaye, S. B., Cooper,
C. S., Molina, A., Cox, M. E., Terstappen, L. W., and de Bono, J. S. (2009) Characterization
of ERG, AR and PTEN gene status in circulating tumor cells from patients with castration-
resistant prostate cancer. Cancer research 69, 2912-2918
58. Holcomb, I. N., Grove, D. I., Kinnunen, M., Friedman, C. L., Gallaher, I. S., Morgan, T. M.,
Sather, C. L., Delrow, J. J., Nelson, P. S., Lange, P. H., Ellis, W. J., True, L. D., Young, J.
M., Hsu, L., Trask, B. J., and Vessella, R. L. (2008) Genomic alterations indicate tumor
origin and varied metastatic potential of disseminated cells from prostate cancer patients.
Cancer research 68, 5599-5608
59. Leversha, M. A., Han, J., Asgari, Z., Danila, D. C., Lin, O., Gonzalez-Espinoza, R., Anand,
A., Lilja, H., Heller, G., Fleisher, M., and Scher, H. I. (2009) Fluorescence in situ
hybridization analysis of circulating tumor cells in metastatic prostate cancer. Clin Cancer
Res 15, 2091-2097
60. Ploussard, G., Epstein, J. I., Montironi, R., Carroll, P. R., Wirth, M., Grimm, M. O., Bjartell,
A. S., Montorsi, F., Freedland, S. J., Erbersdobler, A., and van der Kwast, T. H. (2011) The
contemporary concept of significant versus insignificant prostate cancer. Eur Urol 60, 291-
303
61. Epstein, J. I., Walsh, P. C., Carmichael, M., and Brendler, C. B. (1994) Pathologic and
clinical findings to predict tumor extent of nonpalpable (stage T1c) prostate cancer. JAMA
271, 368-374
62. Loeb, S., Bjurlin, M. A., Nicholson, J., Tammela, T. L., Penson, D. F., Carter, H. B.,
Carroll, P., and Etzioni, R. (2014) Overdiagnosis and overtreatment of prostate cancer. Eur
Urol 65, 1046-1055
63. Epstein, J. I., Pizov, G., and Walsh, P. C. (1993) Correlation of pathologic findings with
progression after radical retropubic prostatectomy. Cancer 71, 3582-3593
64. Epstein, J. I., Chan, D. W., Sokoll, L. J., Walsh, P. C., Cox, J. L., Rittenhouse, H., Wolfert,
R., and Carter, H. B. (1998) Nonpalpable stage T1c prostate cancer: prediction of
148
insignificant disease using free/total prostate specific antigen levels and needle biopsy
findings. J Urol 160, 2407-2411
65. Kapoor, J., Namdarian, B., Pedersen, J., Hovens, C., Moon, D., Peters, J., Costello, A. J.,
Ruljancich, P., and Corcoran, N. M. (2013) Extraprostatic extension into periprostatic fat is
a more important determinant of prostate cancer recurrence than an invasive phenotype. J
Urol 190, 2061-2066
66. van Oort, I. M., Kok, D. E., Kiemeney, L. A., Hulsbergen-van de Kaa, C. A., and Witjes, J.
A. (2009) A single institution experience with biochemical recurrence after radical
prostatectomy for tumors that on pathology are of small volume or "insignificant". Urol
Oncol 27, 509-513
67. Moul, J. W. (2000) Prostate specific antigen only progression of prostate cancer. J Urol 163,
1632-1642
68. Polascik, T. J., Oesterling, J. E., and Partin, A. W. (1999) Prostate specific antigen: a decade
of discovery--what we have learned and where we are going. J Urol 162, 293-306
69. Boorjian, S. A., Thompson, R. H., Tollefson, M. K., Rangel, L. J., Bergstralh, E. J., Blute,
M. L., and Karnes, R. J. (2011) Long-term risk of clinical progression after biochemical
recurrence following radical prostatectomy: the impact of time from surgery to recurrence.
Eur Urol 59, 893-899
70. Pound, C. R., Partin, A. W., Eisenberger, M. A., Chan, D. W., Pearson, J. D., and Walsh, P.
C. (1999) Natural history of progression after PSA elevation following radical
prostatectomy. JAMA 281, 1591-1597
71. Kapadia, N. S., Olson, K., Sandler, H. M., Feng, F. Y., and Hamstra, D. A. (2012) Interval
to biochemical failure as a biomarker for cause-specific and overall survival after dose-
escalated external beam radiation therapy for prostate cancer. Cancer 118, 2059-2068
72. Freedland, S. J., Humphreys, E. B., Mangold, L. A., Eisenberger, M., Dorey, F. J., Walsh, P.
C., and Partin, A. W. (2005) Risk of prostate cancer-specific mortality following
biochemical recurrence after radical prostatectomy. JAMA 294, 433-439
73. Baca, S. C., Prandi, D., Lawrence, M. S., Mosquera, J. M., Romanel, A., Drier, Y., Park, K.,
Kitabayashi, N., MacDonald, T. Y., Ghandi, M., Van Allen, E., Kryukov, G. V., Sboner, A.,
Theurillat, J. P., Soong, T. D., Nickerson, E., Auclair, D., Tewari, A., Beltran, H., Onofrio,
R. C., Boysen, G., Guiducci, C., Barbieri, C. E., Cibulskis, K., Sivachenko, A., Carter, S. L.,
Saksena, G., Voet, D., Ramos, A. H., Winckler, W., Cipicchio, M., Ardlie, K., Kantoff, P.
W., Berger, M. F., Gabriel, S. B., Golub, T. R., Meyerson, M., Lander, E. S., Elemento, O.,
Getz, G., Demichelis, F., Rubin, M. A., and Garraway, L. A. (2013) Punctuated evolution of
prostate cancer genomes. Cell 153, 666-677
74. Schoenborn, J. R., Nelson, P., and Fang, M. (2013) Genomic profiling defines subtypes of
prostate cancer with the potential for therapeutic stratification. Clin Cancer Res 19, 4058-
4066
149
75. Beroukhim, R., Mermel, C. H., Porter, D., Wei, G., Raychaudhuri, S., Donovan, J.,
Barretina, J., Boehm, J. S., Dobson, J., Urashima, M., Mc Henry, K. T., Pinchback, R. M.,
Ligon, A. H., Cho, Y. J., Haery, L., Greulich, H., Reich, M., Winckler, W., Lawrence, M.
S., Weir, B. A., Tanaka, K. E., Chiang, D. Y., Bass, A. J., Loo, A., Hoffman, C., Prensner,
J., Liefeld, T., Gao, Q., Yecies, D., Signoretti, S., Maher, E., Kaye, F. J., Sasaki, H., Tepper,
J. E., Fletcher, J. A., Tabernero, J., Baselga, J., Tsao, M. S., Demichelis, F., Rubin, M. A.,
Janne, P. A., Daly, M. J., Nucera, C., Levine, R. L., Ebert, B. L., Gabriel, S., Rustgi, A. K.,
Antonescu, C. R., Ladanyi, M., Letai, A., Garraway, L. A., Loda, M., Beer, D. G., True, L.
D., Okamoto, A., Pomeroy, S. L., Singer, S., Golub, T. R., Lander, E. S., Getz, G., Sellers,
W. R., and Meyerson, M. (2010) The landscape of somatic copy-number alteration across
human cancers. Nature 463, 899-905
76. Taylor, B. S., Schultz, N., Hieronymus, H., Gopalan, A., Xiao, Y., Carver, B. S., Arora, V.
K., Kaushik, P., Cerami, E., Reva, B., Antipin, Y., Mitsiades, N., Landers, T., Dolgalev, I.,
Major, J. E., Wilson, M., Socci, N. D., Lash, A. E., Heguy, A., Eastham, J. A., Scher, H. I.,
Reuter, V. E., Scardino, P. T., Sander, C., Sawyers, C. L., and Gerald, W. L. (2010)
Integrative genomic profiling of human prostate cancer. Cancer Cell 18, 11-22
77. Tomlins, S. A., Rhodes, D. R., Perner, S., Dhanasekaran, S. M., Mehra, R., Sun, X. W.,
Varambally, S., Cao, X., Tchinda, J., Kuefer, R., Lee, C., Montie, J. E., Shah, R. B., Pienta,
K. J., Rubin, M. A., and Chinnaiyan, A. M. (2005) Recurrent fusion of TMPRSS2 and ETS
transcription factor genes in prostate cancer. Science 310, 644-648
78. Barbieri, C. E., Baca, S. C., Lawrence, M. S., Demichelis, F., Blattner, M., Theurillat, J. P.,
White, T. A., Stojanov, P., Van Allen, E., Stransky, N., Nickerson, E., Chae, S. S., Boysen,
G., Auclair, D., Onofrio, R. C., Park, K., Kitabayashi, N., MacDonald, T. Y., Sheikh, K.,
Vuong, T., Guiducci, C., Cibulskis, K., Sivachenko, A., Carter, S. L., Saksena, G., Voet, D.,
Hussain, W. M., Ramos, A. H., Winckler, W., Redman, M. C., Ardlie, K., Tewari, A. K.,
Mosquera, J. M., Rupp, N., Wild, P. J., Moch, H., Morrissey, C., Nelson, P. S., Kantoff, P.
W., Gabriel, S. B., Golub, T. R., Meyerson, M., Lander, E. S., Getz, G., Rubin, M. A., and
Garraway, L. A. (2012) Exome sequencing identifies recurrent SPOP, FOXA1 and MED12
mutations in prostate cancer. Nature genetics 44, 685-689
79. Kan, Z., Jaiswal, B. S., Stinson, J., Janakiraman, V., Bhatt, D., Stern, H. M., Yue, P.,
Haverty, P. M., Bourgon, R., Zheng, J., Moorhead, M., Chaudhuri, S., Tomsho, L. P.,
Peters, B. A., Pujara, K., Cordes, S., Davis, D. P., Carlton, V. E., Yuan, W., Li, L., Wang,
W., Eigenbrot, C., Kaminker, J. S., Eberhard, D. A., Waring, P., Schuster, S. C., Modrusan,
Z., Zhang, Z., Stokoe, D., de Sauvage, F. J., Faham, M., and Seshagiri, S. (2010) Diverse
somatic mutation patterns and pathway alterations in human cancers. Nature 466, 869-873
80. Kumar, A., White, T. A., MacKenzie, A. P., Clegg, N., Lee, C., Dumpit, R. F., Coleman, I.,
Ng, S. B., Salipante, S. J., Rieder, M. J., Nickerson, D. A., Corey, E., Lange, P. H.,
Morrissey, C., Vessella, R. L., Nelson, P. S., and Shendure, J. (2011) Exome sequencing
identifies a spectrum of mutation frequencies in advanced and lethal prostate cancers. Proc
Natl Acad Sci U S A 108, 17087-17092
81. Grasso, C. S., Wu, Y. M., Robinson, D. R., Cao, X., Dhanasekaran, S. M., Khan, A. P.,
Quist, M. J., Jing, X., Lonigro, R. J., Brenner, J. C., Asangani, I. A., Ateeq, B., Chun, S. Y.,
150
Siddiqui, J., Sam, L., Anstett, M., Mehra, R., Prensner, J. R., Palanisamy, N., Ryslik, G. A.,
Vandin, F., Raphael, B. J., Kunju, L. P., Rhodes, D. R., Pienta, K. J., Chinnaiyan, A. M.,
and Tomlins, S. A. (2012) The mutational landscape of lethal castration-resistant prostate
cancer. Nature 487, 239-243
82. Eastham, J. A., Stapleton, A. M., Gousse, A. E., Timme, T. L., Yang, G., Slawin, K. M.,
Wheeler, T. M., Scardino, P. T., and Thompson, T. C. (1995) Association of p53 mutations
with metastatic prostate cancer. Clin Cancer Res 1, 1111-1118
83. Cairns, P., Okami, K., Halachmi, S., Halachmi, N., Esteller, M., Herman, J. G., Jen, J.,
Isaacs, W. B., Bova, G. S., and Sidransky, D. (1997) Frequent inactivation of
PTEN/MMAC1 in primary prostate cancer. Cancer research 57, 4997-5000
84. Herschman, J. D., Smith, D. S., and Catalona, W. J. (1997) Effect of ejaculation on serum
total and free prostate-specific antigen concentrations. Urology 50, 239-243
85. Wolf, A. M., Wender, R. C., Etzioni, R. B., Thompson, I. M., D'Amico, A. V., Volk, R. J.,
Brooks, D. D., Dash, C., Guessous, I., Andrews, K., DeSantis, C., Smith, R. A., and
American Cancer Society Prostate Cancer Advisory, C. (2010) American Cancer Society
guideline for the early detection of prostate cancer: update 2010. CA: a cancer journal for
clinicians 60, 70-98
86. Brawer, M. K., Chetner, M. P., Beatie, J., Buchner, D. M., Vessella, R. L., and Lange, P. H.
(1992) Screening for prostatic carcinoma with prostate specific antigen. J Urol 147, 841-845
87. Catalona, W. J., Richie, J. P., Ahmann, F. R., Hudson, M. A., Scardino, P. T., Flanigan, R.
C., deKernion, J. B., Ratliff, T. L., Kavoussi, L. R., and Dalkin, B. L. (1994) Comparison of
digital rectal examination and serum prostate specific antigen in the early detection of
prostate cancer: results of a multicenter clinical trial of 6,630 men. J Urol 151, 1283-1290
88. Schroder, F. H., van der Cruijsen-Koeter, I., de Koning, H. J., Vis, A. N., Hoedemaeker, R.
F., and Kranse, R. (2000) Prostate cancer detection at low prostate specific antigen. J Urol
163, 806-812
89. Coley, C. M., Barry, M. J., Fleming, C., and Mulley, A. G. (1997) Early detection of
prostate cancer. Part I: Prior probability and effectiveness of tests. The American College of
Physicians. Annals of internal medicine 126, 394-406
90. Thompson, I. M., Pauler, D. K., Goodman, P. J., Tangen, C. M., Lucia, M. S., Parnes, H. L.,
Minasian, L. M., Ford, L. G., Lippman, S. M., Crawford, E. D., Crowley, J. J., and Coltman,
C. A., Jr. (2004) Prevalence of prostate cancer among men with a prostate-specific antigen
level < or =4.0 ng per milliliter. N Engl J Med 350, 2239-2246
91. Loeb, S., Vonesh, E. F., Metter, E. J., Carter, H. B., Gann, P. H., and Catalona, W. J. (2011)
What is the true number needed to screen and treat to save a life with prostate-specific
antigen testing? J Clin Oncol 29, 464-467
92. Grisanzio, C., and Signoretti, S. (2008) p63 in prostate biology and pathology. Journal of
cellular biochemistry 103, 1354-1368
151
93. Humphrey, P. A. (2007) Diagnosis of adenocarcinoma in prostate needle biopsy tissue.
Journal of clinical pathology 60, 35-42
94. Jiang, Z., Li, C., Fischer, A., Dresser, K., and Woda, B. A. (2005) Using an AMACR
(P504S)/34betaE12/p63 cocktail for the detection of small focal prostate carcinoma in
needle biopsy specimens. American journal of clinical pathology 123, 231-236
95. Luo, J., Zha, S., Gage, W. R., Dunn, T. A., Hicks, J. L., Bennett, C. J., Ewing, C. M., Platz,
E. A., Ferdinandusse, S., Wanders, R. J., Trent, J. M., Isaacs, W. B., and De Marzo, A. M.
(2002) Alpha-methylacyl-CoA racemase: a new molecular marker for prostate cancer.
Cancer research 62, 2220-2226
96. Hayes, D. F., Bast, R. C., Desch, C. E., Fritsche, H., Jr., Kemeny, N. E., Jessup, J. M.,
Locker, G. Y., Macdonald, J. S., Mennel, R. G., Norton, L., Ravdin, P., Taube, S., and
Winn, R. J. (1996) Tumor marker utility grading system: a framework to evaluate clinical
utility of tumor markers. Journal of the National Cancer Institute 88, 1456-1466
97. Gleason, D. F., and Mellinger, G. T. (1974) Prediction of prognosis for prostatic
adenocarcinoma by combined histological grading and clinical staging. J Urol 111, 58-64
98. Gleason, D. F. (1990) Histologic grading of prostatic carcinoma. In Pathology of the
Prostate, Churchill Livingstone, New York
99. Martin, N. E., Mucci, L. A., Loda, M., and Depinho, R. A. (2011) Prognostic determinants
in prostate cancer. Cancer J 17, 429-437
100. D'Amico, A. V., Whittington, R., Malkowicz, S. B., Schultz, D., Blank, K., Broderick, G.
A., Tomaszewski, J. E., Renshaw, A. A., Kaplan, I., Beard, C. J., and Wein, A. (1998)
Biochemical outcome after radical prostatectomy, external beam radiation therapy, or
interstitial radiation therapy for clinically localized prostate cancer. JAMA 280, 969-974
101. Stark, J. R., Perner, S., Stampfer, M. J., Sinnott, J. A., Finn, S., Eisenstein, A. S., Ma, J.,
Fiorentino, M., Kurth, T., Loda, M., Giovannucci, E. L., Rubin, M. A., and Mucci, L. A.
(2009) Gleason score and lethal prostate cancer: does 3 + 4 = 4 + 3? J Clin Oncol 27, 3459-
3464
102. Wright, J. L., Salinas, C. A., Lin, D. W., Kolb, S., Koopmeiners, J., Feng, Z., and Stanford,
J. L. (2009) Prostate cancer specific mortality and Gleason 7 disease differences in prostate
cancer outcomes between cases with Gleason 4 + 3 and Gleason 3 + 4 tumors in a
population based cohort. J Urol 182, 2702-2707
103. Bostwick, D. G., Leske, D. A., Qian, J., and Sinha, A. A. (1995) Prostatic intraepithelial
neoplasia and well differentiated adenocarcinoma maintain an intact basement membrane.
Pathology, research and practice 191, 850-855
104. Epstein, J. I., Partin, A. W., Sauvageot, J., and Walsh, P. C. (1996) Prediction of progression
following radical prostatectomy. A multivariate analysis of 721 men with long-term follow-
up. The American journal of surgical pathology 20, 286-292
152
105. Albertsen, P. C., Hanley, J. A., Gleason, D. F., and Barry, M. J. (1998) Competing risk
analysis of men aged 55 to 74 years at diagnosis managed conservatively for clinically
localized prostate cancer. JAMA 280, 975-980
106. Barry, M. J., Albertsen, P. C., Bagshaw, M. A., Blute, M. L., Cox, R., Middleton, R. G.,
Gleason, D. F., Zincke, H., Bergstralh, E. J., and Jacobsen, S. J. (2001) Outcomes for men
with clinically nonmetastatic prostate carcinoma managed with radical prostactectomy,
external beam radiotherapy, or expectant management: a retrospective analysis. Cancer 91,
2302-2314
107. Humphrey, P. A. (2004) Gleason grading and prognostic factors in carcinoma of the
prostate. Modern pathology : an official journal of the United States and Canadian
Academy of Pathology, Inc 17, 292-306
108. Ruijter, E. T., van de Kaa, C. A., Schalken, J. A., Debruyne, F. M., and Ruiter, D. J. (1996)
Histological grade heterogeneity in multifocal prostate cancer. Biological and clinical
implications. J Pathol 180, 295-299
109. Pan, C. C., Potter, S. R., Partin, A. W., and Epstein, J. I. (2000) The prognostic significance
of tertiary Gleason patterns of higher grade in radical prostatectomy specimens: a proposal
to modify the Gleason grading system. The American journal of surgical pathology 24, 563-
569
110. Srigley, J. R., Amin, M. B., Bostwick, D. G., Grignon, D. J., and Hammond, M. E. (2000)
Updated protocol for the examination of specimens from patients with carcinomas of the
prostate gland: a basis for checklists. Cancer Committee. Arch Pathol Lab Med 124, 1034-
1039
111. Prensner, J. R., Rubin, M. A., Wei, J. T., and Chinnaiyan, A. M. (2012) Beyond PSA: the
next generation of prostate cancer biomarkers. Sci Transl Med 4, 127rv123
112. Pepe, M. S., Etzioni, R., Feng, Z., Potter, J. D., Thompson, M. L., Thornquist, M., Winget,
M., and Yasui, Y. (2001) Phases of biomarker development for early detection of cancer.
Journal of the National Cancer Institute 93, 1054-1061
113. Fuzery, A. K., Levin, J., Chan, M. M., and Chan, D. W. (2013) Translation of proteomic
biomarkers into FDA approved cancer diagnostics: issues and challenges. Clin Proteomics
10, 13
114. Zweig, M. H. (1993) ROC plots display test accuracy, but are still limited by the study
design. Clin Chem 39, 1345-1346
115. Zweig, M. H., and Campbell, G. (1993) Receiver-operating characteristic (ROC) plots: a
fundamental evaluation tool in clinical medicine. Clin Chem 39, 561-577
116. Scott, M. G. (2010) When do new biomarkers make economic sense? Scandinavian journal
of clinical and laboratory investigation. Supplementum 242, 90-95
153
117. Mani, D. R., Abbatiello, S. E., and Carr, S. A. (2012) Statistical characterization of
multiple-reaction monitoring mass spectrometry (MRM-MS) assays for quantitative
proteomics. BMC Bioinformatics 13 Suppl 16, S9
118. Rifai, N., Gillette, M. A., and Carr, S. A. (2006) Protein biomarker discovery and validation:
the long and uncertain path to clinical utility. Nat Biotechnol 24, 971-983
119. Kim, Y., and Kislinger, T. (2013) Novel approaches for the identification of biomarkers of
aggressive prostate cancer. Genome medicine 5, 56
120. Rosty, C., Christa, L., Kuzdzal, S., Baldwin, W. M., Zahurak, M. L., Carnot, F., Chan, D.
W., Canto, M., Lillemoe, K. D., Cameron, J. L., Yeo, C. J., Hruban, R. H., and Goggins, M.
(2002) Identification of hepatocarcinoma-intestine-pancreas/pancreatitis-associated protein I
as a biomarker for pancreatic ductal adenocarcinoma by protein biochip technology. Cancer
research 62, 1868-1875
121. Sedlaczek, P., Frydecka, I., Gabrys, M., Van Dalen, A., Einarsson, R., and Harlozinska, A.
(2002) Comparative analysis of CA125, tissue polypeptide specific antigen, and soluble
interleukin-2 receptor alpha levels in sera, cyst, and ascitic fluids from patients with ovarian
carcinoma. Cancer 95, 1886-1893
122. Srivastava, A., Goldberger, H., Dimtchev, A., Ramalinga, M., Chijioke, J., Marian, C.,
Oermann, E. K., Uhm, S., Kim, J. S., Chen, L. N., Li, X., Berry, D. L., Kallakury, B. V.,
Chauhan, S. C., Collins, S. P., Suy, S., and Kumar, D. (2013) MicroRNA profiling in
prostate cancer--the diagnostic potential of urinary miR-205 and miR-214. PLoS One 8,
e76994
123. Lee, B., Mazar, J., Aftab, M. N., Qi, F., Shelley, J., Li, J. L., Govindarajan, S., Valerio, F.,
Rivera, I., Thurn, T., Tran, T. A., Kameh, D., Patel, V., and Perera, R. J. (2014) Long
noncoding RNAs as putative biomarkers for prostate cancer detection. The Journal of
molecular diagnostics : JMD 16, 615-626
124. Adachi, J., Kumar, C., Zhang, Y., Olsen, J. V., and Mann, M. (2006) The human urinary
proteome contains more than 1500 proteins, including a large proportion of membrane
proteins. Genome Biol 7, R80
125. Gonzales, P. A., Zhou, H., Pisitkun, T., Wang, N. S., Star, R. A., Knepper, M. A., and Yuen,
P. S. (2010) Isolation and purification of exosomes in urine. Methods in molecular biology
641, 89-99
126. Principe, S., Jones, E. E., Kim, Y., Sinha, A., Nyalwidhe, J. O., Brooks, J., Semmes, O. J.,
Troyer, D. A., Lance, R. S., Kislinger, T., and Drake, R. R. (2013) In-depth proteomic
analyses of exosomes isolated from expressed prostatic secretions in urine. Proteomics 13,
1667-1671
127. Principe, S., Kim, Y., Fontana, S., Ignatchenko, V., Nyalwidhe, J. O., Lance, R. S., Troyer,
D. A., Alessandro, R., Semmes, O. J., Kislinger, T., Drake, R. R., and Medin, J. A. (2012)
Identification of prostate-enriched proteins by in-depth proteomic analyses of expressed
prostatic secretions in urine. J Proteome Res 11, 2386-2396
154
128. Theodorescu, D., Schiffer, E., Bauer, H. W., Douwes, F., Eichhorn, F., Polley, R., Schmidt,
T., Schofer, W., Zurbig, P., Good, D. M., Coon, J. J., and Mischak, H. (2008) Discovery and
validation of urinary biomarkers for prostate cancer. Proteomics Clin Appl 2, 556-570
129. Kulasingam, V., and Diamandis, E. P. (2008) Strategies for discovering novel cancer
biomarkers through utilization of emerging technologies. Nature clinical practice. Oncology
5, 588-599
130. Jarjanazi, H., Savas, S., Pabalan, N., Dennis, J. W., and Ozcelik, H. (2008) Biological
implications of SNPs in signal peptide domains of human proteins. Proteins 70, 394-403
131. Kay, P., Yang, Y. C., and Paraoan, L. (2013) Directional protein secretion by the retinal
pigment epithelium: roles in retinal health and the development of age-related macular
degeneration. Journal of cellular and molecular medicine 17, 833-843
132. Scholzen, T., and Gerdes, J. (2000) The Ki-67 protein: from the known and the unknown. J
Cell Physiol 182, 311-322
133. Berney, D. M., Gopalan, A., Kudahetti, S., Fisher, G., Ambroisine, L., Foster, C. S., Reuter,
V., Eastham, J., Moller, H., Kattan, M. W., Gerald, W., Cooper, C., Scardino, P., and
Cuzick, J. (2009) Ki-67 and outcome in clinically localised prostate cancer: analysis of
conservatively treated prostate cancer patients from the Trans-Atlantic Prostate Group
study. Br J Cancer 100, 888-893
134. Khor, L. Y., Bae, K., Paulus, R., Al-Saleem, T., Hammond, M. E., Grignon, D. J., Che, M.,
Venkatesan, V., Byhardt, R. W., Rotman, M., Hanks, G. E., Sandler, H. M., and Pollack, A.
(2009) MDM2 and Ki-67 predict for distant metastasis and mortality in men treated with
radiotherapy and androgen deprivation for prostate cancer: RTOG 92-02. J Clin Oncol 27,
3177-3184
135. Bettencourt, M. C., Bauer, J. J., Sesterhenn, I. A., Mostofi, F. K., McLeod, D. G., and Moul,
J. W. (1996) Ki-67 expression is a prognostic marker of prostate cancer recurrence after
radical prostatectomy. J Urol 156, 1064-1068
136. Fisher, G., Yang, Z. H., Kudahetti, S., Moller, H., Scardino, P., Cuzick, J., and Berney, D.
M. (2013) Prognostic value of Ki-67 for prostate cancer death in a conservatively managed
cohort. Br J Cancer 108, 271-277
137. Murphy, A. J., Hughes, C. A., Lannigan, G., Sheils, O., O'Leary, J., and Loftus, B. (2007)
Heterogeneous expression of alpha-methylacyl-CoA racemase in prostatic cancer correlates
with Gleason score. Histopathology 50, 243-251
138. Rubin, M. A., Bismar, T. A., Andren, O., Mucci, L., Kim, R., Shen, R., Ghosh, D., Wei, J.
T., Chinnaiyan, A. M., Adami, H. O., Kantoff, P. W., and Johansson, J. E. (2005) Decreased
alpha-methylacyl CoA racemase expression in localized prostate cancer is associated with
an increased rate of biochemical recurrence and cancer-specific death. Cancer Epidemiol
Biomarkers Prev 14, 1424-1432
155
139. Chang, S. S. (2004) Overview of prostate-specific membrane antigen. Rev Urol 6 Suppl 10,
S13-18
140. Ross, J. S., Sheehan, C. E., Fisher, H. A., Kaufman, R. P., Jr., Kaur, P., Gray, K., Webb, I.,
Gray, G. S., Mosher, R., and Kallakury, B. V. (2003) Correlation of primary tumor prostate-
specific membrane antigen expression with disease recurrence in prostate cancer. Clin
Cancer Res 9, 6357-6362
141. Perner, S., Hofer, M. D., Kim, R., Shah, R. B., Li, H., Moller, P., Hautmann, R. E.,
Gschwend, J. E., Kuefer, R., and Rubin, M. A. (2007) Prostate-specific membrane antigen
expression as a predictor of prostate cancer progression. Human pathology 38, 696-701
142. Cuzick, J., Swanson, G. P., Fisher, G., Brothman, A. R., Berney, D. M., Reid, J. E., Mesher,
D., Speights, V. O., Stankiewicz, E., Foster, C. S., Moller, H., Scardino, P., Warren, J. D.,
Park, J., Younus, A., Flake, D. D., 2nd, Wagner, S., Gutin, A., Lanchbury, J. S., and Stone,
S. (2011) Prognostic value of an RNA expression signature derived from cell cycle
proliferation genes in patients with prostate cancer: a retrospective study. Lancet Oncol 12,
245-255
143. Ding, Z., Wu, C. J., Chu, G. C., Xiao, Y., Ho, D., Zhang, J., Perry, S. R., Labrot, E. S., Wu,
X., Lis, R., Hoshida, Y., Hiller, D., Hu, B., Jiang, S., Zheng, H., Stegh, A. H., Scott, K. L.,
Signoretti, S., Bardeesy, N., Wang, Y. A., Hill, D. E., Golub, T. R., Stampfer, M. J., Wong,
W. H., Loda, M., Mucci, L., Chin, L., and DePinho, R. A. (2011) SMAD4-dependent barrier
constrains prostate cancer growth and metastatic progression. Nature 470, 269-273
144. Lalonde, E., Ishkanian, A. S., Sykes, J., Fraser, M., Ross-Adams, H., Erho, N., Dunning, M.
J., Halim, S., Lamb, A. D., Moon, N. C., Zafarana, G., Warren, A. Y., Meng, X., Thoms, J.,
Grzadkowski, M. R., Berlin, A., Have, C. L., Ramnarine, V. R., Yao, C. Q., Malloff, C. A.,
Lam, L. L., Xie, H., Harding, N. J., Mak, D. Y., Chu, K. C., Chong, L. C., Sendorek, D. H.,
P'ng, C., Collins, C. C., Squire, J. A., Jurisica, I., Cooper, C., Eeles, R., Pintilie, M., Dal Pra,
A., Davicioni, E., Lam, W. L., Milosevic, M., Neal, D. E., van der Kwast, T., Boutros, P. C.,
and Bristow, R. G. (2014) Tumour genomic and microenvironmental heterogeneity for
integrated prediction of 5-year biochemical recurrence of prostate cancer: a retrospective
cohort study. Lancet Oncol 15, 1521-1532
145. Penney, K. L., Sinnott, J. A., Fall, K., Pawitan, Y., Hoshida, Y., Kraft, P., Stark, J. R.,
Fiorentino, M., Perner, S., Finn, S., Calza, S., Flavin, R., Freedman, M. L., Setlur, S., Sesso,
H. D., Andersson, S. O., Martin, N., Kantoff, P. W., Johansson, J. E., Adami, H. O., Rubin,
M. A., Loda, M., Golub, T. R., Andren, O., Stampfer, M. J., and Mucci, L. A. (2011)
mRNA expression signature of Gleason grade predicts lethal prostate cancer. J Clin Oncol
29, 2391-2396
146. Cottrell, S., Jung, K., Kristiansen, G., Eltze, E., Semjonow, A., Ittmann, M., Hartmann, A.,
Stamey, T., Haefliger, C., and Weiss, G. (2007) Discovery and validation of 3 novel DNA
methylation markers of prostate cancer prognosis. J Urol 177, 1753-1758
147. Porkka, K. P., Pfeiffer, M. J., Waltering, K. K., Vessella, R. L., Tammela, T. L., and
Visakorpi, T. (2007) MicroRNA expression profiling in prostate cancer. Cancer research
67, 6130-6135
156
148. Mitchell, P. S., Parkin, R. K., Kroh, E. M., Fritz, B. R., Wyman, S. K., Pogosova-
Agadjanyan, E. L., Peterson, A., Noteboom, J., O'Briant, K. C., Allen, A., Lin, D. W.,
Urban, N., Drescher, C. W., Knudsen, B. S., Stirewalt, D. L., Gentleman, R., Vessella, R.
L., Nelson, P. S., Martin, D. B., and Tewari, M. (2008) Circulating microRNAs as stable
blood-based markers for cancer detection. Proc Natl Acad Sci U S A 105, 10513-10518
149. Khan, A. P., Poisson, L. M., Bhat, V. B., Fermin, D., Zhao, R., Kalyana-Sundaram, S.,
Michailidis, G., Nesvizhskii, A. I., Omenn, G. S., Chinnaiyan, A. M., and Sreekumar, A.
(2010) Quantitative proteomic profiling of prostate cancer reveals a role for miR-128 in
prostate cancer. Mol Cell Proteomics 9, 298-312
150. Gupta, A., Lotan, Y., Ashfaq, R., Roehrborn, C. G., Raj, G. V., Aragaki, C. C., Montorsi, F.,
and Shariat, S. F. (2009) Predictive value of the differential expression of the urokinase
plasminogen activation axis in radical prostatectomy patients. Eur Urol 55, 1124-1133
151. Shariat, S. F., Menesses-Diaz, A., Kim, I. Y., Muramoto, M., Wheeler, T. M., and Slawin,
K. M. (2004) Tissue expression of transforming growth factor-beta1 and its receptors:
correlation with pathologic features and biochemical progression in patients undergoing
radical prostatectomy. Urology 63, 1191-1197
152. Shariat, S. F., Kattan, M. W., Traxel, E., Andrews, B., Zhu, K., Wheeler, T. M., and Slawin,
K. M. (2004) Association of pre- and postoperative plasma levels of transforming growth
factor beta(1) and interleukin 6 and its soluble receptor with prostate cancer progression.
Clin Cancer Res 10, 1992-1999
153. Wood, D. P., Jr., and Banerjee, M. (1997) Presence of circulating prostate cells in the bone
marrow of patients undergoing radical prostatectomy is predictive of disease-free survival. J
Clin Oncol 15, 3451-3457
154. Berg, A., Berner, A., Lilleby, W., Bruland, O. S., Fossa, S. D., Nesland, J. M., and
Kvalheim, G. (2007) Impact of disseminated tumor cells in bone marrow at diagnosis in
patients with nonmetastatic prostate cancer treated by definitive radiotherapy. Int J Cancer
120, 1603-1609
155. Kantoff, P. W., Halabi, S., Farmer, D. A., Hayes, D. F., Vogelzang, N. A., and Small, E. J.
(2001) Prognostic significance of reverse transcriptase polymerase chain reaction for
prostate-specific antigen in men with hormone-refractory prostate cancer. J Clin Oncol 19,
3025-3028
156. Goodman, O. B., Jr., Fink, L. M., Symanowski, J. T., Wong, B., Grobaski, B., Pomerantz,
D., Ma, Y., Ward, D. C., and Vogelzang, N. J. (2009) Circulating tumor cells in patients
with castration-resistant prostate cancer baseline values and correlation with prognostic
factors. Cancer Epidemiol Biomarkers Prev 18, 1904-1913
157. Bussemakers, M. J., van Bokhoven, A., Verhaegh, G. W., Smit, F. P., Karthaus, H. F.,
Schalken, J. A., Debruyne, F. M., Ru, N., and Isaacs, W. B. (1999) DD3: a new prostate-
specific gene, highly overexpressed in prostate cancer. Cancer research 59, 5975-5979
157
158. Groskopf, J., Aubin, S. M., Deras, I. L., Blase, A., Bodrug, S., Clark, C., Brentano, S.,
Mathis, J., Pham, J., Meyer, T., Cass, M., Hodge, P., Macairan, M. L., Marks, L. S., and
Rittenhouse, H. (2006) APTIMA PCA3 molecular urine test: development of a method to
aid in the diagnosis of prostate cancer. Clin Chem 52, 1089-1095
159. Nakanishi, H., Groskopf, J., Fritsche, H. A., Bhadkamkar, V., Blase, A., Kumar, S. V.,
Davis, J. W., Troncoso, P., Rittenhouse, H., and Babaian, R. J. (2008) PCA3 molecular
urine assay correlates with prostate cancer tumor volume: implication in selecting
candidates for active surveillance. J Urol 179, 1804-1809; discussion 1809-1810
160. Whitman, E. J., Groskopf, J., Ali, A., Chen, Y., Blase, A., Furusato, B., Petrovics, G.,
Ibrahim, M., Elsamanoudi, S., Cullen, J., Sesterhenn, I. A., Brassell, S., Rittenhouse, H.,
Srivastava, S., and McLeod, D. G. (2008) PCA3 score before radical prostatectomy predicts
extracapsular extension and tumor volume. J Urol 180, 1975-1978; discussion 1978-1979
161. Augustin, H., Mayrhofer, K., Pummer, K., and Mannweiler, S. (2013) Relationship between
prostate cancer gene 3 (PCA3) and characteristics of tumor aggressiveness. The Prostate 73,
203-210
162. Laxman, B., Tomlins, S. A., Mehra, R., Morris, D. S., Wang, L., Helgeson, B. E., Shah, R.
B., Rubin, M. A., Wei, J. T., and Chinnaiyan, A. M. (2006) Noninvasive detection of
TMPRSS2:ERG fusion transcripts in the urine of men with prostate cancer. Neoplasia 8,
885-888
163. Hessels, D., Smit, F. P., Verhaegh, G. W., Witjes, J. A., Cornel, E. B., and Schalken, J. A.
(2007) Detection of TMPRSS2-ERG fusion transcripts and prostate cancer antigen 3 in
urinary sediments may improve diagnosis of prostate cancer. Clin Cancer Res 13, 5103-
5108
164. Shay, J. W., and Wright, W. E. (1996) Telomerase activity in human cancer. Curr Opin
Oncol 8, 66-71
165. Meid, F. H., Gygi, C. M., Leisinger, H. J., Bosman, F. T., and Benhattar, J. (2001) The use
of telomerase activity for the detection of prostatic cancer cells after prostatic massage. J
Urol 165, 1802-1805
166. Kim, N. W., and Wu, F. (1997) Advances in quantification and characterization of
telomerase activity by the telomeric repeat amplification protocol (TRAP). Nucleic Acids
Res 25, 2595-2597
167. Schostak, M., Schwall, G. P., Poznanovic, S., Groebe, K., Muller, M., Messinger, D.,
Miller, K., Krause, H., Pelzer, A., Horninger, W., Klocker, H., Hennenlotter, J., Feyerabend,
S., Stenzl, A., and Schrattenholz, A. (2009) Annexin A3 in urine: a highly specific
noninvasive marker for prostate cancer early detection. J Urol 181, 343-353
168. Egeblad, M., and Werb, Z. (2002) New functions for the matrix metalloproteinases in cancer
progression. Nat Rev Cancer 2, 161-174
158
169. Roy, R., Louis, G., Loughlin, K. R., Wiederschain, D., Kilroy, S. M., Lamb, C. C.,
Zurakowski, D., and Moses, M. A. (2008) Tumor-specific urinary matrix metalloproteinase
fingerprinting: identification of high molecular weight urinary matrix metalloproteinase
species. Clin Cancer Res 14, 6610-6617
170. Di Carlo, A., Mariano, A., Terracciano, D., Ferro, M., Montanaro, V., Marsicano, M., Di
Lorenzo, G., Altieri, V., and Macchia, V. (2010) Matrix metalloproteinase-2 and -9 in the
urine of prostate cancer patients. Oncol Rep 24, 3-8
171. Zhou, H., Yuen, P. S., Pisitkun, T., Gonzales, P. A., Yasuda, H., Dear, J. W., Gross, P.,
Knepper, M. A., and Star, R. A. (2006) Collection, storage, preservation, and normalization
of human urinary exosomes for biomarker discovery. Kidney Int 69, 1471-1476
172. Duijvesz, D., Luider, T., Bangma, C. H., and Jenster, G. (2011) Exosomes as biomarker
treasure chests for prostate cancer. Eur Urol 59, 823-831
173. Nilsson, J., Skog, J., Nordstrand, A., Baranov, V., Mincheva-Nilsson, L., Breakefield, X. O.,
and Widmark, A. (2009) Prostate cancer-derived urine exosomes: a novel approach to
biomarkers for prostate cancer. Br J Cancer 100, 1603-1607
174. Vlassov, A. V., Magdaleno, S., Setterquist, R., and Conrad, R. (2012) Exosomes: current
knowledge of their composition, biological functions, and diagnostic and therapeutic
potentials. Biochim Biophys Acta 1820, 940-948
175. Wang, Z., Hill, S., Luther, J. M., Hachey, D. L., and Schey, K. L. (2012) Proteomic analysis
of urine exosomes by multidimensional protein identification technology (MudPIT).
Proteomics 12, 329-338
176. Mitchell, P. J., Welton, J., Staffurth, J., Court, J., Mason, M. D., Tabi, Z., and Clayton, A.
(2009) Can urinary exosomes act as treatment response markers in prostate cancer? J Transl
Med 7, 4
177. Skog, J., Wurdinger, T., van Rijn, S., Meijer, D. H., Gainche, L., Sena-Esteves, M., Curry,
W. T., Jr., Carter, B. S., Krichevsky, A. M., and Breakefield, X. O. (2008) Glioblastoma
microvesicles transport RNA and proteins that promote tumour growth and provide
diagnostic biomarkers. Nat Cell Biol 10, 1470-1476
178. Taylor, D. D., and Gercel-Taylor, C. (2008) MicroRNA signatures of tumor-derived
exosomes as diagnostic biomarkers of ovarian cancer. Gynecol Oncol 110, 13-21
179. Valadi, H., Ekstrom, K., Bossios, A., Sjostrand, M., Lee, J. J., and Lotvall, J. O. (2007)
Exosome-mediated transfer of mRNAs and microRNAs is a novel mechanism of genetic
exchange between cells. Nat Cell Biol 9, 654-659
180. Wasinger, V. C., Cordwell, S. J., Cerpa-Poljak, A., Yan, J. X., Gooley, A. A., Wilkins, M.
R., Duncan, M. W., Harris, R., Williams, K. L., and Humphery-Smith, I. (1995) Progress
with gene-product mapping of the Mollicutes: Mycoplasma genitalium. Electrophoresis 16,
1090-1094
159
181. Camerini, S., and Mauri, P. (2015) The role of protein and peptide separation before mass
spectrometry analysis in clinical proteomics. Journal of chromatography. A 1381, 1-12
182. Wu, S., Lourette, N. M., Tolic, N., Zhao, R., Robinson, E. W., Tolmachev, A. V., Smith, R.
D., and Pasa-Tolic, L. (2009) An integrated top-down and bottom-up strategy for broadly
characterizing protein isoforms and modifications. J Proteome Res 8, 1347-1357
183. Zhang, Y., Fonslow, B. R., Shan, B., Baek, M. C., and Yates, J. R., 3rd (2013) Protein
analysis by shotgun/bottom-up proteomics. Chem Rev 113, 2343-2394
184. Wolters, D. A., Washburn, M. P., and Yates, J. R., 3rd (2001) An automated
multidimensional protein identification technology for shotgun proteomics. Analytical
chemistry 73, 5683-5690
185. Yates, J. R., 3rd (2004) Mass spectral analysis in proteomics. Annual review of biophysics
and biomolecular structure 33, 297-316
186. Link, A. J., Eng, J., Schieltz, D. M., Carmack, E., Mize, G. J., Morris, D. R., Garvik, B. M.,
and Yates, J. R., 3rd (1999) Direct analysis of protein complexes using mass spectrometry.
Nat Biotechnol 17, 676-682
187. Aebersold, R., and Mann, M. (2003) Mass spectrometry-based proteomics. Nature 422, 198-
207
188. Fenn, J. B., Mann, M., Meng, C. K., Wong, S. F., and Whitehouse, C. M. (1989)
Electrospray ionization for mass spectrometry of large biomolecules. Science 246, 64-71
189. Tanaka, K., Waki, H., Ido, Y., Akita, S., Yoshida, Y., Yoshida, T., and Matsuo, T. (1988)
Protein and polymer analyses up to m/z 100 000 by laser ionization time-of-flight mass
spectrometry. Rapid Commun. Mass Spectrom. 2, 151-153
190. Wiley, W. C., and Mclaren, I. H. (1955) Time-of-Flight Mass Spectrometer with Improved
Resolution. Rev Sci Instrum 26, 1150-1157
191. Katzenstein, H. S., and Friedland, S. S. (1955) New Time-of-Flight Mass Spectrometer. Rev
Sci Instrum 26, 324-327
192. Wolff, M. M., and Stephens, W. E. (1953) A Pulsed Mass Spectrometer with Time
Dispersion. Rev Sci Instrum 24, 616-617
193. Cameron, A. E., and Eggers, D. F. (1948) An Ion Velocitron. Rev Sci Instrum 19, 605-607
194. Stephens, W. E. (1946) A Pulsed Mass Spectrometer with Time Dispersion. Phys Rev 69,
691-691
195. Paul, W., and Steinwedel, H. (1953) *Ein Neues Massenspektrometer Ohne Magnetfeld. Z
Naturforsch A 8, 448-450
160
196. Steen, H., and Mann, M. (2004) The ABC's (and XYZ's) of peptide sequencing. Nature
reviews. Molecular cell biology 5, 699-711
197. Hu, Q. Z., Noll, R. J., Li, H. Y., Makarov, A., Hardman, M., and Cooks, R. G. (2005) The
Orbitrap: a new mass spectrometer. J Mass Spectrom 40, 430-443
198. Hardman, M., and Makarov, A. A. (2003) Interfacing the orbitrap mass analyzer to an
electrospray ion source. Analytical chemistry 75, 1699-1705
199. Makarov, A. (2000) Electrostatic axially harmonic orbital trapping: a high-performance
technique of mass analysis. Analytical chemistry 72, 1156-1162
200. Senko, M. W., Canterbury, J. D., Guan, S., and Marshall, A. G. (1996) A high-performance
modular data system for Fourier transform ion cyclotron resonance mass spectrometry.
Rapid Commun Mass Spectrom 10, 1839-1844
201. Marshall, A. G., Hendrickson, C. L., and Jackson, G. S. (1998) Fourier transform ion
cyclotron resonance mass spectrometry: A primer. Mass Spectrom Rev 17, 1-35
202. Liu, H., Sadygov, R. G., and Yates, J. R., 3rd (2004) A model for random sampling and
estimation of relative protein abundance in shotgun proteomics. Analytical chemistry 76,
4193-4201
203. Washburn, M. P., Wolters, D., and Yates, J. R., 3rd (2001) Large-scale analysis of the yeast
proteome by multidimensional protein identification technology. Nat Biotechnol 19, 242-
247
204. Taylor, P., Nielsen, P. A., Trelle, M. B., Horning, O. B., Andersen, M. B., Vorm, O., Moran,
M. F., and Kislinger, T. (2009) Automated 2D peptide separation on a 1D nano-LC-MS
system. J Proteome Res 8, 1610-1616
205. Taylor, P., Nielsen, P. A., Trelle, M. B., Horning, O. B., Andersen, M. B., Vorm, O., Moran,
M. F., and Kislinger, T. (2009) Automated 2D Peptide Separation on a 1D Nano-LC-MS
System. Journal of Proteome Research 8, 1610-1616
206. Maiolica, A., Junger, M. A., Ezkurdia, I., and Aebersold, R. (2012) Targeted proteome
investigation via selected reaction monitoring mass spectrometry. J Proteomics 75, 3495-
3513
207. Sadygov, R. G., Cociorva, D., and Yates, J. R., 3rd (2004) Large-scale database searching
using tandem mass spectra: looking up the answer in the back of the book. Nature methods
1, 195-202
208. Schiess, R., Wollscheid, B., and Aebersold, R. (2009) Targeted proteomic strategy for
clinical biomarker discovery. Molecular oncology 3, 33-44
209. Anderson, N. L., and Anderson, N. G. (2002) The human plasma proteome: history,
character, and diagnostic prospects. Mol Cell Proteomics 1, 845-867
161
210. Brand, J., Haslberger, T., Zolg, W., Pestlin, G., and Palme, S. (2006) Depletion efficiency
and recovery of trace markers from a multiparameter immunodepletion column. Proteomics
6, 3236-3242
211. Fonslow, B. R., Stein, B. D., Webb, K. J., Xu, T., Choi, J., Park, S. K., and Yates, J. R., 3rd
(2013) Digestion and depletion of abundant proteins improves proteomic coverage. Nature
methods 10, 54-56
212. Huttenhain, R., Surinova, S., Ossola, R., Sun, Z., Campbell, D., Cerciello, F., Schiess, R.,
Bausch-Fluck, D., Rosenberger, G., Chen, J., Rinner, O., Kusebauch, U., Hajduch, M.,
Moritz, R. L., Wollscheid, B., and Aebersold, R. (2013) N-glycoprotein SRMAtlas: a
resource of mass spectrometric assays for N-glycosites enabling consistent and multiplexed
protein quantification for clinical applications. Mol Cell Proteomics 12, 1005-1016
213. Gilchrist, A., Au, C. E., Hiding, J., Bell, A. W., Fernandez-Rodriguez, J., Lesimple, S.,
Nagaya, H., Roy, L., Gosline, S. J., Hallett, M., Paiement, J., Kearney, R. E., Nilsson, T.,
and Bergeron, J. J. (2006) Quantitative proteomics analysis of the secretory pathway. Cell
127, 1265-1281
214. Cutillas, P. R., and Vanhaesebroeck, B. (2007) Quantitative profile of five murine core
proteomes using label-free functional proteomics. Mol Cell Proteomics 6, 1560-1573
215. Old, W. M., Meyer-Arendt, K., Aveline-Wolf, L., Pierce, K. G., Mendoza, A., Sevinsky, J.
R., Resing, K. A., and Ahn, N. G. (2005) Comparison of label-free methods for quantifying
human proteins by shotgun proteomics. Mol Cell Proteomics 4, 1487-1502
216. Chelius, D., and Bondarenko, P. V. (2002) Quantitative profiling of proteins in complex
mixtures using liquid chromatography and mass spectrometry. J Proteome Res 1, 317-323
217. Wang, W., Zhou, H., Lin, H., Roy, S., Shaler, T. A., Hill, L. R., Norton, S., Kumar, P.,
Anderle, M., and Becker, C. H. (2003) Quantification of proteins and metabolites by mass
spectrometry without isotopic labeling or spiked standards. Analytical chemistry 75, 4818-
4826
218. Wang, M., You, J., Bemis, K. G., Tegeler, T. J., and Brown, D. P. (2008) Label-free mass
spectrometry-based protein quantification technologies in proteomic analysis. Brief Funct
Genomic Proteomic 7, 329-339
219. Bondarenko, P. V., Chelius, D., and Shaler, T. A. (2002) Identification and relative
quantitation of protein mixtures by enzymatic digestion followed by capillary reversed-
phase liquid chromatography-tandem mass spectrometry. Analytical chemistry 74, 4741-
4749
220. Wang, G., Wu, W. W., Zeng, W., Chou, C. L., and Shen, R. F. (2006) Label-free protein
quantification using LC-coupled ion trap or FT mass spectrometry: Reproducibility,
linearity, and application with complex proteomes. J Proteome Res 5, 1214-1223
162
221. Higgs, R. E., Knierman, M. D., Gelfanova, V., Butler, J. P., and Hale, J. E. (2005)
Comprehensive label-free method for the relative quantification of proteins from biological
samples. J Proteome Res 4, 1442-1450
222. Ono, M., Shitashige, M., Honda, K., Isobe, T., Kuwabara, H., Matsuzuki, H., Hirohashi, S.,
and Yamada, T. (2006) Label-free quantitative proteomics using large peptide data sets
generated by nanoflow liquid chromatography and mass spectrometry. Mol Cell Proteomics
5, 1338-1347
223. Wiener, M. C., Sachs, J. R., Deyanova, E. G., and Yates, N. A. (2004) Differential mass
spectrometry: a label-free LC-MS method for finding significant differences in complex
peptide and protein mixtures. Analytical chemistry 76, 6085-6096
224. Ong, S. E., Blagoev, B., Kratchmarova, I., Kristensen, D. B., Steen, H., Pandey, A., and
Mann, M. (2002) Stable isotope labeling by amino acids in cell culture, SILAC, as a simple
and accurate approach to expression proteomics. Mol Cell Proteomics 1, 376-386
225. Gygi, S. P., Rist, B., Gerber, S. A., Turecek, F., Gelb, M. H., and Aebersold, R. (1999)
Quantitative analysis of complex protein mixtures using isotope-coded affinity tags. Nat
Biotechnol 17, 994-999
226. Ross, P. L., Huang, Y. N., Marchese, J. N., Williamson, B., Parker, K., Hattan, S.,
Khainovski, N., Pillai, S., Dey, S., Daniels, S., Purkayastha, S., Juhasz, P., Martin, S.,
Bartlet-Jones, M., He, F., Jacobson, A., and Pappin, D. J. (2004) Multiplexed protein
quantitation in Saccharomyces cerevisiae using amine-reactive isobaric tagging reagents.
Mol Cell Proteomics 3, 1154-1169
227. Thompson, A., Schafer, J., Kuhn, K., Kienle, S., Schwarz, J., Schmidt, G., Neumann, T.,
Johnstone, R., Mohammed, A. K., and Hamon, C. (2003) Tandem mass tags: a novel
quantification strategy for comparative analysis of complex protein mixtures by MS/MS.
Analytical chemistry 75, 1895-1904
228. Bowers, G. N., Jr., Fassett, J. D., and White, E. t. (1993) Isotope dilution mass spectrometry
and the National Reference System. Analytical chemistry 65, 475R-479R
229. Viswanathan, C. T., Bansal, S., Booth, B., DeStefano, A. J., Rose, M. J., Sailstad, J., Shah,
V. P., Skelly, J. P., Swann, P. G., and Weiner, R. (2007) Quantitative bioanalytical methods
validation and implementation: best practices for chromatographic and ligand binding
assays. Pharmaceutical research 24, 1962-1973
230. Gerber, S. A., Rush, J., Stemman, O., Kirschner, M. W., and Gygi, S. P. (2003) Absolute
quantification of proteins and phosphoproteins from cell lysates by tandem MS. Proc Natl
Acad Sci U S A 100, 6940-6945
231. Beynon, R. J., Doherty, M. K., Pratt, J. M., and Gaskell, S. J. (2005) Multiplexed absolute
quantification in proteomics using artificial QCAT proteins of concatenated signature
peptides. Nature methods 2, 587-589
163
232. Brun, V., Dupuis, A., Adrait, A., Marcellin, M., Thomas, D., Court, M., Vandenesch, F.,
and Garin, J. (2007) Isotope-labeled protein standards: toward absolute quantitative
proteomics. Mol Cell Proteomics 6, 2139-2149
233. Coppinger, J. A., Hutt, D. M., Razvi, A., Koulov, A. V., Pankow, S., Yates, J. R., 3rd, and
Balch, W. E. (2012) A chaperone trap contributes to the onset of cystic fibrosis. PLoS One
7, e37682
234. Hanke, S., Besir, H., Oesterhelt, D., and Mann, M. (2008) Absolute SILAC for accurate
quantitation of proteins in complex mixtures down to the attomole level. J Proteome Res 7,
1118-1130
235. Arsene, C. G., Ohlendorf, R., Burkitt, W., Pritchard, C., Henrion, A., O'Connor, G., Bunk,
D. M., and Guttler, B. (2008) Protein quantification by isotope dilution mass spectrometry
of proteolytic fragments: cleavage rate and accuracy. Analytical chemistry 80, 4154-4160
236. Kondrat, K. L., McClusky, G. A., and Cooks, R. G. (1978) Multiple reaction monitoring
mass spectrometry/mass spectrometry for direct analysis of complex mixtures. Analytical
chemistry, 2017-2021
237. Anderson, L., and Hunter, C. L. (2006) Quantitative mass spectrometric multiple reaction
monitoring assays for major plasma proteins. Mol Cell Proteomics 5, 573-588
238. Keshishian, H., Addona, T., Burgess, M., Kuhn, E., and Carr, S. A. (2007) Quantitative,
multiplexed assays for low abundance proteins in plasma by targeted mass spectrometry and
stable isotope dilution. Mol Cell Proteomics 6, 2212-2229
239. Addona, T. A., Abbatiello, S. E., Schilling, B., Skates, S. J., Mani, D. R., Bunk, D. M.,
Spiegelman, C. H., Zimmerman, L. J., Ham, A. J., Keshishian, H., Hall, S. C., Allen, S.,
Blackman, R. K., Borchers, C. H., Buck, C., Cardasis, H. L., Cusack, M. P., Dodder, N. G.,
Gibson, B. W., Held, J. M., Hiltke, T., Jackson, A., Johansen, E. B., Kinsinger, C. R., Li, J.,
Mesri, M., Neubert, T. A., Niles, R. K., Pulsipher, T. C., Ransohoff, D., Rodriguez, H.,
Rudnick, P. A., Smith, D., Tabb, D. L., Tegeler, T. J., Variyath, A. M., Vega-Montoto, L. J.,
Wahlander, A., Waldemarson, S., Wang, M., Whiteaker, J. R., Zhao, L., Anderson, N. L.,
Fisher, S. J., Liebler, D. C., Paulovich, A. G., Regnier, F. E., Tempst, P., and Carr, S. A.
(2009) Multi-site assessment of the precision and reproducibility of multiple reaction
monitoring-based measurements of proteins in plasma. Nat Biotechnol 27, 633-641
240. Cima, I., Schiess, R., Wild, P., Kaelin, M., Schuffler, P., Lange, V., Picotti, P., Ossola, R.,
Templeton, A., Schubert, O., Fuchs, T., Leippold, T., Wyler, S., Zehetner, J., Jochum, W.,
Buhmann, J., Cerny, T., Moch, H., Gillessen, S., Aebersold, R., and Krek, W. (2011) Cancer
genetics-guided discovery of serum biomarker signatures for diagnosis and prognosis of
prostate cancer. Proc Natl Acad Sci U S A 108, 3342-3347
241. Elschenbroich, S., Ignatchenko, V., Clarke, B., Kalloger, S. E., Boutros, P. C., Gramolini,
A. O., Shaw, P., Jurisica, I., and Kislinger, T. (2011) In-depth proteomics of ovarian cancer
ascites: combining shotgun proteomics and selected reaction monitoring mass spectrometry.
J Proteome Res 10, 2286-2299
164
242. Selevsek, N., Matondo, M., Sanchez Carbayo, M., Aebersold, R., and Domon, B. (2011)
Systematic quantification of peptides/proteins in urine using selected reaction monitoring.
Proteomics 11, 1135-1147
243. Wei, Y., Tong, J., Taylor, P., Strumpf, D., Ignatchenko, V., Pham, N. A., Yanagawa, N.,
Liu, G., Jurisica, I., Shepherd, F. A., Tsao, M. S., Kislinger, T., and Moran, M. F. (2011)
Primary tumor xenografts of human lung adeno and squamous cell carcinoma express
distinct proteomic signatures. J Proteome Res 10, 161-174
244. Kim, Y., Ignatchenko, V., Yao, C. Q., Kalatskaya, I., Nyalwidhe, J. O., Lance, R. S.,
Gramolini, A. O., Troyer, D. A., Stein, L. D., Boutros, P. C., Medin, J. A., Semmes, O. J.,
Drake, R. R., and Kislinger, T. (2012) Identification of differentially expressed proteins in
direct expressed prostatic secretions of men with organ-confined versus extracapsular
prostate cancer. Mol Cell Proteomics 11, 1870-1884
245. Mallick, P., Schirle, M., Chen, S. S., Flory, M. R., Lee, H., Martin, D., Ranish, J., Raught,
B., Schmitt, R., Werner, T., Kuster, B., and Aebersold, R. (2007) Computational prediction
of proteotypic peptides for quantitative proteomics. Nat Biotechnol 25, 125-131
246. Rost, H., Malmstrom, L., and Aebersold, R. (2012) A computational tool to detect and avoid
redundancy in selected reaction monitoring. Mol Cell Proteomics 11, 540-549
247. Stahl-Zeng, J., Lange, V., Ossola, R., Eckhardt, K., Krek, W., Aebersold, R., and Domon, B.
(2007) High sensitivity detection of plasma proteins by multiple reaction monitoring of N-
glycosites. Mol Cell Proteomics 6, 1809-1817
248. Wang, P., Whiteaker, J. R., and Paulovich, A. G. (2009) The evolving role of mass
spectrometry in cancer biomarker discovery. Cancer biology & therapy 8, 1083-1094
249. Zhi, W., Sharma, A., Purohit, S., Miller, E., Bode, B., Anderson, S. W., Reed, J. C., Steed,
R. D., Steed, L., Hopkins, D., and She, J. X. (2011) Discovery and validation of serum
protein changes in type 1 diabetes patients using high throughput two dimensional liquid
chromatography-mass spectrometry and immunoassays. Mol Cell Proteomics 10, M111
012203
250. Fortin, T., Salvador, A., Charrier, J. P., Lenz, C., Lacoux, X., Morla, A., Choquet-
Kastylevsky, G., and Lemoine, J. (2009) Clinical quantitation of prostate-specific antigen
biomarker in the low nanogram/milliliter range by conventional bore liquid
chromatography-tandem mass spectrometry (multiple reaction monitoring) coupling and
correlation with ELISA tests. Mol Cell Proteomics 8, 1006-1015
251. Vegvari, A., Rezeli, M., Welinder, C., Malm, J., Lilja, H., Marko-Varga, G., and Laurell, T.
(2010) Identification of prostate-specific antigen (PSA) isoforms in complex biological
samples utilizing complementary platforms. J Proteomics 73, 1137-1147
252. Picotti, P., Bodenmiller, B., Mueller, L. N., Domon, B., and Aebersold, R. (2009) Full
dynamic range proteome analysis of S. cerevisiae by targeted proteomics. Cell 138, 795-806
165
253. Huttenhain, R., Soste, M., Selevsek, N., Rost, H., Sethi, A., Carapito, C., Farrah, T.,
Deutsch, E. W., Kusebauch, U., Moritz, R. L., Nimeus-Malmstrom, E., Rinner, O., and
Aebersold, R. (2012) Reproducible quantification of cancer-associated proteins in body
fluids using targeted proteomics. Sci Transl Med 4, 142ra194
254. Eggener, S. E., Scardino, P. T., Carroll, P. R., Zelefsky, M. J., Sartor, O., Hricak, H.,
Wheeler, T. M., Fine, S. W., Trachtenberg, J., Rubin, M. A., Ohori, M., Kuroiwa, K.,
Rossignol, M., Abenhaim, L., International Task Force on Prostate, C., and the Focal
Lesion, P. (2007) Focal therapy for localized prostate cancer: a critical appraisal of rationale
and modalities. J Urol 178, 2260-2267
255. Drake, R. R., White, K. Y., Fuller, T. W., Igwe, E., Clements, M. A., Nyalwidhe, J. O.,
Given, R. W., Lance, R. S., and Semmes, O. J. (2009) Clinical collection and protein
properties of expressed prostatic secretions as a source for biomarkers of prostatic disease. J
Proteomics 72, 907-917
256. Collins, A. T., and Maitland, N. J. (2006) Prostate cancer stem cells. European journal of
cancer 42, 1213-1218
257. Deras, I. L., Aubin, S. M., Blase, A., Day, J. R., Koo, S., Partin, A. W., Ellis, W. J., Marks,
L. S., Fradet, Y., Rittenhouse, H., and Groskopf, J. (2008) PCA3: a molecular urine assay
for predicting prostate biopsy outcome. J Urol 179, 1587-1592
258. Hessels, D., and Schalken, J. A. (2009) The use of PCA3 in the diagnosis of prostate cancer.
Nat Rev Urol 6, 255-261
259. Wang, R., Chinnaiyan, A. M., Dunn, R. L., Wojno, K. J., and Wei, J. T. (2009) Rational
approach to implementation of prostate cancer antigen 3 into clinical care. Cancer 115,
3879-3886
260. Drake, R. R., Elschenbroich, S., Lopez-Perez, O., Kim, Y., Ignatchenko, V., Ignatchenko,
A., Nyalwidhe, J. O., Basu, G., Wilkins, C. E., Gjurich, B., Lance, R. S., Semmes, O. J.,
Medin, J. A., and Kislinger, T. (2010) In-depth proteomic analyses of direct expressed
prostatic secretions. J Proteome Res 9, 2109-2116
261. Wu, C., Orozco, C., Boyer, J., Leglise, M., Goodale, J., Batalov, S., Hodge, C. L., Haase, J.,
Janes, J., Huss, J. W., 3rd, and Su, A. I. (2009) BioGPS: an extensible and customizable
portal for querying and organizing gene annotation resources. Genome Biol 10, R130
262. Ponten, F., Gry, M., Fagerberg, L., Lundberg, E., Asplund, A., Berglund, L., Oksvold, P.,
Bjorling, E., Hober, S., Kampf, C., Navani, S., Nilsson, P., Ottosson, J., Persson, A.,
Wernerus, H., Wester, K., and Uhlen, M. (2009) A global view of protein expression in
human cells, tissues, and organs. Mol Syst Biol 5, 337
263. Uhlen, M., Bjorling, E., Agaton, C., Szigyarto, C. A., Amini, B., Andersen, E., Andersson,
A. C., Angelidou, P., Asplund, A., Asplund, C., Berglund, L., Bergstrom, K., Brumer, H.,
Cerjan, D., Ekstrom, M., Elobeid, A., Eriksson, C., Fagerberg, L., Falk, R., Fall, J.,
Forsberg, M., Bjorklund, M. G., Gumbel, K., Halimi, A., Hallin, I., Hamsten, C., Hansson,
M., Hedhammar, M., Hercules, G., Kampf, C., Larsson, K., Lindskog, M., Lodewyckx, W.,
166
Lund, J., Lundeberg, J., Magnusson, K., Malm, E., Nilsson, P., Odling, J., Oksvold, P.,
Olsson, I., Oster, E., Ottosson, J., Paavilainen, L., Persson, A., Rimini, R., Rockberg, J.,
Runeson, M., Sivertsson, A., Skollermo, A., Steen, J., Stenvall, M., Sterky, F., Stromberg,
S., Sundberg, M., Tegel, H., Tourle, S., Wahlund, E., Walden, A., Wan, J., Wernerus, H.,
Westberg, J., Wester, K., Wrethagen, U., Xu, L. L., Hober, S., and Ponten, F. (2005) A
human protein atlas for normal and cancer tissues based on antibody proteomics. Mol Cell
Proteomics 4, 1920-1932
264. Li, R., Guo, Y., Han, B. M., Yan, X., Utleg, A. G., Li, W., Tu, L. C., Wang, J., Hood, L.,
Xia, S., and Lin, B. (2008) Proteomics cataloging analysis of human expressed prostatic
secretions reveals rich source of biomarker candidates. Proteomics Clin Appl 2, 543-555
265. Su, A. I., Wiltshire, T., Batalov, S., Lapp, H., Ching, K. A., Block, D., Zhang, J., Soden, R.,
Hayakawa, M., Kreiman, G., Cooke, M. P., Walker, J. R., and Hogenesch, J. B. (2004) A
gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci U
S A 101, 6062-6067
266. Ponten, F., Jirstrom, K., and Uhlen, M. (2008) The Human Protein Atlas--a tool for
pathology. J Pathol 216, 387-393
267. Pilch, B., and Mann, M. (2006) Large-scale and high-confidence proteomic analysis of
human seminal plasma. Genome Biol 7, R40
268. Sardana, G., Jung, K., Stephan, C., and Diamandis, E. P. (2008) Proteomic analysis of
conditioned media from the PC3, LNCaP, and 22Rv1 prostate cancer cell lines: discovery
and validation of candidate prostate cancer biomarkers. J Proteome Res 7, 3329-3338
269. Cox, B., Kotlyar, M., Evangelou, A. I., Ignatchenko, V., Ignatchenko, A., Whiteley, K.,
Jurisica, I., Adamson, S. L., Rossant, J., and Kislinger, T. (2009) Comparative systems
biology of human and mouse as a tool to guide the modeling of human placental pathology.
Mol Syst Biol 5, 279
270. Kislinger, T., Cox, B., Kannan, A., Chung, C., Hu, P., Ignatchenko, A., Scott, M. S.,
Gramolini, A. O., Morris, Q., Hallett, M. T., Rossant, J., Hughes, T. R., Frey, B., and Emili,
A. (2006) Global survey of organ and organelle protein expression in mouse: combined
proteomic and transcriptomic profiling. Cell 125, 173-186
271. Thongboonkerd, V. (2007) Practical points in urinary proteomics. J Proteome Res 6, 3881-
3890
272. Court, M., Selevsek, N., Matondo, M., Allory, Y., Garin, J., Masselon, C. D., and Domon,
B. (2011) Toward a standardized urine proteome analysis methodology. Proteomics Clin
Appl 5, 547
273. Ashburner, M., Ball, C. A., Blake, J. A., Botstein, D., Butler, H., Cherry, J. M., Davis, A. P.,
Dolinski, K., Dwight, S. S., Eppig, J. T., Harris, M. A., Hill, D. P., Issel-Tarver, L.,
Kasarskis, A., Lewis, S., Matese, J. C., Richardson, J. E., Ringwald, M., Rubin, G. M., and
Sherlock, G. (2000) Gene ontology: tool for the unification of biology. The Gene Ontology
Consortium. Nature genetics 25, 25-29
167
274. Kanehisa, M., and Goto, S. (2000) KEGG: kyoto encyclopedia of genes and genomes.
Nucleic Acids Res 28, 27-30
275. Choi, H., Fermin, D., and Nesvizhskii, A. I. (2008) Significance analysis of spectral count
data in label-free shotgun proteomics. Mol Cell Proteomics 7, 2373-2385
276. Cox, B., Kislinger, T., Wigle, D. A., Kannan, A., Brown, K., Okubo, T., Hogan, B., Jurisica,
I., Frey, B., Rossant, J., and Emili, A. (2007) Integrated proteomic and transcriptomic
profiling of mouse lung development and Nmyc target genes. Mol Syst Biol 3, 109
277. Gygi, S. P., Rochon, Y., Franza, B. R., and Aebersold, R. (1999) Correlation between
protein and mRNA abundance in yeast. Mol Cell Biol 19, 1720-1730
278. UniProt, C. (2010) The Universal Protein Resource (UniProt) in 2010. Nucleic Acids Res
38, D142-148
279. Whitaker, H. C., Warren, A. Y., Eeles, R., Kote-Jarai, Z., and Neal, D. E. (2010) The
potential value of microseminoprotein-beta as a prostate cancer biomarker and therapeutic
target. The Prostate 70, 333-340
280. Cooner, W. H., Mosley, B. R., Rutherford, C. L., Jr., Beard, J. H., Pond, H. S., Terry, W. J.,
Igel, T. C., and Kidd, D. D. (1990) Prostate cancer detection in a clinical urological practice
by ultrasonography, digital rectal examination and prostate specific antigen. J Urol 143,
1146-1152; discussion 1152-1144
281. Thompson, I. M., Ankerst, D. P., Chi, C., Lucia, M. S., Goodman, P. J., Crowley, J. J.,
Parnes, H. L., and Coltman, C. A., Jr. (2005) Operating characteristics of prostate-specific
antigen in men with an initial PSA level of 3.0 ng/ml or lower. JAMA 294, 66-70
282. Han, M., Piantadosi, S., Zahurak, M. L., Sokoll, L. J., Chan, D. W., Epstein, J. I., Walsh, P.
C., and Partin, A. W. (2001) Serum acid phosphatase level and biochemical recurrence
following radical prostatectomy for men with clinically localized prostate cancer. Urology
57, 707-711
283. Veeramani, S., Yuan, T. C., Chen, S. J., Lin, F. F., Petersen, J. E., Shaheduzzaman, S.,
Srivastava, S., MacDonald, R. G., and Lin, M. F. (2005) Cellular prostatic acid phosphatase:
a protein tyrosine phosphatase involved in androgen-independent proliferation of prostate
cancer. Endocr Relat Cancer 12, 805-822
284. Chen, Y. W., Lee, M. S., Lucht, A., Chou, F. P., Huang, W., Havighurst, T. C., Kim, K.,
Wang, J. K., Antalis, T. M., Johnson, M. D., and Lin, C. Y. (2010) TMPRSS2, a serine
protease expressed in the prostate on the apical surface of luminal epithelial cells and
released into semen in prostasomes, is misregulated in prostate cancer cells. The American
journal of pathology 176, 2986-2996
285. Murphy, G. P., Kenny, G. M., Ragde, H., Wolfert, R. L., Boynton, A. L., Holmes, E. H.,
Misrock, S. L., Bartsch, G., Klocker, H., Pointner, J., Reissigl, A., McLeod, D. G., Douglas,
T., Morgan, T., and Gilbaugh, J., Jr. (1998) Measurement of serum prostate-specific
membrane antigen, a new prognostic marker for prostate cancer. Urology 51, 89-97
168
286. Hood, B. L., Malehorn, D. E., Conrads, T. P., and Bigbee, W. L. (2009) Serum proteomics
using mass spectrometry. Methods in molecular biology 520, 107-128
287. Byrne, J. C., Downes, M. R., O'Donoghue, N., O'Keane, C., O'Neill, A., Fan, Y.,
Fitzpatrick, J. M., Dunn, M., and Watson, R. W. (2009) 2D-DIGE as a strategy to identify
serum markers for the progression of prostate cancer. J Proteome Res 8, 942-957
288. Garbis, S. D., Tyritzis, S. I., Roumeliotis, T., Zerefos, P., Giannopoulou, E. G., Vlahou, A.,
Kossida, S., Diaz, J., Vourekas, S., Tamvakopoulos, C., Pavlakis, K., Sanoudou, D., and
Constantinides, C. A. (2008) Search for potential markers for prostate cancer diagnosis,
prognosis and treatment in clinical tissue specimens using amine-specific isobaric tagging
(iTRAQ) with two-dimensional liquid chromatography and tandem mass spectrometry. J
Proteome Res 7, 3146-3158
289. Laxman, B., Morris, D. S., Yu, J., Siddiqui, J., Cao, J., Mehra, R., Lonigro, R. J., Tsodikov,
A., Wei, J. T., Tomlins, S. A., and Chinnaiyan, A. M. (2008) A first-generation multiplex
biomarker analysis of urine for the early detection of prostate cancer. Cancer research 68,
645-649
290. van Gils, M. P., Hessels, D., van Hooij, O., Jannink, S. A., Peelen, W. P., Hanssen, S. L.,
Witjes, J. A., Cornel, E. B., Karthaus, H. F., Smits, G. A., Dijkman, G. A., Mulders, P. F.,
and Schalken, J. A. (2007) The time-resolved fluorescence-based PCA3 test on urinary
sediments after digital rectal examination; a Dutch multicenter validation of the diagnostic
performance. Clin Cancer Res 13, 939-943
291. Barry, M. J. (2001) Clinical practice. Prostate-specific-antigen testing for early diagnosis of
prostate cancer. N Engl J Med 344, 1373-1377
292. Draisma, G., Boer, R., Otto, S. J., van der Cruijsen, I. W., Damhuis, R. A., Schroder, F. H.,
and de Koning, H. J. (2003) Lead times and overdetection due to prostate-specific antigen
screening: estimates from the European Randomized Study of Screening for Prostate
Cancer. Journal of the National Cancer Institute 95, 868-878
293. Esserman, L., Shieh, Y., and Thompson, I. (2009) Rethinking screening for breast cancer
and prostate cancer. Jama 302, 1685-1692
294. Roobol, M. J., Zappa, M., Maattanen, L., and Ciatto, S. (2007) The value of different
screening tests in predicting prostate biopsy outcome in screening for prostate cancer data
from a multicenter study (ERSPC). The Prostate 67, 439-446
295. Elschenbroich, S., Ignatchenko, V., Sharma, P., Schmitt-Ulms, G., Gramolini, A. O., and
Kislinger, T. (2009) Peptide separations by on-line MudPIT compared to isoelectric
focusing in an off-gel format: application to a membrane-enriched fraction from C2C12
mouse skeletal muscle cells. J Proteome Res 8, 4860-4869
296. Gortzak-Uzan, L., Ignatchenko, A., Evangelou, A. I., Agochiya, M., Brown, K. A., St Onge,
P., Kireeva, I., Schmitt-Ulms, G., Brown, T. J., Murphy, J., Rosen, B., Shaw, P., Jurisica, I.,
and Kislinger, T. (2008) A proteome resource of ovarian cancer ascites: integrated
169
proteomic and bioinformatic analyses to identify putative biomarkers. J Proteome Res 7,
339-351
297. McNeal, J. E. (1992) Cancer volume and site of origin of adenocarcinoma in the prostate:
relationship to local and distant spread. Human pathology 23, 258-266
298. Sodek, K. L., Evangelou, A. I., Ignatchenko, A., Agochiya, M., Brown, T. J., Ringuette, M.
J., Jurisica, I., and Kislinger, T. (2008) Identification of pathways associated with invasive
behavior by ovarian cancer cells using multidimensional protein identification technology
(MudPIT). Mol Biosyst 4, 762-773
299. Huang da, W., Sherman, B. T., and Lempicki, R. A. (2009) Systematic and integrative
analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4, 44-57
300. Ploussard, G., and de la Taille, A. (2010) Urine biomarkers in prostate cancer. Nat Rev Urol
7, 101-109
301. Chung, B. I., Tarin, T. V., Ferrari, M., and Brooks, J. D. (2011) Comparison of prostate
cancer tumor volume and percent cancer in prediction of biochemical recurrence and cancer
specific survival. Urol Oncol 29, 314-318
302. D'Amico, A. V., Whittington, R., Malkowicz, S. B., Schultz, D., Kaplan, I., Beard, C. J.,
Tomaszewski, J. E., Renshaw, A. A., Loughlin, K. R., Richie, J. P., and Wein, A. (1998)
Calculated prostate cancer volume greater than 4.0 cm3 identifies patients with localized
prostate cancer who have a poor prognosis following radical prostatectomy or external-
beam radiation therapy. J Clin Oncol 16, 3094-3100
303. Yossepowitch, O., Eggener, S. E., Bianco, F. J., Jr., Carver, B. S., Serio, A., Scardino, P. T.,
and Eastham, J. A. (2007) Radical prostatectomy for clinically localized, high risk prostate
cancer: critical analysis of risk assessment methods. J Urol 178, 493-499; discussion 499
304. Han, M., Partin, A. W., Pound, C. R., Epstein, J. I., and Walsh, P. C. (2001) Long-term
biochemical disease-free and cancer-specific survival following anatomic radical retropubic
prostatectomy. The 15-year Johns Hopkins experience. Urol Clin North Am 28, 555-565
305. Roehl, K. A., Han, M., Ramos, C. G., Antenor, J. A., and Catalona, W. J. (2004) Cancer
progression and survival rates following anatomical radical retropubic prostatectomy in
3,478 consecutive patients: long-term results. J Urol 172, 910-914
306. Hull, G. W., Rabbani, F., Abbas, F., Wheeler, T. M., Kattan, M. W., and Scardino, P. T.
(2002) Cancer control with radical prostatectomy alone in 1,000 consecutive patients. J Urol
167, 528-534
307. Amling, C. L., Blute, M. L., Bergstralh, E. J., Seay, T. M., Slezak, J., and Zincke, H. (2000)
Long-term hazard of progression after radical prostatectomy for clinically localized prostate
cancer: continued risk of biochemical failure after 5 years. J Urol 164, 101-105
308. Stephenson, A. J., and Kattan, M. W. (2006) Nomograms for prostate cancer. BJU Int 98,
39-46
170
309. Chodak, G. W., and Warren, K. S. (2006) Watchful waiting for prostate cancer: a review
article. Prostate Cancer Prostatic Dis 9, 25-29
310. Klotz, L. (2008) Active surveillance for prostate cancer: trials and tribulations. World J Urol
26, 437-442
311. Paulovich, A. G., Whiteaker, J. R., Hoofnagle, A. N., and Wang, P. (2008) The interface
between biomarker discovery and clinical validation: The tar pit of the protein biomarker
pipeline. Proteomics Clin Appl 2, 1386-1402
312. Batruch, I., Lecker, I., Kagedan, D., Smith, C. R., Mullen, B. J., Grober, E., Lo, K. C.,
Diamandis, E. P., and Jarvi, K. A. (2011) Proteomic analysis of seminal plasma from
normal volunteers and post-vasectomy patients identifies over 2000 proteins and candidate
biomarkers of the urogenital system. J Proteome Res 10, 941-953
313. Dowling, P., Wormald, R., Meleady, P., Henry, M., Curran, A., and Clynes, M. (2008)
Analysis of the saliva proteome from patients with head and neck squamous cell carcinoma
reveals differences in abundance levels of proteins associated with tumour progression and
metastasis. J Proteomics 71, 168-175
314. Hu, S., Yu, T., Xie, Y., Yang, Y., Li, Y., Zhou, X., Tsung, S., Loo, R. R., Loo, J. R., and
Wong, D. T. (2007) Discovery of oral fluid biomarkers for human oral cancer by mass
spectrometry. Cancer Genomics Proteomics 4, 55-64
315. Zhao, T., Zeng, X., Bateman, N. W., Sun, M., Teng, P. N., Bigbee, W. L., Dhir, R., Nelson,
J. B., Conrads, T. P., and Hood, B. L. (2012) Relative quantitation of proteins in expressed
prostatic secretion with a stable isotope labeled secretome standard. J Proteome Res 11,
1089-1099
316. Woodson, K., O'Reilly, K. J., Hanson, J. C., Nelson, D., Walk, E. L., and Tangrea, J. A.
(2008) The usefulness of the detection of GSTP1 methylation in urine as a biomarker in the
diagnosis of prostate cancer. J Urol 179, 508-511; discussion 511-502
317. Hessels, D., Klein Gunnewiek, J. M., van Oort, I., Karthaus, H. F., van Leenders, G. J., van
Balken, B., Kiemeney, L. A., Witjes, J. A., and Schalken, J. A. (2003) DD3(PCA3)-based
molecular urine analysis for the diagnosis of prostate cancer. Eur Urol 44, 8-15; discussion
15-16
318. Irani, J., Salomon, L., Soulie, M., Zlotta, A., de la Taille, A., Dore, B., and Millet, C. (2005)
Urinary/serum prostate-specific antigen ratio: comparison with free/total serum prostate-
specific antigen ratio in improving prostate cancer detection. Urology 65, 533-537
319. Bolduc, S., Lacombe, L., Naud, A., Gregoire, M., Fradet, Y., and Tremblay, R. R. (2007)
Urinary PSA: a potential useful marker when serum PSA is between 2.5 ng/mL and 10
ng/mL. Can Urol Assoc J 1, 377-381
320. Sumitomo, M., Iwase, A., Zheng, R., Navarro, D., Kaminetzky, D., Shen, R., Georgescu, M.
M., and Nanus, D. M. (2004) Synergy in tumor suppression by direct interaction of neutral
endopeptidase with PTEN. Cancer Cell 5, 67-78
171
321. Osman, I., Yee, H., Taneja, S. S., Levinson, B., Zeleniuch-Jacquotte, A., Chang, C., Nobert,
C., and Nanus, D. M. (2004) Neutral endopeptidase protein expression and prognosis in
localized prostate cancer. Clin Cancer Res 10, 4096-4100
322. De Marzo, A. M., Platz, E. A., Sutcliffe, S., Xu, J., Gronberg, H., Drake, C. G., Nakai, Y.,
Isaacs, W. B., and Nelson, W. G. (2007) Inflammation in prostate carcinogenesis. Nat Rev
Cancer 7, 256-269
323. Sfanos, K. S., and De Marzo, A. M. (2012) Prostate cancer and inflammation: the evidence.
Histopathology 60, 199-215
324. Nonomura, N., Takayama, H., Nakayama, M., Nakai, Y., Kawashima, A., Mukai, M.,
Nagahara, A., Aozasa, K., and Tsujimura, A. (2011) Infiltration of tumour-associated
macrophages in prostate biopsy specimens is predictive of disease progression after
hormonal therapy for prostate cancer. BJU Int 107, 1918-1922
325. Gannon, P. O., Poisson, A. O., Delvoye, N., Lapointe, R., Mes-Masson, A. M., and Saad, F.
(2009) Characterization of the intra-prostatic immune cell infiltration in androgen-deprived
prostate cancer patients. J Immunol Methods 348, 9-17
326. Fredolini, C., Meani, F., Luchini, A., Zhou, W., Russo, P., Ross, M., Patanarut, A.,
Tamburro, D., Gambara, G., Ornstein, D., Odicino, F., Ragnoli, M., Ravaggi, A., Novelli,
F., Collura, D., D'Urso, L., Muto, G., Belluco, C., Pecorelli, S., Liotta, L., and Petricoin, E.
F., 3rd (2010) Investigation of the ovarian and prostate cancer peptidome for candidate early
detection markers using a novel nanoparticle biomarker capture technology. AAPS J 12,
504-518
327. Longo, C., Patanarut, A., George, T., Bishop, B., Zhou, W., Fredolini, C., Ross, M. M.,
Espina, V., Pellacani, G., Petricoin, E. F., 3rd, Liotta, L. A., and Luchini, A. (2009) Core-
shell hydrogel particles harvest, concentrate and preserve labile low abundance biomarkers.
PLoS One 4, e4763
328. Tamburro, D., Fredolini, C., Espina, V., Douglas, T. A., Ranganathan, A., Ilag, L., Zhou,
W., Russo, P., Espina, B. H., Muto, G., Petricoin, E. F., 3rd, Liotta, L. A., and Luchini, A.
(2011) Multifunctional core-shell nanoparticles: discovery of previously invisible
biomarkers. J Am Chem Soc 133, 19178-19188
329. Roth, J. (2002) Protein N-glycosylation along the secretory pathway: relationship to
organelle topography and function, protein quality control, and cell interactions. Chem Rev
102, 285-303
330. Tian, Y., Zhou, Y., Elliott, S., Aebersold, R., and Zhang, H. (2007) Solid-phase extraction
of N-linked glycopeptides. Nat Protoc 2, 334-339
331. Zhang, H., Li, X. J., Martin, D. B., and Aebersold, R. (2003) Identification and
quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling
and mass spectrometry. Nat Biotechnol 21, 660-666
172
332. Kaji, H., Yamauchi, Y., Takahashi, N., and Isobe, T. (2006) Mass spectrometric
identification of N-linked glycopeptides using lectin-mediated affinity capture and
glycosylation site-specific stable isotope tagging. Nat Protoc 1, 3019-3027
333. Sogabe, M., Nozaki, H., Tanaka, N., Kubota, T., Kaji, H., Kuno, A., Togayachi, A., Gotoh,
M., Nakanishi, H., Nakanishi, T., Mikami, M., Suzuki, N., Kiguchi, K., Ikehara, Y., and
Narimatsu, H. (2014) Novel glycobiomarker for ovarian cancer that detects clear cell
carcinoma. J Proteome Res 13, 1624-1635
334. Drake, R. R., Schwegler, E. E., Malik, G., Diaz, J., Block, T., Mehta, A., and Semmes, O. J.
(2006) Lectin capture strategies combined with mass spectrometry for the discovery of
serum glycoprotein biomarkers. Mol Cell Proteomics 5, 1957-1967
335. Anderson, N. L., Anderson, N. G., Haines, L. R., Hardie, D. B., Olafson, R. W., and
Pearson, T. W. (2004) Mass spectrometric quantitation of peptides and proteins using Stable
Isotope Standards and Capture by Anti-Peptide Antibodies (SISCAPA). J Proteome Res 3,
235-244
336. Anderson, N. L., Jackson, A., Smith, D., Hardie, D., Borchers, C., and Pearson, T. W.
(2009) SISCAPA peptide enrichment on magnetic beads using an in-line bead trap device.
Mol Cell Proteomics 8, 995-1005
337. Mataija-Botelho, D., Murphy, P., Pinto, D. M., Maclellan, D. L., Langlois, C., and Doucette,
A. A. (2009) A qualitative proteome investigation of the sediment portion of human urine:
Implications in the biomarker discovery process. Proteomics Clin Appl 3, 95-105
338. Nawaz, M., Camussi, G., Valadi, H., Nazarenko, I., Ekstrom, K., Wang, X., Principe, S.,
Shah, N., Ashraf, N. M., Fatima, F., Neder, L., and Kislinger, T. (2014) The emerging role
of extracellular vesicles as biomarkers for urogenital cancers. Nat Rev Urol 11, 688-701
339. Chen, Y. T., Chen, H. W., Domanski, D., Smith, D. S., Liang, K. H., Wu, C. C., Chen, C.
L., Chung, T., Chen, M. C., Chang, Y. S., Parker, C. E., Borchers, C. H., and Yu, J. S.
(2012) Multiplexed quantification of 63 proteins in human urine by multiple reaction
monitoring-based mass spectrometry for discovery of potential bladder cancer biomarkers. J
Proteomics 75, 3529-3545
340. Zhi, W., Wang, M., and She, J. X. (2011) Selected reaction monitoring (SRM) mass
spectrometry without isotope labeling can be used for rapid protein quantification. Rapid
Commun Mass Spectrom 25, 1583-1588
341. Smith, L. M., Kelleher, N. L., and Proteomics, C. T. D. (2013) Proteoform: a single term
describing protein complexity. Nature methods 10, 186-187
342. Pavelka, N., Fournier, M. L., Swanson, S. K., Pelizzola, M., Ricciardi-Castagnoli, P.,
Florens, L., and Washburn, M. P. (2008) Statistical similarities between transcriptomics and
quantitative shotgun proteomics data. Mol Cell Proteomics 7, 631-644
173
343. Huang da, W., Sherman, B. T., and Lempicki, R. A. (2009) Bioinformatics enrichment
tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids
Res 37, 1-13
344. Wu, G., Feng, X., and Stein, L. (2010) A human functional protein interaction network and
its application to cancer data analysis. Genome Biol 11, R53
345. Gross, J. L., and Yellen, J. (1999) Graph theory and its applications, CRC Press, Boca
Raton, Fla.
346. Girvan, M., and Newman, M. E. (2002) Community structure in social and biological
networks. Proc Natl Acad Sci U S A 99, 7821-7826
347. Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., Amin, N.,
Schwikowski, B., and Ideker, T. (2003) Cytoscape: a software environment for integrated
models of biomolecular interaction networks. Genome Res 13, 2498-2504
348. Uhlen, M., Oksvold, P., Fagerberg, L., Lundberg, E., Jonasson, K., Forsberg, M., Zwahlen,
M., Kampf, C., Wester, K., Hober, S., Wernerus, H., Bjorling, L., and Ponten, F. (2010)
Towards a knowledge-based Human Protein Atlas. Nat Biotechnol 28, 1248-1250
349. Alexander, H., Stegner, A. L., Wagner-Mann, C., Du Bois, G. C., Alexander, S., and Sauter,
E. R. (2004) Proteomic analysis to identify breast cancer biomarkers in nipple aspirate fluid.
Clin Cancer Res 10, 7500-7510
350. MacLean, B., Tomazela, D. M., Shulman, N., Chambers, M., Finney, G. L., Frewen, B.,
Kern, R., Tabb, D. L., Liebler, D. C., and MacCoss, M. J. (2010) Skyline: an open source
document editor for creating and analyzing targeted proteomics experiments. Bioinformatics
26, 966-968
351. Han, B., and Higgs, R. E. (2008) Proteomics: from hypothesis to quantitative assay on a
single platform. Guidelines for developing MRM assays using ion trap mass spectrometers.
Brief Funct Genomic Proteomic 7, 340-354
352. Lange, V., Picotti, P., Domon, B., and Aebersold, R. (2008) Selected reaction monitoring
for quantitative proteomics: a tutorial. Mol Syst Biol 4, 222
353. Cooperberg, M. R., Broering, J. M., and Carroll, P. R. (2010) Time trends and local
variation in primary treatment of localized prostate cancer. J Clin Oncol 28, 1117-1123
354. Welch, H. G., and Albertsen, P. C. (2009) Prostate cancer diagnosis and treatment after the
introduction of prostate-specific antigen screening: 1986-2005. Journal of the National
Cancer Institute 101, 1325-1329
355. Thompson, I. M., and Klotz, L. (2010) Active surveillance for prostate cancer. JAMA 304,
2411-2412
356. Ganz, P. A., Barry, J. M., Burke, W., Col, N. F., Corso, P. S., Dodson, E., Hammond, M. E.,
Kogan, B. A., Lynch, C. F., Newcomer, L., Seifter, E. J., Tooze, J. A., Viswanath, K., and
174
Wessells, H. (2012) National Institutes of Health State-of-the-Science Conference: role of
active surveillance in the management of men with localized prostate cancer. Annals of
internal medicine 156, 591-595
357. Sieh, W., Lichtensztajn, D. Y., Nelson, D. O., Cockburn, M., West, D. W., Brooks, J. D.,
and Chang, E. T. (2013) Treatment and Mortality in Men with Localized Prostate Cancer: A
Population-Based Study in California. The open prostate cancer journal 6, 1-9
358. Chang, C. Y., Picotti, P., Huttenhain, R., Heinzelmann-Schwarz, V., Jovanovic, M.,
Aebersold, R., and Vitek, O. (2012) Protein significance analysis in selected reaction
monitoring (SRM) measurements. Mol Cell Proteomics 11, M111 014662
359. Polanski, M., and Anderson, N. L. (2007) A list of candidate cancer biomarkers for targeted
proteomics. Biomarker insights 1, 1-48
360. Schreiner, D., Simicevic, J., Ahrne, E., Schmidt, A., and Scheiffele, P. (2015) Quantitative
isoform-profiling of highly diversified recognition molecules. eLife 4
361. de Gramont, A., Watson, S., Ellis, L. M., Rodon, J., Tabernero, J., de Gramont, A., and
Hamilton, S. R. (2015) Pragmatic issues in biomarker evaluation for targeted therapies in
cancer. Nature reviews. Clinical oncology 12, 197-212
362. Kobayashi, M., Salomon, C., Tapia, J., Illanes, S. E., Mitchell, M. D., and Rice, G. E.
(2014) Ovarian cancer cell invasiveness is associated with discordant exosomal
sequestration of Let-7 miRNA and miR-200. J Transl Med 12, 4
363. Thery, C., Ostrowski, M., and Segura, E. (2009) Membrane vesicles as conveyors of
immune responses. Nature reviews. Immunology 9, 581-593
364. Shedden, K., Xie, X. T., Chandaroy, P., Chang, Y. T., and Rosania, G. R. (2003) Expulsion
of small molecules in vesicles shed by cancer cells: association with gene expression and
chemosensitivity profiles. Cancer research 63, 4331-4337
365. Gyorgy, B., Szabo, T. G., Pasztoi, M., Pal, Z., Misjak, P., Aradi, B., Laszlo, V., Pallinger,
E., Pap, E., Kittel, A., Nagy, G., Falus, A., and Buzas, E. I. (2011) Membrane vesicles,
current state-of-the-art: emerging role of extracellular vesicles. Cellular and molecular life
sciences : CMLS 68, 2667-2688
366. D'Souza-Schorey, C., and Clancy, J. W. (2012) Tumor-derived microvesicles: shedding
light on novel microenvironment modulators and prospective cancer biomarkers. Genes Dev
26, 1287-1299
367. Buzas, E. I., Gyorgy, B., Nagy, G., Falus, A., and Gay, S. (2014) Emerging role of
extracellular vesicles in inflammatory diseases. Nature reviews. Rheumatology 10, 356-364
368. Kim, H. K., Song, K. S., Park, Y. S., Kang, Y. H., Lee, Y. J., Lee, K. R., Kim, H. K., Ryu,
K. W., Bae, J. M., and Kim, S. (2003) Elevated levels of circulating platelet microparticles,
VEGF, IL-6 and RANTES in patients with gastric cancer: possible role of a metastasis
predictor. European journal of cancer 39, 184-191
175
369. Graves, L. E., Ariztia, E. V., Navari, J. R., Matzel, H. J., Stack, M. S., and Fishman, D. A.
(2004) Proinvasive properties of ovarian cancer ascites-derived membrane vesicles. Cancer
research 64, 7045-7049
370. van Doormaal, F. F., Kleinjan, A., Di Nisio, M., Buller, H. R., and Nieuwland, R. (2009)
Cell-derived microvesicles and cancer. The Netherlands journal of medicine 67, 266-273
371. Poliakov, A., Spilman, M., Dokland, T., Amling, C. L., and Mobley, J. A. (2009) Structural
heterogeneity and protein composition of exosome-like vesicles (prostasomes) in human
semen. The Prostate 69, 159-167
372. Drake, R. R., and Kislinger, T. (2014) The proteomics of prostate cancer exosomes. Expert
review of proteomics 11, 167-177
373. Aalberts, M., Stout, T. A., and Stoorvogel, W. (2014) Prostasomes: extracellular vesicles
from the prostate. Reproduction 147, R1-14
374. Moremen, K. W., Tiemeyer, M., and Nairn, A. V. (2012) Vertebrate protein glycosylation:
diversity, synthesis and function. Nature reviews. Molecular cell biology 13, 448-462
375. Varki, A. (1993) Biological roles of oligosaccharides: all of the theories are correct.
Glycobiology 3, 97-130
376. Hakomori, S. (2002) Glycosylation defining cancer malignancy: new wine in an old bottle.
Proc Natl Acad Sci U S A 99, 10231-10233
377. Pierce, M., Buckhaults, P., Chen, L., and Fregien, N. (1997) Regulation of N-
acetylglucosaminyltransferase V and Asn-linked oligosaccharide beta(1,6) branching by a
growth factor signaling pathway and effects on cell adhesion and metastatic potential.
Glycoconjugate journal 14, 623-630
378. Rudd, P. M., and Dwek, R. A. (1997) Glycosylation: heterogeneity and the 3D structure of
proteins. Critical reviews in biochemistry and molecular biology 32, 1-100
379. Venable, J. D., Dong, M. Q., Wohlschlegel, J., Dillin, A., and Yates, J. R. (2004) Automated
approach for quantitative analysis of complex peptide mixtures from tandem mass spectra.
Nature methods 1, 39-45
380. Rost, H. L., Rosenberger, G., Navarro, P., Gillet, L., Miladinovic, S. M., Schubert, O. T.,
Wolski, W., Collins, B. C., Malmstrom, J., Malmstrom, L., and Aebersold, R. (2014)
OpenSWATH enables automated, targeted analysis of data-independent acquisition MS
data. Nat Biotechnol 32, 219-223
381. Gillet, L. C., Navarro, P., Tate, S., Rost, H., Selevsek, N., Reiter, L., Bonner, R., and
Aebersold, R. (2012) Targeted data extraction of the MS/MS spectra generated by data-
independent acquisition: a new concept for consistent and accurate proteome analysis. Mol
Cell Proteomics 11, O111 016717
176
382. Peterson, A. C., Russell, J. D., Bailey, D. J., Westphall, M. S., and Coon, J. J. (2012)
Parallel reaction monitoring for high resolution and high mass accuracy quantitative,
targeted proteomics. Mol Cell Proteomics 11, 1475-1488
383. Gallien, S., Duriez, E., Crone, C., Kellmann, M., Moehring, T., and Domon, B. (2012)
Targeted proteomic quantification on quadrupole-orbitrap mass spectrometer. Mol Cell
Proteomics 11, 1709-1723
384. Ginsburg, G. S., and Willard, H. F. (2009) Genomic and personalized medicine: foundations
and applications. Translational research : the journal of laboratory and clinical medicine
154, 277-287
385. Hamburg, M. A., and Collins, F. S. (2010) The path to personalized medicine. N Engl J Med
363, 301-304
177