thesis intro for linkedin

16
INTRODUCTION General Introduction A key process in cancer diagnostics is identifying mutations in tumor suppressor genes and oncogenes. In 2014, Dr. Carl Wittwer stated that “a revolution [is] in progress” regarding molecular diagnostics, adding that “ The number of targeted cancer therapeutics continues to increase” (Chiu et al., 2015). In this thesis, I will explore the use of a finicky molecular beacon probe called KRAS G12V to identify a particular single-nucleotide polymorphism (hereafter referred to as a “SNP”) in the oncogene KRAS under two hypotheses: (1) It is possible to develop an assay using linear-after-the-exponential-PCR (LATE-PCR) and the aforementioned molecular beacon probe to distinguish the KRAS G12V SNP from other KRAS genotypes and (2) said LATE-PCR assay can be developed through design of new primers. The G12V SNP is a guanine to thymine mutation, so named because the wild-type gene codes for glycine and the mutation produces valine (Chan, 2014). It is one of six potential

Upload: melissa-c-maranto

Post on 16-Apr-2017

39 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Thesis Intro for LinkedIn

INTRODUCTION

General Introduction

A key process in cancer diagnostics is identifying mutations in tumor suppressor

genes and oncogenes. In 2014, Dr. Carl Wittwer stated that “a revolution [is] in

progress” regarding molecular diagnostics, adding that “The number of targeted cancer

therapeutics continues to increase” (Chiu et al., 2015). In this thesis, I will explore the

use of a finicky molecular beacon probe called KRAS G12V to identify a particular

single-nucleotide polymorphism (hereafter referred to as a “SNP”) in the oncogene KRAS

under two hypotheses: (1) It is possible to develop an assay using linear-after-the-

exponential-PCR (LATE-PCR) and the aforementioned molecular beacon probe to

distinguish the KRAS G12V SNP from other KRAS genotypes and (2) said LATE-PCR

assay can be developed through design of new primers. The G12V SNP is a guanine to

thymine mutation, so named because the wild-type gene codes for glycine and the

mutation produces valine (Chan, 2014). It is one of six potential cancer-related SNPs that

can occur in codon twelve of KRAS (Chan, 2014). Sequences can be found on page 58 of

the appendix.

In order to fully explicate this project, I will describe the use of real-time

quantitative polymerase chain reactions (hereafter referred to as RT-qPCR) and LATE-

PCR and the optimization of both of those PCR formats. I will detail the design and

mechanism of the KRAS G12V molecular beacon probe, the implementation of the assay

employing the probe with LATE-PCR, and the results of that assay probe. I will

Page 2: Thesis Intro for LinkedIn

conclude by discussing the further implications of my work on the use of molecular

beacons designed to detect SNPs in the KRAS gene.

Identifying which of a potentially large number of SNPs may or may not be

present in a given gene is immensely advantageous. Current technology relies on PCR in

conjunction with fluorescent probes such as molecular beacon probes, TaqMan probes,

and adjacent hybridization probe assays (Kwok et al., 2003). While there are multiple

methods of SNP detection, there are advantages to using molecular beacons for this

purpose. For example, TaqMan probes rely on the activity of Taq DNA polymerase,

limiting the experimenter to PCR assays that can only be performed using extension

phase temperatures at which Taq activity is optimal (Shiotani et al., 2014). Conversely,

molecular beacons can be designed after the PCR is optimized, allowing the experimenter

greater flexibility (Tyagi & Kramer, 1996).

Real-Time Quantitative Polymerase Chain Reactions: The Basics

PCR has been a useful technique in biotechnology since before the use of the

thermostable Thermus aquaticus ("Taq") polymerase in PCR reactions, prior to which

new polymerase molecules had to be added to each sample at each new step and thermal

cyclers had to be designed to allow for this (Henson, 2004). In 1985, Taq polymerase

was successfully synthesized in vitro; in 1991, the first rapid thermal cycler for PCR was

patented and put into production (Henson, 2004). This advance allowed PCR to become

considerably more useful (Lo, 2014). The rest is history, but it is a history that is still

ongoing, so it might be more accurate to refer to this “history” as a present and a future as

well.

Page 3: Thesis Intro for LinkedIn

The polymerase chain reaction is a delicate assay that depends on a number of

reagents: the aforementioned thermostable polymerase, two short (ordinarily 10-20

nucleotides) single-stranded oligonucleotide sequences – called primers – complementary

to the 3’ end of both the sense and antisense strands of the region to be amplified,

deoxyribonucleotide triphosphates (called dNTPs), magnesium or manganese cations,

and a buffer solution that is optimal for the polymerase of choice. Each individual

reaction consists of these components as well as DNAse-free water, DNA template, and –

in the case of RT-qPCR – a fluorescent probe or reagent. Under the proper conditions,

the primers hybridize to their target sequence on their complementary strands, the

polymerase binds at the 3’ end of the primer, and the polymerase extends the primer to

form a new strand complementary to the target strand using the dNTPs. Each new strand

is referred to as an amplicon (Brunstein, 2013).

One cycle of PCR includes three stages: denaturation, annealing of primers, and

extension of the primers. During the extension stage, new amplicons are created

(Brunstein, 2013). At 100% efficiency, each target strand is replicated during one cycle,

causing an exponential increase in the number of amplicons with each cycle. The

number of cycles in a PCR assay may range from 35 to 80 depending on the amount of

target DNA in the initial sample and the amount of product required for detection. Figure

1.1 depicts the accumulation of DNA product within the first four cycles assuming 100%

efficiency (Hollis, 2013).

Page 4: Thesis Intro for LinkedIn

Figure 1.1: Visual representation of the exponential increase in DNA strands during four cycles of PCR. Modified image from Hollis, 2013.

A major difference between traditional PCR and RT-qPCR is that detection of the

resulting amplicons is only possible with traditional PCR following the reaction’s

completion, while RT-qPCR allows detection while the reaction is ongoing. Another

major difference between the two formats of PCR is that RT-qPCR is quantitative

(Applied Biosystems, 2014). Another advantage of RT-qPCR is that tubes are always

closed for such an assay, avoiding cross-contamination. RT-qPCR also allows the

experimenter to control at which stage of the PCR the fluorescence signal is detected by

the spectrofluorometric thermal cycler. In the case of RT-qPCR using SYBR Green, the

SYBR Green is not fluorescent until it intercalates into double-stranded DNA (dsDNA)

(Deprez et al., 2002). This is demonstrated in Figure 1.2. It is also important to mention

that SYBR Green fluorescing only when intercalated into dsDNA is not an absolute; it is

normal for SYBR Green to produce a lower level of fluorescence in the presence of

Page 5: Thesis Intro for LinkedIn

single-stranded DNA (ssDNA). As such, SYBR Green emits a fluorescent signal

detectable by the appropriate PCR instrument upon excitation by a laser during the

annealing and extension stages (Deprez et al., 2002). The results from the quantitation of

fluorescence at each annealing and/or extension stage can be graphed, visualizing the

accumulation of the amplicons (Gevertz et al., 2005).

Figure 1.2: Depiction of how SYBR Green intercalates into double-stranded DNA, producing fluorescence during the annealing and extension stage of PCR. Modified image from Zipper et al., 2004.

At a certain point in the PCR, the signal is intense enough to surpass the

background fluorescence and become detectable by the instrument. This point is

quantified by a value called the threshold cycle (CT) (Gevertz et al., 2005). This term

refers to the cycle at which the reaction switches over from the exponential stage to the

pseudolinear stage, i.e., crosses the threshold from an undetectable signal to a detectable

signal (Applied Biosystems, 2014). The CT value is heavily dependent on the initial

concentration of target DNA and is calculated by a standardized statistical method

Page 6: Thesis Intro for LinkedIn

(Gevertz, 2005). Figure 1.3 depicts an RT-qPCR amplification plot and threshold line of

a PCR assay carried out for this thesis.

Figure 1.3: Typical RT-qPCR amplification plot, with cycle as the abscissa and fluorescence intensity as the ordinate; a method of quantifying the accumulation of amplicons. The threshold line, in black, indicates the CT value for the listed PCR reactions.

Linear-after-the-Exponential-PCR

As mentioned previously, a major advantage of real-time quantitative PCR over

traditional PCR is that traditional PCR does not yield data about the CT (Applied

Biosystems, 2014). CT is dependent on the initial concentration of target DNA and the

efficiency of the PCR, and as such CT is a useful metric by which to gauge the efficiency

of the assay. PCR assay efficiency is dependent on a variety of factors, including primer

concentration and bivalent cation concentration (Pfaffl, 2001). When there is a

significant difference between the forward and reverse primer concentration for the

purpose of producing more of one of either the sense or antisense strand amplicons, the

protocol is referred to as asymmetric PCR (Sanchez et al., 2004). While this has the

advantage of producing more of the strand to which a fluorescent probe binds, it has the

Page 7: Thesis Intro for LinkedIn

disadvantage of frequently being 20-30% less efficient than symmetric PCR (Sanchez et

al., 2004).

For this thesis, an improved type of asymmetric PCR called linear-after-the-

exponential PCR (hereafter referred to as LATE-PCR) was used. This subset of PCR is

so called because it is designed to avoid the tendency of symmetric PCR to plateau

stochastically as a result of PCR products competing with primers for binding to the

DNA target; therefore, the de novo synthesis of PCR product can be linear, as opposed to

plateauing, after the exponential stage (Pierce et al., 2005). LATE-PCR is an ingenious

method of producing an excess of one product strand without sacrificing efficiency. The

reason non-LATE asymmetric PCR is so inefficient compared to its symmetric

counterpart is that the Tm of the limiting primer is dependent upon its concentration

(Owczarzy, 2015). When asymmetric PCR first was developed, its low efficiency posed

a seemingly incorrigible conundrum until it was realized that lowering the primer

concentration lowered the melting temperature of the limiting primer, decreasing

efficiency with which the primer binds to its template (Sanchez et al., 2004).

The concept of LATE-PCR is simple: the limiting primer is redesigned so that it

has a Tm that is higher than the Tm of the excess primer (Pierce et al., 2005). This

calculation can be done with software that also takes into account cation concentration,

another factor that affects melting temperature. The task of increasing melting

temperature is accomplished by increasing the length of the primer. For optimal

efficiency in LATE-PCR, the melting temperature (Tm) of the limiting primer should be

between 5 °C and 7 °C above the Tm of the excess primer (Sanchez et al., 2004).

Finicky Molecular Beacon Probes

Page 8: Thesis Intro for LinkedIn

LATE-PCR is ideal for use with molecular beacon probes due to the fact that each

molecular beacon probe is designed to bind only to the sense strand. As such, the best

results with a molecular beacon are obtained when the PCR produces an excess of the

sense strand (El-Hajj et al., 2009). This way, there are far fewer antisense strands present

to compete with the molecular beacon probe for binding.

Working with molecular beacons requires a significant understanding of their

underlying mechanism. While there exist many fluorescent probes that rely on

fluorescence detected when a probe’s fluorescent moiety and quencher are no longer in

close proximity due to the probe binding to its target (Kwok et al., 2003), molecular

beacon probes are unique in their structure. They are single-stranded oligonucleotides,

ordinarily about 30 nucleotides long, which form a hairpin structure at room temperature

in the absence of their target (Tyagi & Kramer, 1996). They have two “arm” sequences

that are complementary to each other and a loop sequence that is designed to be

complementary to the mutation to which the probe is designed to bind (Tyagi & Kramer,

1996).

A quencher molecule is linked to one end of the probe and a fluorescent moiety is

linked to the other end of the probe. Extensive experimentation by the probes’

developers has yielded a list of the optimal fluorophore(s) for use with each quencher

(Marras, 2008). Interestingly, when molecular beacon probes were invented, the

fluorescence signal-detection technique that the probes were designed to use was

fluorescence resonance energy transfer (FRET) (Tyagi et al., 1998). However, when the

probes were initially tested, the quencher first used (dabcyl), did not behave as expected –

in fact, it was capable of quenching fluorescence from fluorophores whose emission

Page 9: Thesis Intro for LinkedIn

spectra did not overlap with its absorption spectrum – leading the inventors to conclude

that FRET was not the mechanism of quenching. It was then discovered that when a

molecular beacon probe’s fluorophore and quencher are in close enough proximity,

chemical bonds are formed between the moieties that prevent photon emission by the

fluorophore (Marras et al., 2002).

When the probe has a hairpin structure (referred to as the “closed” state), the

quencher and the fluorescent moiety are in close enough proximity that the quencher

bonds to the fluorophore, preventing the fluorescent moiety from producing a signal

(Tyagi et al., 1998). The reason molecular beacon probes produce a fluorescent signal in

the presence of their target is that it is thermodynamically more favorable for the probe to

bind to its target than to itself (Marras, 2003). This is illustrated in Figure 1.4. This

configuration is referred to as the “open” state (Kramer et al., 2009).

Figure 1.4: Operation of molecular beacon probes displaying the lack of fluorescence in the “closed” configuration. When the fluorescent moiety and the quencher are held at a distance by the rigidity of the beacon/target hybrid’s double helix structure, the fluorescent moiety can emit photons. Modified from Tyagi & Kramer et al., 1996.

There are two types of molecular beacon probes: “sloppy” and “finicky”. As

demonstrated in Figure 1.5, sloppy molecular beacon probes are designed to be tolerant

of multiple mismatched base pairs (El-Hajj et al., 2009). This mismatch tolerance is

Page 10: Thesis Intro for LinkedIn

useful – especially when sloppy molecular beacon probes are being used to identify

multiple species of bacteria – amplicons from numerous species (El-Hajj et al., 2009).

Finicky molecular beacons, on the other hand, are designed to detect only single-

nucleotide polymorphisms (Tyagi & Kramer, 1996). When detecting single-nucleotide

polymorphisms, it is imperative that finicky probes be used in order to avoid false

positive results.

Figure 1.5: Operation of finicky vs. sloppy molecular beacon probes comparing binding ability of the two types. Modified from Tyagi & Kramer et al., 1996.