introduction

73
GEN 3040 Introduction The fundamental of the genome project is to complete the DNA sequence for the organism being studied and include together with the genetic and physical maps of the genome so that genes and other interesting features can be located within the DNA sequence. Therefore, the techniques and research strategies for the DNA sequencing had been explored by many researchers. Sequencing is the process to determine the exact order of the nucleotides in a region of DNA. A combination of the computer analysis and experimentation are required to find the location of the gens and their control regions (Brown, 2006). DNA sequencing methods include the chain termination method introduced by F.Sanger and A.R. Coulson and chemical degradation by A.Maxam and W.Gilbert. Both of these techniques are important in allowing DNA sequences to be determined in just a short time. For the chain termination method, the sequencing involved the enzymatic synthesis of the strand that complement to an existing template. Thus the DNA to be sequenced must be in single stranded form and the region to be sequenced must be in the 3’ flanked by known sequence (Sanger, 1981). The requirements for the Sanger- Coulson sequencing include a primer that complement to the known region to direct the synthesis of the chain, DNA polymerase, 4- deoxynucleotide triphosphates (dNTPs) and 4-dideoxynucleotide triphosphates (ddNTPs) (Brown, 2006). Primer is the single stranded oligonucleotides and since it complement with the DNA, Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 1

Upload: nishak21

Post on 28-Nov-2014

268 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction

GEN 3040

Introduction

The fundamental of the genome project is to complete the DNA sequence for the

organism being studied and include together with the genetic and physical maps of the genome

so that genes and other interesting features can be located within the DNA sequence. Therefore,

the techniques and research strategies for the DNA sequencing had been explored by many

researchers. Sequencing is the process to determine the exact order of the nucleotides in a region

of DNA. A combination of the computer analysis and experimentation are required to find the

location of the gens and their control regions (Brown, 2006).

DNA sequencing methods include the chain termination method introduced by F.Sanger

and A.R. Coulson and chemical degradation by A.Maxam and W.Gilbert. Both of these

techniques are important in allowing DNA sequences to be determined in just a short time. For

the chain termination method, the sequencing involved the enzymatic synthesis of the strand that

complement to an existing template. Thus the DNA to be sequenced must be in single stranded

form and the region to be sequenced must be in the 3’ flanked by known sequence (Sanger,

1981). The requirements for the Sanger-Coulson sequencing include a primer that complement to

the known region to direct the synthesis of the chain, DNA polymerase, 4-deoxynucleotide

triphosphates (dNTPs) and 4-dideoxynucleotide triphosphates (ddNTPs) (Brown, 2006). Primer

is the single stranded oligonucleotides and since it complement with the DNA, the primer would

easily hybridize with the DNA (Sanger, 1981). The DNA polymerization caused the 3’-OH to

change to a –H in ddNTPs, it is unable to form a phosphodiester bond by nucleophilic attack on

the phosphate and it would cause a termination in the DNA chain. There are two forms of

labeling, the radioactive form that include the labeled primer with 32P or 33P and labeled dNTP

35S while for the non-radioactive form the primer labeled and ddNTP labeled (big dye

terminator). As a result, labeled chains with different size that could be separated by gel

electrophoresis before visualized the fragments by using autoradiograph (Brown, 2006).

Different from Sanger-Coulson method, the automatic DNA sequencing though also

carried out using the chain termination procedure, the automatic DNA sequencing involves dye-

labeled deoxy terminators in which a dye label is inserted into the DNA as well as the

terminating base. This method is much safer compared to the Sanger-Coulson since no

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 1

Page 2: Introduction

GEN 3040

radioactive involved and the automated sequencers could read up to the 96 sequences thus the

data is acquired more rapid the manual sequencing (Brown, 2006).

The computer analysis and internet databases enable the DNA to be analyzed and

studied. One popular database is from NCBI in which contain the general database that is very

useful. The program BLAST in NCBI could be used to find the homology gene that is similar to

the gene being studied. The searching of the homology gene involves the whole collection of

sequences in the database. Thus, the sequence could be compared and the origin and functions

can be investigated.The nucleotides sequence also could be translated to cDNA, mRNA and

amino acid sequence easily by using computer programs. Only short time required and the result

obtained also more accurate. With these entire databases on the internet, people also could share

their findings easily with other people around the world (NCBI, 2010).

As for molecular cloning, the process begins with the isolation and amplification of the

gene of interest. The gene of interest is then inserted into a vector, another DNA molecule that

will drive the gene into living cells to be replicated. Recombinant DNA will be ultimately

produced to be inserted into the host cell. Clones will then can be generated and the inserted

foreign DNA will also be generated. The amplified foreign DNA can be purified for further

analysis.

In this study, the DNA that has been subcloned into a plasmid from a genomic clone

promoter fragment (pOS2) of the osmotin gene from Nicotiana tabacum was prepared and

sequenced. The desired DNA fragment was purified before sent for the sequencing. The basic of

the automated DNA sequence analysis was introduced during the practical. After the sequences

were obtained, the sequences were analyzed and mapping by using those database online. By

analyzing the sequences, a lot of information of the sequences was obtained. The sequences are

obtained from the Nicotiana tabacum gene. The sequence motifs that act as the transcription

factor binding sites are also determined.

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 2

Page 3: Introduction

GEN 3040

Materials and methods

The DNA has been subcloned into a plasmid from a genomic clone promoter fragment

(pOS2) of the osmotin gene from Nicotiana tabacum. The plasmid DNA then was sent for the

automated sequencing. Then the quantification of the nucleic acids by using Nanodrop and UV

spectrophotometry were done. The absorbance at certain wavelength was recorded and the DNA

concentration was calculated. All the steps and methods in this experiment are based on the lab

manual.

The vector used was pUC118 and the restriction enzymes used were pUC118, Xbal,

SubcloneSalI, SubcloneSpeI, SubcloneBglII, SubcloneXbal and SpeI, SubcloneSalI and BglII

and SubcloneXbal and BglII. The sequence obtained (1st_Base_308861_POS_2_M13F_-20) and

(1st_Base_308862_POS_2_M13R_-20) were first edited manually by using the BioEdit

Software. Later, the edited consensus sequence that had been reversed was analyzed in the

BLAST from NCBI. The homology gene of those sequences was obtained. ClustalW was used to

find the genetic similarities between five homology genes. The sequences also analyzed at the

Expasy-Translate to convert it into amino acid sequence. The sequence motifs were identified by

using the Patmatmotif and also the Fuzznuc.

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 3

Page 4: Introduction

GEN 3040

Results

Week 1: Preparation of Plasmid DNA Template for Automated Sequencing

Initially, only sample A was observed to show pink pellet. However, as the experiment was

continued, the pellet appeared in both sample A and B. Thus, both samples were used for next

experiment.

Week 2: Quantification of Nucleic Acids Using UV Spectrophotometry

Table 1. The OD reading of of sample A and B at 230nm, 260nm and 280nm and the OD

ratio by using Nanodrop

OD reading

230nm 260nm 280nm Ratio of

OD230:OD260:OD280

Sample A 0.2537 0.6162 0.327 1: 2.4:1.3

Sample B 0.2517 0.5734 0.2959 1:2.3:1.2

Concentration obtained from the Nanodrop:

Sample A: (624.5 + 607.9)/2 = 616.2ng/µL

Sample B: (563.8 + 583)/2 = 573.4 ng/µL

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 4

Page 5: Introduction

GEN 3040

Table 2. The OD reading of two different concentration of sample A at 230nm, 260nm and

280nm and the OD ratio (UV spectrophotometer)

OD reading

230nm 260nm 280nm

999µL H2O + 1µL DNA 0.0011 -0.0140 -0.0217

998µL H2O + 2µL DNA -2.2157 -1.9648 -2.1683

Table 3. The OD reading of two different concentration of sample B at 230nm, 260nm and

280nm and the OD ratio (UV spectrophotometer)

OD reading

230nm 260nm 280nm

999µL H2O + 1µL DNA 0.0018 -0.0110 -0.0198

998µL H2O + 2µL DNA -0.0826 1.133 1.023

The concentration of the samples can be calculated by using the formula given as follows:

Concentration of nucleic acid = absorbance at OD260 x 50µg/mL x dilution factor

Table 4. The distance migrated, size of markers, and log of length of markers

Distance migrated (cm) Size of markers Log of length (bp)

0.9 23130 4.3641756

1.3 9416 3.9738664

1.6 6557 3.8167052

2.1 4361 3.6395861

3.2 2322 3.3658622

3.3 2027 3.3068537

5.2 564 2.7512791

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 5

Page 6: Introduction

GEN 3040

6.5 125 2.0969100

1.5000000 2.0000000 2.5000000 3.0000000 3.5000000 4.0000000 4.5000000 5.00000000

1

2

3

4

5

6

7

f(x) = − 2.71589442280872 x + 12.285662965085R² = 0.977684109981008

Distance migrated by band versus Log of length

Log of length

Dist

ance

mig

rate

d by

ban

d (c

m)

Figure 1. Graph of distance migrated (cm) versus log of marker (bp) of markers

From the graph, the equation obtained is: y = -2.7159x + 12.286

Thus, to calculate the length of the markers,

x = (y – 12.286)/-2.7159

Length of restriction enzyme = 10x

Table 5. Distance migrated (cm) and length of restriction enzyme (cm)

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 6

Page 7: Introduction

GEN 3040

Restriction enzyme Distance migrated (cm)

Log of length (bp) Length (bp)

pUC118, Xbal 2.6 x = (2.6 – 12.286)/-2.7159

= 3.566

3681.29

SubcloneSalI 2.6, 4.2 x = (2.6 – 12.286)/-2.7159

= 3.566

x = (4.2 – 12.286)/-2.7159

= 2.977

3681.29

948.42

SubcloneSpeI 2.1 x = (2.1 – 12.286)/-2.7159

= 3.751

5636.38

SubcloneBglII 2.5, 4.4 x = (2.5 – 12.286)/-2.7159

= 3.603

x = (4.4 – 12.286)/-2.7159

= 2.904

4008.67

801.68

SubcloneXbal and SpeI 2.2, 6.3 x = (2.2 – 12.286)/-2.7159

= 3.714

x = (6.3 – 12.286)/-2.7159

= 2.204

5176.07

159.96

SubcloneSalI and BglII 2.6, 4.4, 6.6, 6.8 x = (2.6 – 12.286)/-2.7159

= 3.566

x = (4.4 – 12.286)/-2.7159

= 2.904

x = (6.6 – 12.286)/-2.7159

= 2.094

x = (6.8 – 12.286)/-2.7159

= 2.020

3681.29

801.68

124.17

104.71

SubcloneXbal and BglII 2.6, 4.4, 6.6 x = (2.6 – 12.286)/-2.7159 3681.29

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 7

Page 8: Introduction

GEN 3040

= 3.566

x = (4.4 – 12.286)/-2.7159

= 2.904

x = (6.6 – 12.286)/-2.7159

= 2.094

801.68

124.17

Figure 3. The restriction map of the cloned insert

The consensus sequence of pOS2

>pOS2

CGCGCGAGCGCCAACGGTGTACGGGCAGTTGTTTCGGACCTCGATAGTTGCAGCATAAGTACAAGTCACAAAGGCAAGGAGGAAGAAAACAAAAGAAGATCTCAAGTATTCCATGTTTGTGGACATTTTCTTTTTAACAAGTTGCGTTGTTGGATATAGTGACAAATTATTTGGTGGTTTATATAGGAGTATGGCGGCTTTTCGCACATAGACATTAATCATATATATATTTATTAACATTTGACTAGTAAATGGATAATTCTCAAGGATGCTTTAGTAAAGGTGGAGGCGGCTAAAAGTATAAGAGCCGCCTAATAAACTAATAATTTTATGAATATAGAAGTCAATTTATTTGGTGATGATGCTGACCTATAAGCTCTGTAACGTGACACTTGTAACAATATCTCTTTTAGTCAATTTCTTGCTTTCTTTTATCTATACCATTTTTTTAATTCACAAGAAATTGGCAGATAGCCATCATTTTGAAGCTATTCAATATTAGTTGCTTTTTAACATGAAAATAAACTTAAACAAAAATATTTTTTCGCAAAAATAAGAAAAATATATAAAATTTTTATTAGAGTTTAAATATTTATCAGTTTGAACTTCATATTTCATACTCCCGAAAATTTTTATGGAGTTTGCACAGATCATAAATTTTTCATGTAGTTTTAATACTAAATCTTGGAGTTTCACTTTTAATATTGATGTTATATTGTCCCCTACTTATTTTTTCTTCCTTCGTTTTCTTTTTGAGAACTATTTCATCTATTTATATATAGCAGCTCATTAAGTAGATGGCGTTATTCTGGATGTGGAATAAACAGAGAAGAACAACACAGAAGATTAATGCCAATCGGGTAGACTTGATCAACAATTTAACAAATAAGCTGATTGTAGATAACATAAACTCATAAAGTTACGCCAATTCAATAGAGTTTGTTGAACAATTGAACTAGCATAAAATCATGCCAACTGGATAGAGTTTGTGAACACTTAACAATATAGGTGGAGTAAATACAATATAAATTCTTCCGATAGAAAAATTTCTGAAGAACTACATGGGTCCGGAGATCATTTTTTAAATATATTATAAAATGGAGACAATATTTAAAGACCAGTCCGTTATAGGCCGGCCATTAATTCTTGTCCATCAAATTTGCATCTGCTGTTGGTTTTCTGCTAGATCTACTTAAAATTTATAAAACAATCAGTTTCGAAACAAAGTCTAGTGTAGTACTACCAGAACATCAACTCATATAAAAGTCGACTCTAGAGGTCCCCGGTACCGAGCTCGAATTCGA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 8

Page 9: Introduction

GEN 3040

Figure 2. The gel electrophoresis of the digestion products

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 9

Page 10: Introduction

GEN 3040

Figure 4. The pOS2 consensus similarities

Week 4: Collation of Sequence Data and Computer Analysis

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 10

Page 11: Introduction

GEN 3040

Figure 5. The edited sequences from BioEdit

Figure 6. The protein translation of the sequence by using BLAST program

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 11

Page 12: Introduction

GEN 3040

Figure 7. The Clustalw program to obtain the genetic relationship of the sequences

Figure 8. The Compute pI/Mw tool from Expasy

Discussion

During the plasmid preparation, the pink pellet only was observed in the sample A. However, as

the experiment was continued, the pink pellet was ultimately observed. Errors might occur

during the experiment such as the DNA might be too small or the dye was not completely bind to

the plasmid. The concentration of the nucleic acid was obtained directly from Nanodrop as the

values obtained from the UV spectrophotometer were erroneous which it gave negative optical

density readings. Thus, the values cannot be used by using the OD from the UV

spectrophotometer. It might be caused by the spectrophotometer itself, or by the sample as the

ratio calculated from the Nanodrop showed that the samples were impure.

The DNA concentration was calculated based on the concentration of a double stranded

DNA of 50µg/mL which is equivalent to an OD260 of 1. The OD of 230 as well as 280nm was

also recorded for the precaution purposes. Contamination of the samples by carbohydrates,

protein or other foreign components of the tissues will affect the OD260 readings. Thus errors

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 12

Page 13: Introduction

GEN 3040

when calculating the DNA concentration in the samples could happen. Basically, ratio of

OD230:OD260:OD280 of pure DNA sample is 1:1.8 and the accepted range of DNA sample for

further analysis is 1:1.6-2:1. However, the DNA sample obtained in this experiment for both

sample A and B were 1: 2.4:1.3 and 1:2.3:1.2 respectively. The ratios explained that the DNA

samples were not pure and rejected for further analysis. The lacks of DNA purity probably due to

the contamination during the DNA transfer from one container to others or poor centrifugation

process which caused some debris left in the supernatant. The contamination of the samples was

the reason why the OD readings were affected (Brown, 2006).

The DNA was cut by pUC118, Xbal, SubcloneSalI, SubcloneSpeI, SubcloneBglII,

SubcloneXbal and SpeI, SubcloneSalI and BglII and SubcloneXbal and BglII restriction enzymes

and separated based on the sizes by the gel electrophoresis. The lowest band on the gel was the

shorter fragment as it could travel further.

From the gel of digestion fragments, the length and size of the markers were measured. A

graph of distance migrated by the markers versus log of length of the restriction enzyme was

plotted and the equation obtained was used in order to calculate the length of the restriction

enzymes. From Figure 3, the restriction map of the cloned insert is shown. The exact location of

the insert as well as the restriction enzymes can be estimated. The similarities of the consensus

can be determined by using the online programs such as BLAST in NCBI that contains the

GenBank databases. The highest similarity obtained was 99%. The 1% unlikeliness might be due

to some of the degeneration of short fragment caused by the plasmid cutting process by

restriction enzymes.

When the BioEdit program was ran, the Restriction Map was performed as well. There

were 5 restriction enzymes detected with specific bases shown as follows:

GAATTC = EcoRI (1)

GTCGAC = AccI/HincII/SalI (1)

TCTAGA = XbaI

GAGCTC = SacI

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 13

Page 14: Introduction

GEN 3040

GGTACC = KpnI

Questions and answers

Q1. Do all regions of your sequence show exact homology to the best matching sequence you

have found in the database?

Not all regions in the sequence showed exact homology of the best matching sequence. The best

homology of the pOS sequence is the N.tabacum osmotin gene which shows 99% identical and

has 0 E value. The 1% that is incompatible with the cloned sequence probably due to the

impurities of the DNA that was sent for sequencing. There might be errors occurred during the

isolation process.

Q2. Using the BLAST output description, ‘matcher’ analyses, and the references cited, determine

the genetic relationship between the five most closely related DNA sequence and the genomic

sequence pf this gene been determined?

From BLAST, five most closely related DNA sequences of the consensus sequences pOS2 are

the N.tabacum osmotin gene, abscisic acid-activated of N.tabacum, promoter osmotin of

N.tabacum, N.tabacum ap24 gene asn Solanum phureja osmotin-like protein. The osmotin gene

is the most identical to the sequence with the 0 E-value that indicates the most significant results.

Figure 9. The pOS2 consensus similarities

>gi|1167853|emb|X95308.1| N.tabacum osmotin geneGTCGACTTTTATATGAGTTGATGTTCTGGTAGTACTACACTAGACTTTGTTTCGAAACTGATTGTTTTATAAATTTTAAGTAGATCTAGCAGAAAACCAACAGCAGATGCAAATTTGATGGACAAGAATTAATGGCCGGCCTATAACGGACTGGTCTTTAAATATTGTCTCCATTTTATAATATATTTAAAAAATGATCTCCGGACCCATGTAGTTCTTCAGAAATTTTTCTATCGGAAGAATTTATATTGTATTTACTCCACCTATATTGTTAAGTGTTCACAAACTCTATCCAGTTGGCATGATTTTATGCTAGTTCAATTGTTCAACAAACTCTATTGAATTGGCGT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 14

Page 15: Introduction

GEN 3040

AACTTTATGAGTTTATGTTATCTACAATCAGCTTATTTGTTAAATTGTTGATCAAGTCTACCCGATTGGCATTAATCTTCTGTGTTGTTCTTCTCTGTTTCTTCCACATCCAGAATAACGCCATCTACTTAATGAGCTGCTATATATAAATAGCTGAAATGATTCTCAAAAAGAAAACGAAGGAAGAAAAAATAAGTAGGGGACAATATAACATCAATATTAAAAGTGAAACTCCAAGATTTAGTATTAAAACTACATGAAAAATTTATGATCTGTGCAAACTCCATAAAAATTTTCGGGAGTATGAAATATGAAGTTCAAACTGATAAATATTTAAACTCTAATAAAAATTTTATATATTTTTCTTATTTTTGCGAAAAATATTTTTGTTTAAGTTTATTTTCATGTTAAAAAGCAACTAATATTGAATATGCTTCAAAATGATGGCTATCTGCCAATTTCTTGTGAATTAAAAAAATGGTATAGATAAAAGAAAGCAAGAAATTGACTAAAAGAGATATTGTTACAAGTGTCACGTTACAGAGCTTATAGGTCAGCATCATCACCAAATAAATTGACTTCTATATTCATAAAATTATTAGTTTATTAGGCGGCTCTTATACTTTTAGCCGCCTCCACCTTTACTAAAGCATCCTTGAGAATTATCCATTTACTAGTCAAATGTTAATAAATATATATATGATTAATGTCTATGTGCGAAAAGCCGCCATACTCCTATATAAACCACAAATAATTTGTCACTATATCCAACAACGCAACTTGTTAAAAAGAAAATGTCCACAAACATGGAATACTTGAGATCTTCTTTTGTTTTCTTCCTCCTTGCCTTTGTGACTTGTACTTATGCTGCAACTATCGAGGTCCGAAACAACTGCCCGTACACCGTTTGGGCGGCGTCGACCCCCATAGGCGGTGGCCGACGCCTCGATCGAGGCCAAACTTGGGTGATCAATGCGCCACGAGGTACTAAAATGGCACGTATATGGGGTCGTACAAATTGTAACTTTAATGCTGCTGGTAGGGGTACATGCCAAACCGGTGACTGTGGTGGAGTCCTACAATGCACCGGGTGGGGTAAACCACCAAACACCTTGGCCGAATACGCATTGGACCAATTCAGTGGCTTAGATTTTTGGGACATTTCTTTAGTAGATGGATTCAACATACCGATGACTTTCGCCCCGACTAACCCTAGTGGAGGGAAATGCCATGCAATTCATTGCACGGCCAATATAAACGGTGAATGTCCCCGAGAACTTAGGGTTCCCGGAGGATGTAATAACCCTTGTACTACATTTGGAGGACAACAATATTGTTGCACACAAGGACCTTGTGGTCCTACATTTTTCTCAAAATTTTTCAAACAAAGATGCCCTGACGCCTATAGCTACCCACAAGATGATCCTACAAGCACTTTTACTTGCCCTGGTGGTAGTACAAATTATAAGGTTATCTTCTGTCCTAATGGTCAAGCTCACCCAAATTTCCCCTTGGAAATGCCTGGAAGTGATGAAGTGGCTAAGTAGAACGGTCATTCCTCTAATAAGATCATCTTTGGTCGAATTATTGCGTGAAAGCGTTAGTGTGGGACAATTTATTTTGTCTCGCTTTTATAGTTTACACACTTTGTTTATTTTGGTTGAAGTGGTCAAGAGATGATCT

abscisic acid-activated [Nicotiana tabacum L.=tobacco, leaf, Genomic, 3033 nt]

GenBank: S40046.1

GenBank Graphics Related Sequences

>gi|251138|gb|S40046.1| abscisic acid-activated [Nicotiana tabacum L.=tobacco, leaf, Genomic, 3033 nt]GTCGACTTTTGTATCAGTTGATGTTCTGGTAGTATTACACTAGACTTAGTTTCGTAACTGATTGTTTTATAAATTTTCCGGTAACGTCCAAATATGTCCTTACCGTCACATAGTTGGTCTATATATCCCTTTTTGAAATAGAATCCATCCAAATTATTTAGCTCTTCCGTTAATTATACTAAAATGTATGACAAACACTATTTCCTTTTTATTCAGTACTTTTTTTTCTTTATCAATTTAACTTGACAAAACTCATGAATTCCTGTTAATTTTACTATTACAGGCCCCACAGGTTCCTTCCTGACCTGAAGGAGATGAAAATTATCGGAGAAATTTTTTCGGTGGTGTTAATTGGGTGAAGGTAGGATATGATTATTCAAATTTTGAAGATCTTGTTCCTTTCAGATCTGAAGCTCCACAGTGAACAACTCCTTCAAAATCTGAATAATTATATCCTACCTTCACCAAATTAACACCACCAAAAAAATTTCTCCGACAATTTTTATCTCCTTCTTTTCAGAAGAACTAGAGCCTTCTCGGTTGGAAGGTTTTGGTGGTGCAAGTTTGATTTTTAGGAATAAAATCACGTCAAATTGGTAACGAAAATGGAGAAGGCACCGGAAATGGAGGAAACCGGATATGGGAGAATGAAAAAGGGAAAAAAAGATAAGAAAAGAAAAAAGAAGAAAGAAAAGAGAAAGGTAAAGAAAAAAAGTACTAATAAAAAGTAGATAGTGTTTGTAATACACTTTAATACAATTAAAGAAAGAGCTAATTAGTTTGGGTGGATTCATTTTTAAAAAGGGCAACTATGTGACGGTAAGGACATATATGGACCAACTATGTGTATGGTAAGGGCATATATGGACCAATTATGTGACGGTACGAGTATATATGAGCTAAAGTATTAACAAAGGGTAAATGTGCTCAATTTCGTATATTACAAAGCCATATTTGGACCTTTTTCCGTAAATTTTATGTAGATTTAGAAAAAAGCAACAACCTATAAGGGGTTGGTCTTTAAATATTGTCTTCATTTTTTAATGTACTTAAAGAATGAGCTCTGGACCTATATAGTTCTTCAGAGATTTTTCTATTGGATCGCTAGAATTTATGTTATATTTATTCTACTTTTATTGTTAAGTGTTCACAAATTTTATTCGATTAGCATGATTTTGTGCTAGTTTTATTGTTAAACAAATTTCACAGAATCGGCGTAACTTTATTTTATCTGCAATCGATGTACTTCTTAAATTGTTCATTAAATCTACCTGACTGGTATAATTTTTCTGTGTTCTTCTCTGCGCTTATTCTACATCCAGAATAACGA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 15

nuccore fasta 1000000 multi 5

Page 16: Introduction

GEN 3040

TATCTAATTAATGAGCTGCTATATAAATCGATGTAATAGTTCTCAAAAAGAAAATGAAGGAAGAAAAAACTATGTGGTGGGACAATATAACATCATCTATATATAAAAATTAAAGTGAAATCCAGGATTTCAGTATTAAAACTACAGGAAAAATTTATGATCGGTGCAAACTCCATAAAAAATTTCGGAAGTACAAAATGTGGAGTTCAAACTGATAAACAAACTCTAATAAATTTCTTATAATTTTTTTATATTTTTGTGACGAATATTATTGTTTGAGTTTTATTTTCACATTAAAAACTAAATATTGAATAGCTTTAAAATGATGGCTATCTGCCAAAAAGTGGCTATCTGTCAATTTCTTGCGAATTAAAAAATGGTATAGATAAAAGAAAGCAAGAAATTGACTAAAAGAGATATTGTTACAAGTGTCACGTTACAGAGATTATAGGTCAGCGTTATTACCAAATAAATTGACTTCTATATTCATAAAATAATTAATTATTAGGCGGCTCTTATGTTTAAGCGCCGCCTCCATCTTTGCCAAAGCATCCTTGAGATATATCCGTTTATTAGTCAAATGTTAATAAATATTTATGATTAATATCCATAGTACGAAAAGCCGCCATTCCCCTATATAAACCACTAAACAATTTGTCACTATATCCAACAACCCAACTTGTTAAAAAAAATGTCCAACAACATGGGCAACTTGAGATCTTCTTTTGTTTTCTTCCTCCTTGCCTTGGTGACTTATACTTATGCTGCCACTATCGAGGTCCGAAACAACTGTCCGTACACCGTTTGGGCGGCGTCGACACCCATAGGCGGTGGCCGGCGTCTCGATCGAGGCCAAACTTGGGTGATCAATGCGCCACGAGGTACTAAAATGGCACGTGTATGGGGCCGTACTAATTGTAACTTCAATGCTGCTGGTAGGGGTACGTGCCAAACCGGTGACTGTGGTGGAGTCCTACAGTGCACCGGGTGGGGTAAACCACCAAACACCTTGGCTGAATACGCTTTGGACCAATTCAGTGGTTTAGATTTCTGGGACATTTCTTTAGTTGATGGATTCAACATTCCGATGACTTTCGCCCCGACTAACCCTAGTGGAGGGAAATGCCATGCAATTCATTGTACGGCTAATATAAACGGCGAATGTCCCCGCGAACTTAGGGTTCCCGGAGGATGTAATAACCCTTGTACTACATTCGGAGGACAACAATATTGTTGCACACAAGGACCTTGTGGTCCTACATTTTTCTCAAAATTTTTCAAACAAAGATGCCCTGATGCCTATAGCTACCCACAAGATGATCCTACTAGCACTTTTACTTGCCCTGGTGGTAGTACAAATTATAGGGTTATCTTTTGTCCTAATGGTCAAGCTCACCCAAATTTTCCCTTGGAAATGCCTGGAAGTGATGAAGTGGCTAAGTAGAGTGGCTATTTCTGTAATAAGATCACCTTTTGGTCAAATTATTCTATCGACACGTTAGTAAGACAATCTATTTGACTCGTTTTTATAGTTACGTACTTTGTTTGAAGTGATCAAGTCATGATCTTTGCTGTAATAAACCTAAGACCTGAATAAGAGTCACATATGTATTTTTGTCTTGATGTTATATAGATCAATAATGCATTTGGATTATCGTTTTTATATTGTTTTTCTTTTGAAGTTTTAGTAAAGTCTTAAGCTT

osmotin {promoter} [Nicotiana tabacum=tobacco, Genomic, 2051 nt]

GenBank: S68111.1

>gi|545027|gb|S68111.1| osmotin {promoter} [Nicotiana tabacum=tobacco, Genomic, 2051 nt]GTCGACTTTTGTATCAGTTGATGTTCTGGTAGTATTACACTAGACTTAGTTTCGTAACTGATTGTTTTATAAATTTTCCGGTAACGTCCAAATATGTCCTTACCGTCACATAGTTGGTCTATATATCCCTTTTTGAAATAGAATCCATCCAAATTATTTAGCTCTTCCGTTAATTATACTAAAATGTATGACAAACACTATTTCCTTTTTATTCAGTACTTTTTTTTCTTTATCAATTTAACTTGACAAAACTCATGAATTCCTGTTAATTTTACTATTACAGGCCCCACAGGTTCCTTCCTGACCTCAAGGAGATGAAAATTATCGGAGAAATTTTTTCGGTGGTGTTAATTGGGTGAAGGTAGGATATGATTATTCAAATTTTGAAGATCTTGTTCCTTTCAGATCTGAAGCTCCACAGTGAACAACTCCTTCAAAATCTGAATAATTATATCCTACCTTCACCAAATTAACACCACCAAAAAAATTTCTCCGACAATTTTTATCTCCTTCTTTTCAGAAGAACTAGAGCCTTCTCGGTTGGAAGGTTTTGGTGGTGCAAGTTTGATTTTTAGGAATAAAATCACGTCAAATTGGTAACGAAAATGGAGAAGGCACCGGAAATGGAGGAAACCGGATATGGGAGAATGAAAAAGGGAAAAAAAGATAAGAAAAGAAAAAAGAAGAAAGAAAAGAGAAAGGTAAAGAAAAAAAGTACTAATAAAAAGTAGATAGTGTTTGTAATACACTTTAATACAATTAAAGAAAGAGCTAATTAGTTTGGGTGGATTCATTTTTAAAAAGGGCAACTATGTGACGGTAAGGACATATATGGACCAACTATGTGTATGGTAAGGGCATATATGGACCAATTATGTGACGGTACGAGTATATATGAGCTAAAGTATTAACAAAGGGTAAATGTGCTCAATTTCGTATATTACAAAGCCATATTTGGACCTTTTTCCGTAAATTTTATGTAGATTTAGAAAAAAGCAACAACCTATAAGGGGTTGGTCTTTAAATATTGTCTTCATTTTTTAATGTACTTAAAGAATGAGCTCTGGACCTATATAGTTCTTCAGAGATTTTTCTATTGGATCGCTAGAATTTATGTTATATTTATTCTACTTTTATTGTTAAGTGTTCACAAATTTTATTCGATTAGCATGATTTTGTGCTAGTTTTATTGTTAAACAAATTTCACAGAATCGGCGTAACTTTATTTTATCTGCAATCGATGTACTTCTTAAATTGTTCATTAAATCTACCTGACTGGTATAATTTTTCTGTGTTCTTCTCTGCGCTTATTCTACATCCAGAATAACGATATCTAATTAATGAGCTGCTATATAAATCGATGTAATAGTTCTCAAAAAGAAAATGAAGGAAGAAAAAACTATGTGGTGGGACAATATAACATCATCTATATATAAAAATTAAAGTGAAATCCAGGATTTCAGTATTAAA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 16

Page 17: Introduction

GEN 3040

ACTACAGGAAAAATTTATGATCGGTGCAAACTCCATAAAAAATTTCGGAAGTACAAAATGTGGAGTTCAAACTGATAAACAAACTCTAATAAATTTCTTATAATTTTTTTATATTTTTGTGACGAATATTATTGTTTGAGTTTTATTTTCACATTAAAAACTAAATATTGAATAGCTTTAAAATGATGGCTATCTGCCAAAAAGTGGCTATCTGTCAATTTCTTGCGAATTAAAAAATGGTATAGATAAAAGAAAGCAAGAAATTGACTAAAAGAGATATTGTTACAAGTGTCACGTTACAGAGATTATAGGTCAGCGTTATTACCAAATAAATTGACTTCTATATTCATAAAATAATTAATTATTAGGCGGCTCTTATGTTTAAGCGCCGCCTCCATCTTTGCCAAAGCATCCTTGAGATATATCCGTTTATTAGTCAAATGTTAATAAATATTTATGATTAATATCCATAGTACGAAAAGCCGCCATTCCCCTATATAAACCACTAAACAATTTGTCACTATATCCAACAACCCAACTTGTTAAAAAAAATGTCCAACAACATGGGCAACTTGAGATCT

N.tabacum ap24 gene

GenBank: X65701.1

GenBank Graphics Related Sequences

>gi|19782|emb|X65701.1| N.tabacum ap24 geneATATTATTGTTTGAGTTTTATTTTCACATTAAAAACTAAATATTGAATAGCTTTAAAATGATGGCTATCTGCCAAAAAGTGGCTATCTGTCAATTTCTTGCGAATTAAAAAATGGTATAGATAAAAGAAAGCAAGAAATTGACTAAAAGAGATATTGTTACAAGTGTCACGTTACAGAGATTATAGGTCAGCGTTATTACCAAATAAATTGACTTCTATATTCATAAAATAATTAATTATTAGGCGGCTCTTATGTTTAAGCGCCGCCTCCATCTTTGCCAAAGCATCCTTGAGATATATCCGTTTATTAGTCAAATGTTAATAAATATTTATGATTAATATCCATAGTACGAAAAGCCGCCATTCCCCTATATAAACCACTAAACAATTTGTCACTATATCCAACAACCCAACTTGTTAAAAAAAATGTCCAACAACATGGGCAACTTGAGATCTTCTTTTGTTTTCTTCCTCCTTGCCTTGGTGACTTATACTTATGCTGCCACTATCGAGGTCCGAAACAACTGTCCGTACACCGTTTGGGCGGCGTCGACACCCATAGGCGGTGGCCGGCGTCTCGATCGAGGCCAAACTTGGGTGATCAATGCGCCACGAGGTACTAAAATGGCACGTGTATGGGGCCGTACTAATTGTAACTTCAATGCTGCTGGTAGGGGTACGTGCCAAACCGGTGACTGTGGTGGAGTCCTACAGTGCACCGGGTGGGGTAAACCACCAAACACCTTGGCTGAATACGCTTTGGACCAATTCAGTGGTTTAGATTTCTGGGACATTTCTTTAGTTGATGGATTCAACATTCCGATGACTTTCGCCCCGACTAACCCTAGTGGAGGGAAATGCCATGCAATTCATTGTACGGCTAATATAAACGGCGAATGTCCCCGCGAACTTAGGGTTCCCGGAGGATGTAATAACCCTTGTACTACATTCGGAGGACAACAATATTGTTGCACACAAGGACCTTGTGGTCCTACATTTTTCTCAAAATTTTTCAAACAAAGATGCCCTGATGCCTATAGCTACCCACAAGATGATCCTACTAGCACTTTTACTTGCCCTGGTGGTAGTACAAATTATAGGGTTATCTTTTGTCCTAATGGTCAAGCTCACCCAAATTTTCCCTTGGAAATGCCTGGAAGTGATGAAGTGGCTAAGTAGAGTGGCTATTTCTGTAATAAGATCACCTTTTGGTCAAATTATTCTATCGACACGTTAGTGTAAGACAATCTATTTGACTCGTTTTTATAGTTACGTACTTTGTTTGAAGTGATCAAGTCATGATCTTTGCTGTAATAAACCTAAGACCTGAATAAGAGTCACATATGTATTTTTGTCTTGATGTTATATAGATCAATAATGCATTTGGATTATCGTTTTTATATTGTTTTTCTTTTGAAGTTTTAGTAAAGTCTTAAGCTTAAACCAATAAACAAAACAAACAAAAAAAAAGGTATTGATAATATTCTAGCATAGGACCCAAGACGTTGAAAGTATAAAATGGTAGGATAAGATAAACTATTTGGAGCTC

5.

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 17

Page 18: Introduction

GEN 3040

Solanum lycopersicum DNA, chromosome 8, clone: C08SLe0023D06, complete sequence

GenBank: AP009601.1

GenBank Graphics Related Sequences

>gi|165263516|dbj|AP009601.1| Solanum lycopersicum DNA, chromosome 8, clone: C08SLe0023D06, complete sequenceGAATTCTTCATATAATTAACTTCTTGATCATAATTTGCGACTAATTAAGCCACAAATTGTCACTAATGCCATAATTAACGATTGATCAACTTGGGGTTCCACTTCTAGAGTTAACACATCATCTCCATATGCAAATCCTCTCGACGATTGCTTTTGTTTTAACTGCGACAAAAAAAAAATCAATATCAGAATCTTTATACTATCAGTATATATAACTTGGATTAAATTTTTTTTTTAAAAAAAACTTTATATGAATTTATAATTTTTAAAGGACCTACCTCAGCAACAATTTCTCCAGTACTATTTGTAACTTTAAATGATGATTTCTTGTCCAATTTTTGAATCTTATAACAACTTTCAATTGATTTTTCATATCCCAAATTCACATTACATATAACATTTCCTCTTGAAAATGTGCAATTCTTTTTTACTTGAAACCATGGCCTTCCTTTAAATCCACCACATAAATATCCATTCCATCTACCAAAAACTCTCAATTTCTGCAATTAACATAAAAACAAAAAAAATAATTAATTTCAACTTCCATAACATAAAATATACTCATTCTGTCTCAATTTATGTATTAATACTGTTCGAATTTCGAGAGTTAAACAGATCCTATGTTTGAATTCATAGTCAAAGTTTCTGGAAATTCAAACTGTGTCACATAAATTATAACGGAATAAAATAATTTATTACCTCTTTTTTGATGGAAAAGAGAACTTGTCCATTAAGATCCATAAGAAAAACTTCATTGCTACATCTTTCTTGATAATTATCAACTCTAAAACAAAGTTCACCTTTGGAATTAAAAACAGTACATCCATTTCCATGAAAAACTAGTGATTTCATCCATATTGTAAATATTTCTTTATTAGATGTAACATAAGGAGATGAACAAGAAGAAGAAGAAGAAGGGGAAGATATATTTTCTGGATGAATTTTACCCATAATTAAAGTAAAATTAAATTTATAAAAAAAAAAATTGTAATTTTGAAGAAATGAAATCGCTTGAGAGAAGATTTTGGAATTTGATTTTTGGTGTGATGAGTAATAGAGAAGTTGAAAGGGTATAAATAGAAGTTAAAGAAACAACCACTGAAAAAAATAGTTAAAGAAATAATCAAGAAAGTAGAAACATTTGACTTTATAAAAACGTGTAATGTTGACGAAGTATTTTTTTTTATTATCATCACCATTTAATATTTTGATATTTACTAATTCGATTATTTTAAATTCGTGTAACGCAAAATTTATTACAGAGAAAATTATTTTTTATTAGAAGTTTTTTTATTATATATAGGCTTGAATTCGAGACTTCTAATAAAAAATAGAGAAATTTAATTCATTATACAGTGATAAAACTACGTATATAAGAGATTGAGTTGATCGATTGATATTTTTCACTGAAAAATCACATTATATATATAAAAATATGATTATTTTATACATTTATCTTTTCATATTTTTGATATTTCTCGATAAAACTCTTTTGATATTAAATGATAGGCATTGATATATAGAAAAATAATAATAATAATGGTTATACTTAAGTTTTTTATTTTACACATTCCATATTATTGTAAGAAAACAAGTCAATTTAATAAAAAATTACTTATTTGAGATAATTTCAAAATAATATTGTGTCAAAGTCAGTTTAAATAAAAAAAATATTTATTCAAAGATAATTTCAAAATATTATATCAAAACTTATACATAAGAAGACATAGACAAAGAGGTTTTGTCTTTATTTATTCACATCCAGTCTGTCTTTGGGCAAATAAATAAAGTAGAATATGACTCTTCATAGAGTTTATTTTTATTCTTATAATATGTGTGTCTTTGTATACCCTATATATAATATGCTTTTTTTTTGCCATGTGAGGTGGCCTTATGGAGAACCACTAAACCACAAATCATCACAACATAGATTCATATTTCTCCTTTATATCCTCCTTCACCTAGGAACTAAAATATATTCTTTTTATAGTTTTGTGTCAAAAGATTTTTTTTTTAAAAAAGTTCCAAAATGTGGTCTATAATAACTCACTAGGCAATATATCAAAATCTTGGAGGTTCACCGTGGAAAAATATAGTGGGTCGAAACTTATCCATTTTAATCAGATATTTAGAGTTTGATTTTTTAAAATGAGAAAATTTATATTAGAAAGTGTTTTCTTTTTTTAAAAAAAAAAAACTTTTATGCAACGTGAATATAAATTAATGTTGTAGTGAACCCTTAAGTTGTAGACAAATGTGTGAAAGTATCAAACTGATTGACAATATGAGTTTGCTATCTTTTTATACGATTTGAAATTATTAAGTTTTTAAGTTAAATTAAATTTACGATTTATTTTCTTGATATATTAAAAGAGTGAATATGTTATCCACGCTTCAAATGTACATGTTTTCTTCTATGATTTTTGCCCAAATTTTGAATTACACAACATTGGGCTCCAAGATATATTCTTTCACGTTTCAAATGTCCATTTAATTTCTTAATTATAACATGATATTTTCATATTAGTCAGAATTTTAGTGATTTTAATAATCATTATGACTTTTTTAGATTATTAATATTTGTAATTCATAAGTCAACTCCTATTTTCTTAGTGACATATTAATCCCAAATCAGTGGGGTTCCACTTACGTTTGTGTAACAACCACTAAATACTTCTTCAAAAAAAAAAGAAAAAAAAAAGAAAGTGGAAGTGTAATAACTGTTCAATGTCAAAAGAGACACAATTAAATTAGAGATTTTAATATATGCACTTAAGGGTAAATGAAATTAAAATAAAATGGTGTCAAAAGAAAGGATTATCATTGATTAATTTAATAAATAGTTGTTGTTAATTGTTTGTGTGCATCGTTTTGCGTGCTTATCCCTTAGATGAGGGAAAATTAGGTATTTGACACTGACTAACTGTTTATTTTATATTTTATATAAATTTGAGATGGTAAAAAATGTAATTTTAATATAAATACAAAATTCAAGAAATTATAATTCTACAAATATACCCACCCTTAGATATCAATAAATTTTATAACAGAATGGTAATATTCAAATATGTTCCTTTTGATGATAGATTTCTTTTTATTTTTTAATAATTTAAACTCGAGATTTCTAGGCCGTAATAGTTACGATCAAAAACGAGTAAACTCACACATAAAAAAAGTTGTTGATGATGATTGTCCACTGGGAATTTTGCATCATGAAAAATTTAAAGTTAAGAGCTGGTAAAAATGACAACCATATTCTTCTTTTAGTGGAATTATAATAAAGACAGGCA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 18

Page 19: Introduction

GEN 3040

CCACCAGTAACATATTCTCGACTTATATATAGGATGTGACATTTGATAATATTCTTTTAGTAAATTTTAAGTTTAGCAAATATTAAAATTATATGTATATGTTATAATTATAATTTGTATAATTACATGAAGTGTCAGATCTTTATAAATTATTGGTAAGTCTTTCGTTTGTATAAAAGCGCAACGATTATATATATATAGGTTAGACAATTGTACATACGTAATTACTATGTACATATACCAATGATTGTGTTTGTATATATGTATAAAAATGAATTTGTATATAATTGAATCGAAATAAAATATTTATATATCACATCTCTGGATTTATACAACACAATTATACAAATTGTAATTTGTATAATTTGTGTTTGTATAAGAGAAAGACAAAAGAGAACTGGTAGAAGAAGATCTGTATTTGTATAATTATAAATTTATAGGATAAAAATACATGTATTTGTATGTGTATATACAATTTTAGTTCGCTTTACACAAACGCAAACGCAATATATAGGCGTGGAAGAGCGGCGAGTGAAATTCTTTGGAGAGAGAGGCAAATGACAAGAGTTTGATATGAATTACAATTAAATGAAAAGTATAATTATAACATTAAATTTGAACTAATAATTTGCTATTTTATATAATTTTTCCTAATTTTAAAGTTAATGAGTTGTAAAAGAAAAAGATATGTTTATCGTAAATAATTCAGAAAGAAAGTATGAAACAAATAAATTTCAACGAAGAGAATAATAAAAAATCGGGAAAAAAATATATTATCCTTTTAAGGGATTTTTCTAATATTGGACGGCAAACTCCACTATGTACCGACAAGCAAAGTTAATTTGATTAGAAATTATGATGAATTTGACCTTAATTTGATATTATTATTATTATTATTATTATATAACACTGAAACTAGACTAAGGTGAAAAGTGTCATGATATGTAATAATTTAGAAGCTTGAATCTGTCATAGTTCTCAAACATATATATTTTATTTGTTTCATAATTTACTTTTTTCTTTCACGATCAAATCGTTTTAGTTTGTACATAAAATTTATATATTTGTAAATTATATAAAATATCTTAAATCATAAGTGATAATTCAAAATATTTAAAAGATATATAAAAAAAAATTATGATAAAAATAAATTTATTTGAATCTAAAAAGCTAAAAGGTACCACATAAGAAATATGAACTTCTTGTTTACAAATATTTAGAAAAAAATTAACAAGGTTTTATATATTAATTTAAAAACTTTCTTGTTAGAGTTTACCATAAATATCTATGCGAATATCTTTTTATAATCACATAAATATTATGACATGACTAAAATTATATTTTTTTCTTATAATTATACAAAAGTTATAACATGTCCAATATTACGAGATAAAAAATATCTTCTCTATTTTAGACTACATTTAAATAAAATTTACATCTCATATATTAATTAAAACTATATATAGGGGAGTATTATTCTTAACACTGTTTTCTAAAAAAAAATTTGGGGAGAGTTTTGCGGAGGGGTGTATGAAAAATGTTTGAAAATGCAAATTGAAAGACATATTCATAAGGTGTGAATCTTAGAATTGGGACGTACGCTTTATTTTTCTTTTCTTTTTTATAGTTTTTGAACCTTTTTTTGCTTTTAATGATATAATGACATTTCTCAAATAGTGATTATAATACTTCTTTTTTTGTTATTGATTTATTAATATTTTTCTTAAATAAAAATTATATTATATTTTTCTATATATTAGAGGTTATTCATGAAAGATTACTTGAAAATAATTAAAAAATTTATTTTGCATATTTTCTTAAGAATAAAACTATAAAATTGAATATACATGAGATTATTTCCCGTAGATCTATAAGACTTCATTCTGTATTTTAAAAGTTTTATCTCGTTTCATACTGCATAAAACGTCTCATCTCACCTTTCATCATATAAATTATTATTTCTCTAGATTTCATTATCAAATAATTATATGAATTTCTCAATTATTCAATATATATTTTTAGTTAATGGTATAAAATATGTGATCATATTTAATTATCGTAACATGACAGAATAACAGTCGGATGATGGACGTGGCATGTTTTTGAACTAAAACTTATCACATGTTAACTTTTAAAAAAGTTGATCCAAATGCTTAAAATATGATTGGATAATATATATAGGACAATGAGTAAATTTATAATTTTTAGGATAATTTGTATATAAATAAAATAAATTATAAGTGCGGGTTGATGCATGCACACCAAAAAGTTTGAGTTTGACAGAGCCATTTATATTTTTATAATAAAATTATAATTCATCACTATGATTAGAACAAATTAAATATCCACTTAGGGTAATTTTTTAATTTTTTTTGGAAATAAATAATTAGTAGTGGTGCTAATATCTATTGAAGAAATTTAGGTTGTCTTTTTTACTTATTCTTGATAACGTATTAATGTGCATGCTGACATTGGTGAACTTAATTACGTAGGAAATCATTTTTTTAATGTTTACAATGGTTTAATGTTAATTTAGTCATAATGTCCTAGTAAGCAAGCATTTACTTTACAACCTGTTTATAATTACCTAATTGCTTAAACCATCAAGCATACCATTTTGAACACCACTAACTTCTTTCCTTTTTCGTTTTCGGTGTTCGATATTTATATTGTAGTCTGATTAAGTTAGAACTTTATATTTATACTTTTTGCATTCGATGTTTAACGATTATATTGAAGTTTGATTAATTAGTAAATTTAAATTTGCACATTATCAAATACACTCGATATTAGATAACCACATTGCAGTCTGATTAATTTGATTTCGTACATTACATCGTCCATTTCTTAAGGTGACGCTTTCATTTTCTCCCTATTTAAAAGGGACAATTTAATGAGGGAGTCAACCATCGATAAATCGTAAATTGAAATTGATCATACTCTGATATCATGTTAGAAATGAATCTTGAACCGAACTCAATAAAAAGAAAAAAAAAGTCCACAAAGTGAATATTGTTCATATAAGGGAGACCAAATCTCTTATTCCTACTCAATGTGGGACGGTCAACAAAAACATAGAGACTTTCGAATATTAGTGAAGAAATTTTAACCATCCTAACATAAGTCATATTTTTTAAAAAGCAAATGCCACTTATATTAGAAATTACCTTATTCGTTAGGCCACGTACAATACACCATAACTCACGTTGGTGTGTCGAACAACATAATTGATATTTGACTATTTTATTATCTATATATTCAAACATCAGCAGTTATTCAAATCTTTTTGGAACATCAAATTTTCAAAAGTAAATAAAAAGTAACTTAAAAAGTGTAGATAGACAAATTCAACTTGAAATATATACGAGAAAAATAATTTAGTTGCTGATGCTCAACTTAAAACAAGAAAAAATGAGCTCATTGGTAATCATGAACCTAAAATAATATCAACAAGGAATACGTTTGAAGTAATTTATCGTATGTGATTTGGATAATTATTCCTTTATTAATTGTTAATTAGGATGATGTGTTTTTGTTCCTAGTATATTAAAATAATCAAATTTATTTCACAGTTTTATATAAAATTTTAAAAAAAGAAGACATCATCTTAATTCTTAGCTACATCGTCTTTTCATGAACAAATTTGATGATCTAGTAATATTTATATAAAGAAATAATTTAAAGTAAATAACAATTTTTTGGATATAAATTATCAAATGTTGTAAATATTGTCATTACAATATATATAAACAAGGATATTTTAATAGTTAGGTTGTCTCACCAGGTCCACCATATAAAGCGACGTGCTCAAATTTTCCTTTCGTTGGTGGAATATCGTAAACCCTATAAAAATCGGGATTATCAGTGTGCGGGATCGTATTTATCATTATCGTCTCACCTCTATAGTTTAATGCAATAATTTTATTGCATGAACCATCGTACCCTTGCTTTTTAGTGGGGAAAAAACTTGAACCCATTGGTGGTTCAGGT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 19

Page 20: Introduction

GEN 3040

ATGTTTTGTGGGCTATATGCTACTCCTCCCCATTCTATGTGTGTAGCAAAGCTAGCTAACTGAGTGAAGATGTTTTGAGGCCAAAATCCAATTTCTCTATGCGTTTCTTCAACCAAAACCCACCAATTTCCGTTGGCTAGATCCTTTTATTTCGAAAAAAAAAAAGATATATATATATATATATATTAGTATCTCATTTTATATTCATTTTTTTTTTAAAAAAAAAAACAGTTCTTCAAAAGACAAACATAACATATACCCGATCGAGGTACATTGAATATTCCCACAAGTTATGAATATCACCACGTTGTGAAACATTACTAAATGGCGCTCCAATGCCTATCTCAGTGTCTACCAGTACAAATCCAGGACATAGTGTATCAAAGCAACCTATATTACCAGCCTATATATAGACAGTATTAAAAAATTTTATGAACGCGAAATAAGAGTTAACAAAATATTCATAAAAATATCAACTAACCTGAAAATGTATGAACATTCTAGTTAGGGTGTCGCCATATAATGTTGGATCCACCTACGTACACAAAGAAATAGAACTATCATCATGATAAATTGAATGGTTATACACTTTCTAATCTTGCATTATATTCGAGAGGTGTCATATTCAACCCTCTAAATACGAGATAATTTATTTATAACAAAATAAGCTATAGACAAAAGCTTACTCTCCAACCGATTTGTATAATGTCCGATGATTTTTGAACTTTCAATCGACATCCACTGTGTTGTGGAGCTTTAACAGAAGGATTCCATAAAGTAGATGTCATTCCAGCTCCCCCAAACTTGTAATCCAAATCATCACGAGTTGAAACTATTGCAACCTTCAATATTTTTATAAATTCTTTTTATTAAGAAAAAAAAAGACATCAAAAAAAAATTTAGAAAATTAAAAGTTCATATACCCTATATCCTTGTGTAGATAATAATCTTCTTTTCAATCCACTGTGATTCCTGGTCTTTGGAAAAGTAATAAATAAAAACAAATGAGATTTGATTTTAATGGTGAAAACTAAACATCCTATACGTTTATTTATATTAATATAAATTATATATATAATTTAAGATCCTATTGATGGAAATGAATTATTAATGATACATCAATAATTAAATTGATTTGACATTATCTTACCTGAGAAAATTGGGTATGAACTTCTGGTGGTGGCATATGTTTTTGTCTGATAAGATCATCTTTTGTAATTCTTTTAATCGGAACTGATCCAAAAGGACAACCACCATCATTTAGCCATATCGTCGATGACTCTTTAGACTTTGAAGTACCTGAAGATTGCTGTATTCTAGATAACGTAGGCTTCATCTATTCACTCAAAATTAAAATTCTAAATCAATACATATATAAATCCTTGTTAAAGGTCGATAGTAAAATATAATTTGGAACGACATACATTTGGATGAAAATTGTGGTTCTTCAAAGATGGATGATCAAACGCAGGTTGCTTGTAAAAATCCACACAATCATATATATCTCCATATGTAGTCTGCGACTCAAAATTTAAATAAATAAGTCATTTAGGTTATAACAAATGCATATATATTGTTATTCAAATCCTAATATGAAAAAATACCGTAATTGTTTTGATTGCAGGCTTGTTCAAAAGTTTAAGTTGACTTTCTAACTCTTGATCCTCTAATTCGGATAAACTTATTTCTCCTTGAACCTCATTATAACTCAGAATAAAATATAACACTAAAATTTGTTGAATAATTCCTTGATTTATCTGCATTGCCTGAACTTTGAGAGAGCATATGTTAAATAGATTTAGAGAGGAGATTAAAAAAAAAATTGTTATGCAAGTTAAGTAATGTAATTTGTTTTAAAAGCAAGATTATCTTAATTTTGTTAAGAATTTATTATATTAAATGCAAGTTGACTGTTTGCTTTAAACTTTAATCTACAAAACAGCTGTATCATTTGACCACTGAGAATATTATCTCATTTTTAAAAAAAGGCAAACTCAATTTATCCTGATGTATTGTTTTTTCTTTTATTTCAATTTAAAAGGTTGATGTGTTTCTTACAAGCCATAACAATAATAATATTTCATTATATTTAATATAATTTTATAATAATTGATATATTTTGGTTTTGTGTTTTATGTTAGTATATATCAAGACATAACACCAAATTAAAAATTATTCACCTATATGAATATATCAACCATCAAAAAATTATATGAACAAGCGATTTCGATTTACGTATTATTTTCTATAATTATTCTTTAAAAGAAATTACATGATAGCGAGATTTTGCAAGTGTATAACATATATACAGTTATACATGTATCTGTATATCTAAAAAGTTATAATTGACATTACTCTCACATACCTCCGCGTGATTTACAATAAATATGGGATAATATTAAGCCACTAATTATTTTATTTTATTTTTTGCAAAAATGGACGCTAATAACACTATTTACTAAATGTTGTCACAATCAAAATATTAATACAAATAATCCATAATTCCACCAGTCTAAAGTTTGTCTTTCCACTTAAATTAACGGGCCTAAGATTTTGCCCTCCCAGCCCAAAAAGCTTGAAAAGCCCAGACAAAGTCTAGAAAAATAAAGCCCAATAGCTGTTGACATTAGGTAAATTGTGACTTCAGTGGACGATTATTTATATTAAGGGAAAATTGTATATAATAGCAAATTAATAACCTAAAATAAATGGAGTAGCTAGGGTTTGATTTAATTGTGCTCCATAGCAAACATTTGCAAAAAATTGTCAGGCGCCTCTCTTTCAAATATCTCGCTCGCCACTCTCCTCCAGTTTCTCGCTCGCTTCCTCACTTTTTATACAAACACGAGTGTATAAAAATTGTTTCTAATTGTATAAAGCAGAGAAAATTGTATAAATACATATATTTTTGTTCCCCTCTCTCCCCTCTGCCAGATCTCGCTCGCCACTCTCCCAAATCTCGCTCGCCACCCTCGCCTTTCTCACTTATACAAACAAGAACCGAAATGTATAAATTGCATTTCTGTTTGTATAAAACGTGAAAAAATTGTATGTACACATGCAAGTACATATATTTTCGTCCTATACATTTATAGTTATACAATAAAAGTACTCCCTGCCCAGTTTCTTAGCGACAAATGGTTAAAATTGTATGTGCCATCCTAAATTCTCGCAAATTTCTTGCTTCTCCATTCATTTTTTTTAAGCAAATGGTTAAAATTATTAGACAAAAATAGTAAATGATCTAAACTTTTCCTTTTTCTTTTCTATCTTTGTAACTTTTAGGGAATAGAGATTGAAATCCTGGCTATATAGATTTTTTTTAATATATATTTGATAGCTCAAAAGGATTTAAAAATCACAATTCTCATCTAGACTTTGAAATATTTTTTTTTTTTAAAAATGGCATTTACAGGGGCCCCACAAAAATGTAAAGTATGTGAAAAAACTGTTTATATTGCAGAAATGATCTCTACAAGCGGTGTTGCTTATCATAATACATGTTTTAGGTGTAACCATTGCAATGGAAGGCTTGCAGTAAGATCTTTTCATTTAATAATTATATCATCTTTTTATATATAAATTATAACAAAATATAATCATGAAATGTATAGGTTATCTATAGTTCTTAAAACAATTTATTTCATCAACATTAATTAATCTTTTTATAATACATTTTGATTAATCTTTTTTTTTTAAAAAAAAAATAAAAATTTGCAGTTGAGCACTTATTCAACTCTTGATGGAGTATTATATTGTAAACCTCATTTTGAACAAATCTACAAGGAAAAAGGAGGAGCTCCCTTGAAGCATTCTGCATCATGTAATTTTATTTTAATTAATACACTTTAAAATAATTAATCCTACAATTAATTTTAATTATCTGATTATTTATTTGTTTCCTTATTTCTTGAGCAGTGGGAAGGCAAAATGAACTGGTATGCATCATTTATTTTACATTCTCTAAAATTATTTTTACAATCAGAATTA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 20

Page 21: Introduction

GEN 3040

TGAGCTAAAATATTTGATAGACGTCCAAACGAAGTTAGTATTTTTTGTTTAAATTCATTAAATATGTATAAATATTGAATTATGAACCCAATAACTAAAACGAGCCATGAGTCAACGACGTATTTAGAATTATAAACTTTAAATTCTGACTTTGCCTGCGTTTATATTAATTTTCTGTATAATTTTATTGTTATTAACAGAACAGAAGCTCTAGTAAAGTTTCGGCTTTATTTTCTGGTACTCAAGACAAATGTGCAGCTTGTAAAAAGACTGTTTACCCCTTGGAGAAGGTTACTATTTTTACTTTTATTTTATTAATTTCTTTTTACAATTAATAAATATCAACAAATTTCTTCTTAAAATTAAGACTAATTCATGTCAACAATTTTTTTTTTTAGTTTTTCTCTAGTCTTATAAATGTACGGAAGTAATTTGTTGTTTTGGAAAAAAAGAAAAAACTAATTTCCACATTTTGTTCATGCTAATAAAATTAACAACATATCTCCAAGTTTTATTATCTCGGTTAATTCTGATTTTGATTTTTGTGATATATAAAACATAATATCTTTAATTTTTAATTTATTAAAATGTGTATCTTTAGTTCATTTACATATCGGGAATATTTCATATCTGAACTATTTTCAAAAACAATTTACCCTGCTCATCATCTGATTTTATATAGAAAGACTAATTTTATTCGGACATTGATATAGCGACAATATTTTTAGTGAAACTTAAGAAAAAGAGAGATTCCTGATCAACTTATTGATGTTATGATTTATCTTAATATCGAAGTAAATCAATCAAAAGTATATATTTCAGGTAATTAAAGGACTAAATTGTAATTTACCAATAATCTAGGAGCTAAGAGTGTATGAAACGGATGTATGAGATAACGTTCTCCGCTTACTTAAAATCTTGAATCTGCTTCTGAATAAATATTTTTTTTCAGGTGACGGTGGACGGTGAAATGTACCACAAGATATGTTTTAGATGTGTCCATGGAGGTTGTAAACTTACAACATCATCTTATGCTGCATTTGATGGAGGTCTCTATTGCAAACCTCATTTCTCTCAATTGTTCAAAGAAAAAGGTTCATATAATCATCTAAGCAAAAATGCTTCAATGAAGAAAAATGAAAGTTTAATCAACGTTGAACAAGAAGGTTCTAGTACAAACACAAGATGATAAGTCATCCTATGTTAATTTACGTATGAAATTTTATTTAACGTCAATACGTTTATATTCAGATAATTGTAAAGGGTTATGTTCACTCATATGGTAGAATGACTAAGTAAACATGATAAGAAAAAAAATATTGTTTGGTAGTTCACTTATTTAGAAATGCATGGTAGAAAATAAATATATTAGAGTAAGAAGTTTACTAATTTTTTCCTATAAAAACATAATATATTTTTATTTTCTTATCGATTTAATCAAACTTCGATACAAATAACTATCGAATATTTTTCAGTTTGAAAATTTAATAAAATGATGCAAATAGTATGACATAAATAAAATTAGGATCTACTTTATTTTTTATATTTAAAAAAGTGAATCTATTCTTATGCCTTTTTTCTGATGGCAACCCCACCAAAAAAGTGTTCAAATAATAATTGGTGTTTGTCTTAGCATTGTTGAGAACTGCCAAATCCAATATTTTTTTTGATAAAATTGTGAATTTTAATTGCAAGTATAAGTTCGTAACAATTGAAAATTATTATTAATATTGTTAGTGAATTTATCAAATTTAAATAAAAACAATTCTCTTCGTCTCAATTTATATGATATAATGTTACTGAATTCAGAAATTTAAAAAGAAAATATAATATGAGATATCTAAACAGCTCAAATACATCCATACCGATTTGAAAGGAATAAAAACAAAAAAAAGATAATAATCTAGACCAACTAACTTTATTAGGAATAGTTCATTTAGAATAAAATATCAATTACCTAATCAAATTTGTTTCTTTTTGCAAGAAAAAAAGTATGAATCAATTTACATCTTTTTCTTGTTTTTATTCCTTCAAATTCTTACTACTCACTACATAATAAATTATTATATTTTAGGCCATAGACTCATAGTATTAAATAAAATATTTTTGGTACGAATATAACGACCATTAATATTCTAGCTTTAGTTTACTTTGATTTCACAATTATAAATCATATTGTATGCTCCTTCAAACATAATCATGTCTGCTATAAATTTATAAATAAATAATGCAATCACTAACATATTACATACCTCCAATAATTTTACAAATATCTCAAAGTAACAATTAATAGTGAATATGAAAAATAAAATACTAAGTGAATATTAGATAAAATTACTAAAAAAGTAGAAAATGAGAAAAGAAAATTAGCTTCGTTATTTTTAATTTTCAAATCTTTTTTACTGAAAAGCGATGTTATGCTATAACGAATAATAAAAAATAAAAGGAAAAATATATAATATCAAGTGTCACAAAGAGTACACTGTACAACCAAATCAATAAATTTAAATTCATAGCTAAGCAAATATTTTAGTAAAAACTATATATTAAAATTCTCTGTTAGCAAAAATCCAAACTTGTGAGCTACTTCTTTTTTCTTCTTTAATTCAGTTATCCACCATATCGTGTCGTGTAGTATTTTGTAAACAAGAACCAAAAATAGAAATACTCTCCCTCCCTCGTAACTTCGGTTACACTTTTCAAATTTTGAAATTTAAATAAATTTAAATTTAACTATAAAATTTTTATATATTTTTTAGTTATTCTGAATTGTTAATTATATATTGTGACTTATAATATTTTTTACGTAGTTTATAAATATATAGTTTCATTTCAAATTTCTTTTTTGAAGATTTCACGTGCAAACTTAAATGATTTGACTCTTGAAAAACGACAAGTATCTAGACGGAGAGTGTAACCTTTTTCTTTCCCATTTTACCCTCATCTTCTTTACCTTAACCCCACAAAATTGCCCTCTCTTTTTTCAAAAAAATCATCCCTTTACCTTCACTTTTTTTCTTGTTTTCTTGCTTTCTTGCACTGTAAGTGCCACTTCTCTGCATAAAGTTTCCATCTTTTTTGTCTTTAAGAATTTCTGACACCCAATTGTTTCAGCTCTCTGGTCTCCTCTGTCTTTTTTTTTTCAATATATGTTTTGGGGTTCAAGAAAGAGTAACAAGCTCAAAAGGACAAGAAAAGGGGAGGATGAGTACATCAACTAGTATTTTTGTGATAAGATGGATCAATTTTCTTACAATGGTATGAACCCCTTTTCATATTTTCAGTTTCTTTTGAGCTCCAGCTGAGTTTTCCAGCTCTGCTGCCTTATGTGAACTTGGGTTGTTTAGTTTTTGCTTCATTGAGTAGAGAAGATGTATTTGTAAACTGGTTTGGGGTTTGTGTTTCTTGATGTGGTTGGCAACTTTACTTTGATGTTTGTTAGTTTAGTGTTTCAATTTATTCTGAAATGCTATTGCTTTTGGTTCAATGTGATTGGTAGTTGTTCGGTGAGCTTGATTTATTTGTTGTTTTAGGAGAAAAGAACCAATTTTTACCTCCTTTAGTTCTAGTTTTCAATCTGAGGTAGAATATTTTGCTATGTGTTTAGACTTTAGCATGGTGGATAATGTTTTCTAAGAAAATAAGTGAGTTTTTTACTTTTTTTGTGTGTTCGACAAGTAAGCAAAAGATATTGTCCTGAAATGAGAGGTGGGATTTGGATGTGAGATGCGGTGTGTTGGAGGGTGGGGAGAACACGATCAATGTGGAATATGACTAGAAAAACTTGTTTCTCGAAGTCATTTTCGTCATTTTTAAGAAGCTTGTTTTCCTAGAGAATGTGTTTTTCGAAATATTTGACCAATCTAACGTGGGAAAATTGGAAAACATTAGTGAACACATGCTCTATCTTTTTTGTTTTCAAATTGAGGTTTTTCTAGTTCTTAATTTAGAGCAGGAAGTTTATCTTTTTTGGTGGTGTTTAACTGTTTT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 21

Page 22: Introduction

GEN 3040

TGTACTCCATTTGGATCTTGAAAATGCAGCTGTTAGCTGTGGGCGTCATAGGTTTTGGGATTTGGATGAGCACTCATCATGATGTCTGTCGAAAGTCTCTCACTCTGCCTGTTCTAGGTCTTGGAGGAGTTATATTCGTCGTGTAGGTTTCTAGAAAATGTTTTCCTTCACATAAAAACGCACTTGTTCATTTAGTTTTCCCTTTAACCCTTAAACCACTCAAGGTTTGTTTATGTATATTGTGATAATCATTCTTCTTTACATGATTTTCAGATCAATTGTTGGGTTTCTTGGAGCATGGAAGAACAACTCCATCTTGTTGTGGATTGTATCCTCACTTTATCAAATGTCACATGAAGTGCAATTTTTTTTCTTATTCTTGATTGACATAACATTTTTTGGTTGCTTCTCCTTTACGTTATATTTCAGTATCTGATCATGTTGTGCCTAATTTTGGTGGCAATATTGGTCTTCACCGTTTTAGCGTAAGTAATGCTTCTTGGAATTTTATTAAGAAATTGGTGCATTATGTCATTTTCTGCGCTAATTATCATGCACACAATCACTAAATGTGTAATTTTTTTTATGGAGGCGATTTCAGTACAAAGTTAAAAGGCATGACCAAATTCGTAACTGTAGTTGTGGTCCACCAAGATATTTTAGTCATTGCTATGTTTATGATCCATTAAAAGTAACTATAGTTTTTTTTTTTGCATCCTGCTTGTTTTATGCTTTACAATTTACAAGCCATAATGGTGTATTATTGAGAGTTCTTTAGGAGTCTATCAATAGCTAATTAGCTACAGCACTGTTCTTCATTGCCTCTGGCTTGGTCCCCTGACTGGAACATACTCTACTTCTTGGAGGCTGGTTTCACCGCCTTGGCTAAGTCACTTTTATTAGGGAAGAATAGAAATACAATATCATGAATATCCTTAACTAGTTGTTCTATAAGTTCCAGCATTGTGTGTCTTGTTCCTGATCCATATTCCTTCTGCGCAGTGGGCTTATATTTTGGATACAACTTCTAATGTACTAAAATGAGCTTTAGAAAGAGTCAAATGGTGTATAGATTTGTTCAAATTTGGTAAGCGCCCTAGCACCGCCAGGTTGAAGCCTATTTCCTTTTTCATTTTAGAGTAAAACACATAATGATTATTTTATAAAGGAGAATTCTTTCTTCAAGTATTTTTTTCTTTGGCCTGTTAACCAGCAATTGGTAGAGGCTTTAGCAGTTAAAGTAGCAACATTAGGTGTAACGTTCATAGGTCTACTTGAATTCTTTTATAACCAAGAGACTGGTTGTTTCTAGCTCCAAAACTTGTTGAGAATTCGCTCCTGTACCCTTCTCCTCTTTAATACCACATTTGGTTCGCAACAGGGTTAGAACTTATTGCTTGTGCTTACCACAAATCCCGCATCGTACTACCTTTGCTATTGAACCAAATCCCTGAGTGAGGACTCCTCTGATTCTACTGGAACATGATACAATTTCCAATGTTAGTGTGGCAAGCCACAATGGCTCAGCTGAGACTTCAGCAGGTGAAGTAGTAATGCTACATGCCGTCTAAAGTGCATAAGACTTTCTTAAGTCCTTCTGGACCATATAGGAAGTTCCAGTGTTACATCAGAACTTATTGATATCAACCCAAGTAATGGTACTGCTCCCCTCTTTAGTATTCGATGGGACTAATTTCGTGTATTGGTCAGTATGTGGATTAAGCCACTAATCTTGTGAGGAGACTGGTGAAGTAGTATCGGCAAAGGAAGAGAGACTTACATATGGTGTTCAGTGACCTTGAAAAGGCGTATGATAAAGCACCTACAGTTTCTTTCCAGCCATACTGCCCGCTTAATGCAAAAAGGAATGCACAACTAATCTTCTTCTGCTTTTCCTTCTTTTTTTTTTCACGCAGTGACAAAAACTACACTAGCAAGTGATAGTCTAAGCATGAACCAGTCCAGCTTGGTTTTCAGTTAACTTCATTACTAGTTCTATCTATAATCTGTCTTGAACGTGGAAATCTAAACTGGTTTTCCATATTCAATCCTTCATTAAACGGCAACTACTGCTGTGTCAGGGTTAACAGGCATAACTCACAAGGTGGCAATTACTGTGTCTATGTGCATGCTGATGTCAGCAATGGCTATTACCTACATATTACTGTGAAACTTATTTTGGTTTGCTTTATCACAGGTTCATTGTGACGAATAACAGCTCAGGTCACAGTGTTAATGGACAAAGGTACAATCACCTGTCATACTGTCTGTGATCACAATATCTCCTTCTCTTCATCCTTCAGTATCTTCCCCTTTATCCACTCTTTTTCTCGACCCACGTCTTGCTCAGTCTCACTTTGGTTCTCAATGTGTCTCAAAGACATGGATTGTGCTTGTATATCTCATCATGACCGTCACAAGTTTTTCCAGTTTTCCTTTTTCCTTTTACTTCTGTGATATTCTTAGAAGAGTGAGGTCACTAATGGTTTTTGTGTGGGCAGGTACAAGGAGTATCAACTTCATGACTACAGTTCGTGGTTTCTAAAACAAGTAAGCACCTCTTGTATGCAAATACCACCTTTGTTACAATTTATGTGACACACTTTCCTTTTTAGCCCGTCCCAAAAAGAGTGTCACATTTCTTTAATTTGAAACAATTTAACTTAAATTTCCATCTTACTCTTAATGAAATTATGTATAGCCACAGGACCACAAAAATTCAAGTCTTCCTTTCTTTCTTAAAATTATGTCAAGTCAATCGGTGCCACATAAATTGGGACGGAGGAAGTAATAAGTCGCAGGTTATAGGGCAAAGGCATCATGTCCTGATCGTTATAGGAGTTGTTTCAGATGCCTTAAGATGAAAATTTCATGCATTTATTAACCATTTGCTCATTTGTTTCAGCTCAATAACACCCATAACTGGAAGCATCTGAAAAGCTGTCTTGTCAAGGCTGATGACTGTAGTAACCTATCAAAACGATACAGGGTCATGTTCCGACTTGAACCTGTTCGTGATTTATATCTGTGAAATCAAATTATTAACGCTTCTGCTGTTTTGTTGCAGACTGCCAAGCAATACAAATTAGCAAAACTAACGCCTGTTGAAGCTGGTTGCTGCAGACCACCATCTGAGTAAGTTATCGTTATTATTACTTGTTTGTTATTCCTTTTCTTTCTTTTGCTAGATATTTTCGTTTCATGCTACTCCGAGTTGTGCAAGTCACATATATCTTTTGTATATGTTGTTTCACCAATTCCTCTATATCTCGCGTGTTTGATATTGAAATCCTTACTAGCATCATCCCATTCATTCCAAGGACTCAGAAATACTCTGCTCCTTTTGTAGATGTGGTTACCCTGCTGTTAACGCCTCATACTATGACTTGAGCTTTCATCCGACGAGTTCAAGCAAGGACTGCAAGCTTTACAAAAACTCGAAGAACATCAAGTGCTATAGCTGTGATTCCTGCAAGTAAGAGTCATCTTCTGGTCTTGTCATTCTTCGCTTAGGTTATGCAGCTCACGGCTCTTCTCCTACTCTATTATCTTACAGGGCTGGTGTTGCACAATACATGAAAACTGAATGGAGAGTGGTTGCTATCTTCAATGTGATTCTCTTTGTCGTTTTGGTAAGTTCACTCTTCTAATTAACATCTCAAAAAGTGAGCATAATGACTCTTGTTTCTTGTTTGAGAAGTACGGTTACATTTATGGGCTAAGATTTGAAGTTTATGTGTTCTGAACTTGCCACCAAATCCATAACTCTTCTGAATTACTGAATTCACAACTACATATTTAATGGATTTCCACAAATATAGGTTCTTAGTTAAAGTTACTGGATTCGTCTAAACTCATAGCTAGTAATCTTGCTCCACTTCTGTGATTTCTCCGATGACTTGAATTGGTTCATATGACTTCTTCTCACTTTATCAGTCGATGATCTACTTCGTTGGATGCTGTGCAAGACGCAATGCTGCTCGTGAACGC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 22

Page 23: Introduction

GEN 3040

TCTAAAATTTGAATTCCGCCACTGCAGAATCCAGATGAGATGAGTTGGTGTTCTTCAGCTGATGATAACAGTTCATGTTTTTCAGCTATTTGTTGACAAACTTTGTAAAGCTTTATTTTAGTGAAAAAAAAATGATTCATGTAATATAAAAGATTTTTTTATGTTTAAGACCATATAATTTTGAAGTGTTTAAAAGTATACAAGCCTGACTCTCTTTGTGTGGAAAGTAAACCAAGATAAAATATTACGATAATTGAAATACAAGCAATAACAGATAGTAACATAAAATAAATGGACAAGGCCTAAAATACCCTTAAAGTATTGAAAATGGTACAAAATTACCCTTCATCCACCTATTGGCTCCAAAATGCTCTTTCCATCCACCCGTTGGCTCCAAAATACCCTTGTCATCCACCTTTGGATTCAAAATTGACCATTTTCAATTGTTTTAAGATTAAATTCTTTAAATATTTTTTTTTATATACTGGGCGTTCAACTATTAATTATAATTTTAATTTATTAATATAATTTATAAACTAATCCACTACCCATTCATTACAAACTAAACCTCACTCAATTAATAATTCAATTATAATATCAAAATTGTCATAAACACTACTAAAACACGATGAAATTATAGATTACTGAAACTGACATTCAAAATTATTCGAGTCCGAATCGAAGCCCCAATGAAATTTAGGTTGAGCCGCTTATTTAGGAGGACACTTTCTTTCAAAATTGAATTAGAAAATTATGATTAAAGGTAAAGAAATAATACATCCCGATTTAATTAATGCCTCTTATTTTAAATATAATTTTATAAATATTTATGATTTATTTTAAAACCTTTAATATATTATTTTGAAAAAAAGTTACCTATGAGGTAACATCACATAATTAAGACGTAAGAATAATTAAGATGAACATAGTCAGACTTCTAAGTTTATCGGTGATTTTTATGTAGCCACTTGAATGTATGATAATTTTCTTTTATATTTTTTAAAAACACTCAATTGGTAGTCGCTTGCATTAATAATTTGACGGGTTCATTAATTTAGAGGGATTTAATTAGTAATGGGTGTGTAGTGGATTGATTTATAAATTACACTAATAAGTTAAATTATAACAAATAGTTGAGCGCCACGTATTTTAAAAAATATTTAAATATAAAACCGTTAAATAAGTGGTCAACTTTGAACCAAAAGGTGGATGACAAGGGTATTTTGGACCTAATAGGTGGGTGGGAAGGGCATTTTGGAGCCAATAGGTGGATGAAGGGTAATTTTGTATCATTTTCAATACTTTAAGGGTATTTTAGGCCCTTTTCCGTAAAATAAAATGACAAGAAAATTAAATAAGTAATAGTACGACTACTAGTATGGACAAATAGAAAGACAACACGCTACTTTCTACTAACTTTCTATTATAATTCGTGTTCTCCACAAACTTCGTTTCTAAGGTCCTGTTCTCAGTAAGCTGGTATTGCATCAAGTTTTGTCTAAGTACGTATCTCAATACTCTTTTGACCTCCTTCTTCTCTTGAAACCAACTATAATAAACCCTTCTCACCTCCGCGATGGAGCATATGTGTATAATTTTTTTTTACATATCTGAATAATCTCAATCTCGCTTTTCATCTTGTCCTGTAATAATATTAATTTCACTGTCCAGTATATTTTCATTCCTAATCTTATTTCTGATAGTATGTCCACACAATTATCCAAACGTCCTCACTTCTATGATCGCTATCGAAGAAAATGATCCTTATAAGAAAATTATCTCTCAAACTTCAATTTTTTATTTCTACATGTTCAAACAATATTCAATCATTTTAACCTCGACAAATACGTAACTCTACCACAATCCACATTGTTTATTAAAAAAAAACATACTTTATTCTTTAACACATATTTATAAATTCAAAATTTTATCTTAATAATATTGATATTATTGTAAAAAATTATGTATTACATGAAGCGGAGAATACTAGTATCATGTAAAATTGTATTACTCCGTAATATATTTTTTTAACTCCCTAATTGGAAATTATTTTCATGAATTCTTAGGCTTGTTTAGTTGGGTTCACGCAAGTGGAGAATGCAATTATTATTAAAAAAAGTAAAAATTTAAGGAATCCCATAGTGTAATTCAATAATTATATTTTGTCCAGACAAAATTAGTATTTACAAAAAATTCCTTAAATTTGTTGTTCCGTGATACTTTAGACTCTAGTGATACATGGTAAATGATACATGGTAAGTTTAAACTGTTGAGAAAAAAATGGGGAAAAAACGGTATAATTAATGTTGTTACTAAAACAGCTTAATTTAATGTAACAGATCATTAATCACCATAAAATCAGCACGTAATTGGATAACATTCTCCCCTATGTATAAGGTAACAGATCATTAACATTTGGATAACATTCTCCCCTATATATTTTTCAAAGTCCTTTAGGAAATTGTTGTTAAAATTGACAGCAAACCTTATGGGATGTGTTGGTATCTTTTTAATCTTGTACTTCAGTGGCTGCAATTTGAAAATAAAACAGCAAATTATAACGATATTGAAATATAGATACTAAATAAAACTAATAACTTACTGATACATGGTTTCAGTTTTCAGAAATTCATACTACATGAAAAACATGTGATACATGTCTACTACATGTATCAGTGAATTAACTAAAACATTATAACATGTGATACATATCTAATGCATGTATCAGAAACATCGACTAAACAATATACACACTGATTCTATATGTATCATCAAGTACAGTGGATGAAACATTTATCAAACTTAGTGAATGATACATTATAACAATTTTCAAAACATGTATCACTACATCAACCAAACAACTAATACCTTACTGATTTATGAAAGTGAATATCATATATATATATATATATATATATATATACACACACACACATCATTTGAACCTTTTGTAAGGTGACCGTTCATTTTCATTTGCTTGCCTCCAAGAACAAGAAGATTACCAAATTCATCTGCTTGCCTCCAAGAACAAGAAGATTACCAAATTCATCTGCTTTTTGGTTGTTCGTTTAATCTTTCCTCTGATTGGAATTTGTTTCTTCGGTAACTGAACTTGGGGGTTTAGGGTTTTTATTTTTGAGAGTTGTTAAGCATGTGGGTTTACGTTGAAGTTGCAATGGCGGAGTAAGTAATCTTTAAGAGAAGGAAGATGATTATGAAAAAAAAAGAACCGAAACTTGATGGAGTAAGAACAAAACGGGAGAAGTTAAGAGAAGTTGTTCTGATTTTTGAAATTTCAAAATTTTTTAATTTTGAAATTCAAAATTTCAAATTTGGGTCTTTTATTTAAAATTAATTAAAATTAATAATTCAAAATCAAAAAACTGATTTAAAAGGTTGAGTGTTGAAATGGAGAAAATTCAAAATCCAAAAACTGATTTAAAAGGTTGAGTGTTTTAATGGAGAAAAAATAGGCATTAAAATGGATAATTGTGTATTATAGGAGAGAGAATGTAATGTATCTATAAGCTTACACTAAAATTAAAAAAAAGGGAATTATGTAATATTTAAAAAAAGAAGAAAAAATTAGAGAATATAAACTTGTAGTTGTGTATTTAAGTTATTTTTCCTTAAAAAAATTGAAATATTTTTGGTATATTATATCATGTCTGAAATCCCAAAAGTTGGGAAAAAATCATGTCTTAAATCCCAAAAGTTGGGAATATTGGCATTTGGACCTCTAACAAAAGTACATAATTAATGTTTTTATTTTACTTTATGCAAGTGCAGAATATTATTTGATTATTATAATTAAAAAATCAAAGATTTTGGTGTAAAACTATGTTTATTATAATTATATTCACACTTACTAGTTACTAATCCTATTTTAGCAAAATCCAAAAAAGTATACAAATTACTTTTAATCTTTTTAAATAAGCAATCCCATAAATTGGTTCTCTCCATTTTATATTAAGTTAATTTTTGAGTTGC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 23

Page 24: Introduction

GEN 3040

TTCATACATTTAAGAAAAGTAAATTAAGACGTAAATTGAATCTAGTTTTCATGTTTTATCTTATTAATCATTGTCAAATTTATAATTAAAATGACTGAATGTTAATTACTGTTAATTTCAATACTAACTAATAAAGGATAAAATTTGAAGAACACTATAAAAATAGTCTTAAAAATTAAACAATTAAGTTAATTTGAAAAATGAAAAATGAATCAAAATTAACTTAATATGGAATGAAGTGAGTAATATTATTACTCGTTCCGTCCCTAATTACTTGTCAAATTTTCCTTTTTTATTTGTTTATTTTGACAAATCAAAAAATTTAAATAATAGAAATAAAATAATAAATTCACTATGTCAAACATTATTTTTCTAACCGGTGAGTCAAATGAAAAAATAAATAAATACATAATTAGGAACAGAGAATACAAAAAAAAAAAAGGTCTAATTTAATCTCAACCGTCAAAAAGCGCGGGAAAATTAATATCCAAAGGCCAACAAAAATGGCCTTATAAATAACACCATTCTCACTTCATTTTTTTCACAAAAATACTGCAAAAAAAAAAACCCTAGAAATCTCTGCAATGGCACCAAAGAGAAAGGCAACTGCAGGGGAGCCATTGCCAACTCGTATAACTCGAAGCCAAACCAAGAATGACAACTCGAAACCCGTTTCAGTTGACCCGGTTTCACCAAAACCAAAACGGGTCAAAAAGGGTACCGTTTCTGAGACGAAATCTAAAGTGAAGGCTCCAGTTGATTCCGGGTCGAAGAAGATTGTTGTTGTTGAGCACTGTACCCAGTGCAGGCAGTTCAAGATTCGGGCTGTGAAGGTGAAGGAGGAGCTGGAACATGGGGTACCCGGTTTGGAGGTTCGGGTCAACCCGGAAAAGCCAAGGAGGGGTTGTTTTGAAATCCGGGTCGATGATGAAAAGGGTGAGAAGTTTGTTAGCCTTTTGGATATGAAGAGGCCATTTGGACCCATGAAGGCACTTAATATGGACAAGGTCATTTCTGAAATTATTGAGAAAATTAAGTGATTCATTTTAATTTTAATTTTATTTTGCTTAATGGGGATTTAGGGTTATGTTTGGATTTTAAGGGTATGATTTTGTGTTTTGTATGACTAAAGTATTTGATATTTTCAGGTCATTTGTAGCCCTTCGAAAAATATGGAAACAATCTGTTGTGAATTTTGCCTTTTGTTTTCCCAAAACTATGGTTAAAATGAAATGATTAGTGTGGTTGAAACTCTATTATATTCGAGATTTGAAATTTATTGATTTTGAATGTCTTTGGATCAATGTCCAATTATGATTGTGTTCACAACCAAAATTTTATGATGTTATTGGATGATTAATTTATTATATCAACAAGGTTTGTGTTAGGTTAATTGGATTTTGTGAACTTTAGTATATAGGAAATGATTTTTTCATTAATGAATAAACTTGTTAGGTTAACCAAGCCATGGTTGCACAAGCATCAATTGTGTGAGCATGGCGTGTTGGCATTGGCGGAGCCACCTTGGTACAAGGGGATCATCCGAACAAACATCTTTCGACGAAAAATTATACTATTTATATATGGTTAAAATAATATTTTATCTATATATTCCTTTGACTACTCCGTGTGTCTTGTTTTTATAAATTTTGAACTCCTTTATTGAAAATCATGATTCCGCCCGGTGTGTGTTGGCATTGGACTTTTGAGAAGTATTTGGAAGATGAGCAATGTGAGAGAGAATATATTCTAAGAGTATCAAATAACTTTAGCAAGTACTTGTGGTGGTGAATTCAAGGTGTGTGCCACTCACAAGTTCTATAGCCATGAGTGTGGTGTGTTGATATTGAGCTTGAGCTTTTGAGAAGTAGCATGGGTGGCACTCATATTTTTCCTGAGTTATAAGTGTAGTATATTGATGTGGGGCATTTAAGAAGTATTGATAAAGGATCAATCTGAGAAGTAGGGTTGTTCATGGTTATGGTTAACGATCAAATCAAACCGCAATTCGAATCAAATCGATATCTGGTTTAGTTTAGTTAGGTTTGATTTTTAAATTTTGGAAACCGATACTATTTGGTTTGGTTTTGGTTTTACTAAAAAACAACTGCAAAAATAACCAAATCGAACTGATAAATTAGATATATAAATTTTATAATTATTTATATATTATTCATAAATAAATTAAATATATTGTTTAATTTTAATTAACTTAAGGCTTTAACTTTACCATTTTTTCAAGTCTAACACTTTAGCCAAGGTATAACAATCTCAAATCCAAGTCCATCAAAATCCGTTTTAGTCTTTACTTATTGTACCTCACATAAAATAGTCACAGTTGTCCTTTTTGCATCTAGGCCTACACCAGTTTATCTTGCGGCCTCATCATATTTGTATTAAATACGTGTATATCATGTGAACTTGAATATATGCCTTATAAGCAATTGGGGATCTCTACGAGGAGCAATTACAACTCTAAGTTCTAATACTTTGCACCACTTCTTTGTTTATATAAAAGTTGGAGATATTGTTACCATTTGACAGAAAGGGCAAGTATAGAAAAATATACGACTCGGAATTTTCATAAGCAACCTGGATTTGATCATTTCCAATTGAAGAATTGTAATTTTCTTTCTCAGCTACATCTTTACATTAGTTTTTATTCCATACAATCTAATTTTTATAAACAAAATTTGAAATAAATATTGTTGTACAAATATTACGCCTCATAAAACTTACCTTTTTAGGTCGCAGATGAAACCTAATATCTCAAATTTACAAAGAGATGACATTCCATTGAACATTAGTAGGCATTATGAGTTTGGAGATGCACTCACAAGAAAGAAGCTAGTTGTGGGCTTCGATAGCCTACTAAAATGATACGAGTGATACACAGTAAATCTCAAAAATTGACTCGTGGATAATGGTGATGGTTACGTGAAACTTAGTTCAAAGGGGGTGATTGTCAGGTTTGCAAACAAAATTTCATTGTTCTACATGAATAGCTAAAAAGATTGAATAACTCATATAAAGTGCATAATCTCTTATTAGTGTGAGGTTGACTGGGGAAAAACCATTTGAGTTTGGCCCAAAGTAAACAATATAACACCATGTAAAAGTATCTTTGGACTGGCTAAGCGCAACGTATAATTTAACTTTTTTTTACGTTGAAAATACAATAGTAAGATAAATAAATGATATAATGTAGGAAGAGTATTTCAAAGATTACAAATAATATCAGCAATAAAATCGTAAAATAGTTTATAATTAAAAAATGTTTATTACGTGTAAAATCTTAAGTTTAGATAATCTTATGACTTTTGACTTATAACATTTTAATATTTATCAAAATAGTTTATAAAATTAGTCAAAACACATTTATAAGAAATGACTTACTTACTAGTTTACCTTATCTGCATCATCAAAAAGATGTCCAAAAAAATTATTTCAGATTCTCCTAAATTTATTCCCTTTTTAAAGAAAGCCAAAATGTAGAAAGTTAATGGTAACATTACTATTTTTAATTGCACCATTTGAATAAATACTTCCCCATCTTCTACTAATATACTAGTTTAATTTCAGATTATTTGCATTTTGTTACAAAAATCATCTCCAAAGTCATGTCATATCAGGTAACTTTTCAAAATTCATTTTCGTTAGGTTTGAATTTTTTACTCGTATCTATTGATGATGGGAGCTGACAGATATTCTATAGAATTAGTTAAGTTCTGTAAAAAGTTAGATCCGAATAAACAAAAAGATTGCTTTGAATTTTTGTTTGCTAGTATAAATAATGTTGTTTATGACTTTTGAATTTTCTTGTGTGTTTCCATGAAATTGAGGTGATCATTTATATATATTTCTATTCCAATTCTTTGTTTGGGTCATTTATATATATTTCTAATCCAAAAGTTTGCTTAATTATTTCTCAAATTTATGTAGAAATCATTGATGCATCTGAGAGCTAAAGTACAAATTGTGGTTGCAATT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 24

Page 25: Introduction

GEN 3040

TCACTATTGTTGAGCTATGCCAAAATTGAAGCAAGACAAAGCCTATCTATGGTTGAAGATTTGGAATTGGAGAGGCAATTGAAGCTTATAAATAAGCCAGCTATCAAAACGATTAAGGTACAGTCTCGATTTTTTATATAAAGATTAAAAGAAACATCTTTTGTCTTACGTGATGGAAAAGTAATAATGATCACAGTATAAATATTAAATCTAATAGTTTGTACTACATCTATTTTTTTTCTATTAAAAAAAATTGATTGGAGACTAATTACTAGTACTATTTGATAAGCAGGGGAAATATGGAGAATATATGATTGTGTAGACTTTTACAAACAACCTGCATTCGACAATCCTCTATTGAAGAATCACGAATTTCATCCTGAGGTATATCTCTACCTTAGTTTCAAGTTCGTACAATCAAGTTTTTATAATCAAAATTTGAACCAAATTTAAGACACAATTTCTTTACCTCTTAAGATAAGAGTAAGGTCTGTGTACACTCTACCTTCTACAGATTCTACTTGAGAAAATTCACTGAAAATATGATGTTGGATACAGATGAAACCTACCTTGCCCAATTTACAAGGAGATGACACTTCATCAAACATTAATAGCCCTTTTACCATAGGATTAAAAGACGGAGGTTGTCCTACTGGAACAGTTCCTATCAGAAGAATTACTAAAGACGATCTAATACGACAAAGATTCTTATCACAAATTAGAGGCGCGGATGAATCCCCTTATGGTGATGTGAGCTTGCTGAATTTCTACAACTTCTTTTCGAATTTCATAATTCAAATTCTTTTATCATATTTTTCTTCAGGATATGGAGTTCAGTGTTTTTGATGGAACCAACTCAAGTATACCTTTTAAAGGAGGACGCAAGGTTTATATACTTTTATTATTCCATATTCTATCCACAAAAAACTAATTTTTTCGATATTTTAACACATTATATCCACCCACCTCCCAAAAATAGCCAACATATGTATAATATATGTATATACATGTATAGCAATGTATAATTAGCTTAAAAAATGTTTTGTTTTTGCTTTGTTTCAGTATGCAACAGTGCAAATACCATACAACTCAACAAACAAAATAACAGGAGCTGGGGCAATTATTAGCTTACATAATCCTCAAAATCTTAGTGGACATCAATTTAGTGCTGGTCGTATTAAGGTTCAAATTGGAATCGAAAGTATACAAGTCGGTTGGATAGTAAGTTTTTTGTCTACGACTCGAGTAATTATATGTGTGATTGCATCTACTGACTCTAAATCCTGCATCAGACTAAATATGCTAATCGAAGTTACCTAATAAATTTATATTAGGTGAATCCACATGTCTATGGAGACACTCATACGCGATTATACATATATTTAAAGGTAGTTTTCTTTAACTTCTTAAGAATTAAAACTAATTAAGATATAATTTTCTTGAAAGTCGATGTATTTATTAATAAATTTTTTTATGATCGCAGACAGAAAAATTAGCATGTTTTAATACACGATGTCCTGGATTTATTCATATAAATACTGCAATACCTTTAGATGGAGATCTTCCTGCCTCTACTTATGGTGGACCTATTTATGATGTACCAATGTACATTGCTCGTGTAAGTGTGCATCTTCCCCTTTTTTTTATAAGTTTTTCACTCGATATCTGATATTTGTTTAAGGATCTTAATTAATTGAATTTACGTTTGAGTAGAATTTATTAAAAAGAAACTTTTTAAACTATTGCATATAGATTATTTTTTTGAGCGAGTGGTAAATTTTATGAGAAAACGTTAGACTAGTGTCGAAGGTTTGTAGACATACACTACACAACAATATAATGTTTTGTGAATTCCAAGAGCTATATTTGCACAACTTTCAAAGACAAACAAAATCTAAAATTGTGGCATGGAAAGAACATTCGTGTAAATCTTCCTTCTTCTTCTGTTGTTCATATTCTGAGTTCGAATTTAAAATACTTTAATTAAAGGTGAAATAATTTTTTTCATCAGAAAACGTTTTATAATCCAATTATGATAATTTTTTTAAAAAATAATGAATATGTATATTCATGTAACATGTTATAACATGTTACTTATCATTTATTCTATATTACTAGTGAATGTTTATTATATACAATAACATGTAATATCTTGTAAGTGATTTGATTAGATAAATAATACACACTATCAATCCATAAACTTAACCTTCTTTTATTATTATTATACAGTATCTGACTTTTTTTTTCCTTTTACCATTTATAGGATATGTCAAATGGAAATTGGTGGTTTAAATTTGGAACAAACTATACATCAGTTGGATTTTGGCCATCAAAGATATTTACAAAGTTAAATGAATTTGCAACAAGTGTAGAATATGGAGGAATCGTATATAGTCCACCAGGTGTACCAGAACCTTCTATGGGCGGTGGCTATTTTCCAGTTGGAGATTTGAACAAAGATGGATATTGTAAGAATAGTACCTATTTAACAGATAAAAATGAAACTAAAAAATCTTTAGATGATATTGAAGTGAAATTATATGCAAATAGTCCAAATTTGTATAGAGTTGCTGATTTTCCAAACTCTGGGGTTGAATCTGGTAATTTAGTCCTATATGGAGGACCAGGTGAACATATGTATAAGAAACATAGAGATATTTGATATATAGATGCGGTGTTTTAAGATATTTCGATATTATATTTATGAATTATATATGGTTTGTTCATGCAGTGGTTCTTGGTATATAAGTAAGTAGAATGTGTCAACTAGTGGTGCATTTTAAAGCCGTTCAAGTCGAGGTTTAGAACGGACAATATCACTAGGTGGCTGGACTGTTACAAAATCATTAGATTTTGTTCAAATCTCGTATTTATATTTTTAAAAATCAACTTAATTCTATAAATATTTAATTATATGTCCAATTCTTTTGTCTGATACGAACTTACATGATCATATTAGAAACTCATAAACTTTAAATTTTGTATCGAACATTAATGAAACTTTTAACTTTTTTCTTCTTCTTTTTGGTTATAATATAGGACATGTCTAATGGAAATTGGTGGTTTAAAAAGGGACCAAACTATACATTAGTTGGAATTTGTCCATCAAAGATACATTAGTTTGATAGTTTAAAATAAAATTTTAGATATTCAAAAGCTATACAGAAAGTATTATGAATTATAATTTTTTTTATATAAATATAATATATATATATATATATATATATATATATTAGTCAATTATTTGACTTAAAGAAGATCCTTTTATAAGAAATCTAAAATAATAATATAGAAAATTAATTGTAACATTTGACTTTTACACTATTCAAATAAATACTCTCTCGTCTTCTACTTAGAGTCTGTTTGACTCAACTTAAAAGCTGGTCAAACTGACTTAAAAGTTGGTTTTTGACTTATTTAACTGTTTGGCAATACTCAAAATAACTTATTTTAAGTTTAAAAAAAACTTATTTTAAGCTAAAAGTTAAAAGCTGGGGTAGAGGTGCTTTTTTTTTAAGCTTATAAGCTGTTTTAAGTTGACCACATTTTTATCTTTTTGCCCTTAATATTTTTATACAATCTCCAAATTACCCATATAACCCTAACATCTCTTTCTTCCATTTTTCCCTTTTCACGTTTGGCATAACAACTTCACCACTATTATCCAAACACATAACTGCTTATTTTAAAAATAAGTTTCAACACTTTCAAAAGTACTTTTTTAAAGCTGCTTTTATTAAGCCCATCCAAACTGGCCTTTAGTATAAATGTATAAGTCAATTATTTGACTTTAAGAAAATCCTTTTATAAGAAATCTAAAATAATAATATGGAAAATTAATTGTAACATTTGACTTTTACACTATTCAAATAAATACTCTCTCATCTTCTACTTAGTATAAATGTATAAGTCAATTATTTGACTTTAAGAAAATCCTTTTATAAGAAATCTAAAATAATAATATAGAAAATTAATTGTAACATTTGACTT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 25

Page 26: Introduction

GEN 3040

TTACACTATTCAAATAAATACTCTCTCATCTTCTACTTAGTATAAATGTATATATGTTTGTTTAAGTTCACATGTATTTTTATTTTGCATCAAATTTCAACACCATCGTCATGTCAAATCAGGTAAGTTTTCAGATATTTATTTTCTTAAGCTTTTTGAAATTTCATATCGCTTAATTTTATCATACGTTATATTTTTGTTCATTCATATCTTTGATAACAGTAAATCGTTTCGTGTTTAACCTTTTGTCATTGAGGAATGCTTTACCATGAAATGGATCTACATGTGCCCAATTTTTTCGGGAATGAAAATGTAAATTTACCCTTTACCTTTAGACTGTAGTATAAAATTATTCTCTAGTTGAAATTTCGACAGAGGTTAAGGGTAAACAAGACTTTTTCCCTAACAACAGAGGATTTATATTATATAAATTAGACCAAAAATTTAACACAATGTTGATAAATTACATCTCAAATTGTAGATATTACAAATTGTTGTTGCAATTTCACTATTGTTGAGCTATGCCAAAATTGAAGCAAGACAAAGCCTATCTGAGGTTGAAGATTTAGAATTGGAGAGGCAATTAAAGCTTATAAATAAGCCAGCTATCAAGACTATAAAGGTGATATATATCATAATATTTGAAGCATATCAAAATTATTTATCAAAACGTAGGAAAAAATAAATAAATTCACTTTTTTTGGTTGATTTGTAGACCAATGATGGAGATATATACGATTGCATAGATTTTTATAAGCAACCTACTTTTGATCATCCACTATTACAGAATCATGATTTTCATTCCGAGGTGTGTTTGTATTATATACTTCTATATTCAATTTTAATTAAATTTTATGACAAACATTGATGTATTTTAAATTTGACGTAGATGAAACCTACTCTATCCAAATCAAGAATAGACGCAAAGGCTTTGAGTTTCGAAAAACCTTCAAAGATAGGGTTGAAAGGCGGTGGATGTCCTAGTGGAACAGTTCCTATTAAAAGAGTTACAAAAGAGGATCTCATTCGACATCAACATATGCAAGACATGAAAAAAAATCTTAATGATCATGTAAGTTCACATATTATATTTATAATGCCATATCGATTTTTTATCGAATTCATCATATAAATATGTAATACTTTATATCGCCGATATATAAAATATGATTTATTCTTTAATATAATACCCCACCCCACCCCCCTCAATTTTTTCCTAGTTTTGAATCTCTAACATAAGGTGTACTTGTACTTCATGTAGGGGAACAACATAACCAATTCAAAGGCCTCAATTTCCCTAGATGAAATACATCCTTTCCAGGTGTGTATTGAACCCCACCACCCCCCACCCCCCCACATCCTTTTTTTTTTCAAAATAGATTCGGACACTTTTGAACTTTATTCAATGCATATCTAAACTTGATGTTTGCCTAATTAAACCCGAACATGACAATTTGATAACCTCGTCAAGTTCAGGATAATTAGACCAAAGTAAAGTTCAGATGTACATCGAATAAAAATCGTGAAAAGTGTCCACTTTGAATCATTTGCATATAAATTATCTTACTTTGGGATTATTATTTTATTATTGATTTTAGTATGCAGAAACAGAAATTCCAGGTGCTGATATAAATGTACTTCGTGGAGCTGGCATGATTACTACTATACATGCACCAATGAATGTTCAAAAAGATCAGTTTAGTGGAGCTCGTGTGAGGCTTGAAAATGGACTTAGTGATGCCATTGAAGTTGGTTGGATCGTAAGCTCACTTCTACTTCTCATTTAAAAAAAAATTTAACTTTTATGGACTAACAGTATTCGAAAAGAATACATATATCATCGACGATTATCTGTTGTTACAAGTCAAACTTATAAACTTAATCAGTTATACACGCACGCTTATATATATCATTTTATGCAGTGACAGTGTTTAAAAATAAATACATATATCTTCGACGGATATTTTTTGTAATAACTCATCTTATAAACCTGATGGTGTACTAGAAATTTGTATATATCATCAACTGTTATACATATTATCATTTCATACACTGACAGTCTTTAAAAAGAATACATATATCATCAACAGTTGTATGTTGTTACAAGTTTTCTTATGGTGTACTAGAAATTTGTATATCATAGATTGTTACACATATATCATTTTATGCATTGACAATGTTCGAAAAGAATACATTTATCATTGACGATTATCTGTTGTTTCAACTCATTATTATAAACCTAATGGTGTAGAAATTTGTATTTCATCTACAGTTATACATATATCACTTCATATACTGAGAGGTTTGAAAAGAGTATATATATCATCGACAGTTATACGTTGTTACAAATTATCTTATAAACCTAATATGGTTTAGAAATATCGAGAAAATTACTAAATATCTTTTTTTCTAAGTTGAAAAAAAATACTTGTTATTAATATTATTGTTATGCTAAACTTAATGAAGGTGCATCCTACTCTTAATGGAGACAATACGCCAAGACTTTATGCAAGATTTGAGGTAATTATGAACAAAACTTAATTACTTTGATCAATTATTTAAAACTTTTCTTAGAGACTATAGTGTTTACTAATTATAATGAAATGCTTTTTTTTTAATGTATTATTATAGGTTGGAAGTGCTGGTTGTTTTAATACATTATGTAGTGGCTTTGTCCTAGTAAACACAGATATACCTTTAGGAATGCCACTTGTACCTTCAAGAATTGGTGGACCTATTAATTCACAAATAATGTATCTTGAACAGGTAATTCATATTTATTTGTTCTGTTACGTTAGTGAAATACGTTTTAAAATTTCTACAAAACTTGACGTTGATAAACGCTATTTACGAAATGTTCCTTTTATGTATCTTACTATATGTCAATATAAACTTATATAGGATGTAGCAAATGGAAATTGGTGGGTTATGCTAGGAGAGGATTACAAACAAGTGGGATTTTGGCCTAAAAGTATTTTTACAACATTACAAGCTTATGCAACAGGTGCTAAATATGGAGGTATAACATATAGTGCACAGGGATTACAATTCCCTCCAATGGGTAGTGGACTTTTTCCAAAGAAAAATTTGTTAGAAAATGCATATTTTAGAAAGTGCACTTTTTTGTGTTATGTGAATGATGAAATGGTAACATACTCTTTAGATCATATTGGCACATATCCATTTCAGAGTAACACAACTATATATACAGTTCAAGATTTTATTGAGCAAGGTGATGTACTTGGTCATTTGATTGTTTATGGAGGACCTGGTGGATAAACATTGATGTAATGTTGACATTATGAATTAAAAAATATTTGGACTTTTATCTGTCATATAATTTGTATTTTGCATTCATATTAAGAAAATAATGATTATTTTATGATATTAGGCAGGGGCGGCTCAACGTTATTGGAGGCCTAAAGCAAAATTTCAAGTCGCGACCTAAAATATAAATTTTTTTGAAAAATAACTTGCACTATTTAGATGAAAATTATTACTACTTAATGTTGTTTTTTCAGTTATTAGTTTTAGGTAAGATTTTATCAACTCAACATTGAAAAACATCCTTTAAGTGAGACAATTATTATATGTATTGTTAACATAATGATACAAGTAATAAATTTATAAATATTAAATAATGATTAGAATTAGAATCATAAATTTAACTCAAAAAAGTTGATGATTTGGTATACCTCATTTTAATATGCTTAGAGTAATGCTTGAAACCAAATTTTAAAATAAATAAAGAAGAATTTATAATTCACTTTGTTCAACATTTTTCATTGTTAATTCTTATTTTTAAGAAAGTACAAAAGATGTTAAAGTGGAAAAAATAATTTATTTTTAACAAAATTATACTTTTTATCATAAATAATTTATTTATCTATAAAAATTTTAATACATAATTTATTCTTGACAAAAATTTGAGGCCCCCGAAATTTGAGAGTCTAAGGCAAAGGCTTTATTTTTGAAAGCATAGAGTCGGCC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 26

Page 27: Introduction

GEN 3040

CTAATATTAGGGGTTCATTCGAACTTCCTTCGAGGAAAAATCATATAACATATACATGGTTAAAATCACTTTTTAGGTATATATAAAAGATGTTGAACCCCCTTCGACTACTTTATGTGTCTATTTTTTCTGATTTTGAACCCCTTGTCGAAAATTTTGACTCTGCTACGATATAGTATTGTCTATTAATTGATGTATAGTTTAATGTTAAGGGTAAAAAGTGTAGAAAAAAAATATCTTTTTTTTTAATATGTTAAAAATCACAATAACTAAAATTGAAAGAAAAAAAATTGAATAGTGAATAAGTAAAAATGAATGAAAGGATCGAATATATTGAAGTTTTTTGTATTACTTTCCATACTTTTAAAAAAAAATTAAAAAAATCTCCACACATTGAAGTTTTAAGGGAACATTATAGGATATTTTCTTGATATTAATTGGGTAATAGTTCTTGTGTGAATTTTAAGTGTTAAATGTTGTTGCTCAGTGAAGTGTAGGGAGTAGTCGTCTTGTTATTGATATGTATGAACATGTCAGAATTCGATAAATAAATAGAAAATAAATAATTTTTAAAATTGCACCGTAAAAAAATCAAAACAATTAGTGCCATCAAGACTTTTTGAATTATTAGAAGTTTATTATTTCAATGAATATAAGTTTCATGGGAACTTTGGAGGAGATTTTCTTGACCTTAATTTTGTGGAATAATTAAAAGTTTATTATTATATTTTCAATGAATTTATGTGCCCTCTAGAAACCATATCATTTAATAAGTTGATTATCGACTAGGTCCAAAGAGGCCTACGTCATGTTAGAAACAAAAGTCCTCTTGAATTAATTATTTAATAATTAAATGTTTTGAAAATTATGAGAAAAAATATTTAGTCTAATCATTTTGAAATAGAAAACGTATAATATGATTAATTTGAATTTAAATTTATTAGAACATCATCTATTAGTAATTTTATTTATTTTTCAATTGTATATAAAATAAAAATTAAAAAATCTTATCTATTCAATTATACTATTGATAATACTTAGAAAATTGACCCAAAAGAAAACGACACTATGATCGATCTTATCATGAAGATTTCTTATTTTCTTAAATACGCAAAATATCATGATAAGACATTCATAAATCCAATATAATTTTATTCTTTAAATTCAAAAAAAAAAAGTTCATAATTTCAAATAAAGAATTTAATTGGTATATGTTACCGCATTTCATATTCTTTATTTCCCTAAACATTTATATTTAAGATCTCATAATTTTTAATTTGCATAGTTAATTACTATAAGCTACACTGTTTTGATTCTTAATCATAATTAACCACAACTTTATAATGCATATTGTATTCTTTTTAATTATTTGATAACTATTTAAATTTATGAAACTTTTATACTTTTTTAACGTGGAACGTGAAACACTAAAAGATAATAGAGATAAAAAGAAAAAAAATGTTTGATTGTTCATAATATTATTTTATTTCTTCATTTACTTGTTTTAATTTTTTTTTTCATATTTAAATTTAACATCAAATTACTTTTAAAAAACGTATAAAATAGACAATTAAAAATAAACAAACAAAAGAACGGCAAGAACATGAGACAGTTGCTGGGTTTGCACTTGCACCACGTGCCTATAAGGATGGGCTCCTGAAAAAACTCTCGTTAAAGTCCAACCAAACAATTAATATTAACACGACGATAAAATTTATTCGTATCAGGTGTTTAATTATTTATATAAATATAAATATAAATGTTAAAAAAGATATTGGGTTGCTCCTCTTTTAGTTAGAGGTTTAAGATTTCAGCTTTGCATGAAAAAATTTTAATTTTTTTTTAATAAAACTATATATAACTTAAAATTTAAAATAATCAAACTCTAATACGAATATGGAACCGAAAGAGTTTGTTTTTACCAAATAAATGTCTACACAAACCCACCAAATTCTTGTCAGCTCTTCATAACCTCCAAGCAACATCTTTTTTATTCACACTGCTATTCCCATCTTTCATAAATGACTATTTTACCCTTTTTAATCTTTTCTACCAATACTAAAATTAATATATAAATTTTTTATGGACGTTTCTATTAGAAATATGAAATTTTCAATGAAAATAATTGTCAAAAACGTTTTTGATAAGTATTAAAAAATTTTATTTCATTATTATTCTAAAAATCAACTATATCTTGTTTAAAGTTTCAATTTTTCTCATATGTTCTAAATATGTGAGATGTACCTTTTAGTTGATCTGTACAACGTGAAATCTAATTAGTCGAGTAAGTTGACTTCGAATATCATATGGTAAAACAAAAAGAAACTTCCCATAAACTACTTGAGAATCATAAAAAAAAAGGAAATATTATGAGGGTATATATGTCATTTTAAACTAGACAATATTAGTTTTTTCTGGTTCTTATCTGCACCACAAGTTCTTTAAATTAAGACACATATTAATATAAAGACTTATCACCATTCAACAAAAGTTCTAAAAAAAAACTCAAAATTAGACCAACCTAATTATTTCCCTCTCTGTCCATATTTTCCTAATATTTCTTCTTCATATAATCATCATGAAAATTCAATATCCTTCTTTTAAATTTCTGCTAATATTAGTAATTATACTCTTTGTTAGTCAATTATCAAATTGTAGAAAAATTCAAGAAATCACTCCAACAAGCATAGAACAAAGATTGAATTCAAGAATTTCATGGCATAACCTAGCACCATCTCCAAAAAAATCAAGAAATGAAGATATAGACAAGAGTTATAGAGTATCTCATCGAATCGTTCCCGGAGGACCAAATCCTCTTCACAATTGATCAACTTACGTTCATCTCCAACATTGATAGCTCTGTTCATCGAGACTTAGCATGTAAGAAAGAAAATTATTAAACTTTTTTTCATCTTTATTGGGATTTGAGTGCTAGCCTTTTAAGGTGTTTATCGACTTTGTTGATTGTTAGGTCACAATCTTAGGTGCCCTTTTGTGATCAATTGATGTTGACTTCTTCACATTTCACCAAATTCCCAAGTTCTTTTTGTAGCTAAGTTTGATATTATACAATTTTTTTCCATAATCTTTGTATTGTACATGGAAATCAAAGACGTATACTAGGAAAATCATGATATATTACTTTAAATTGATTCAATTACATTTTATCTTCTCCACTTTCTTTCTCTTGTTTTTTAAGTTTTTTAATTGCATACAAAACGTCAAAATCAACATAACTTATAAAGTTATTGTCATGTGATCAGAAGGTTTCACGTTTGAGCGTTGAAAACAATTTCTTATAAATTGTAAATGATTCTTATAATTTGACTTCACAAATAGCGAGAACTAAACAATTCTTATAAATTGTAAATGATTCTTATAATTTGACTTCACAAATAGCGAGAACTTAACATATTGACCGATTTACGAATAGTAGTAGTTGAAGAGCTACATCATTGCTCATTAAATTTCAAATATTTAAATTATGCAAACAAAAGTATAGTGAGATAAAAGTGTGAAAACATACCAAAAATTAAACCAGTTGTTGAATAGAACTACTAACCAATCAATACTTAGATCAGCCACCGACTTATTACCAAGTGTTTATTAAAAAAATATTTTTTTTTAAAATGTGAAATATATTTTTTTAGAACAAAAAATTGCAAGTGATGTAAGAAAAAGTCAACTCATAGACAATTAAGTAGGTAGGCCGTGGACAAATTAAGTAGCAAATTATTGTTTATTAGTCTCTATTCAAAGCATAGTGCTTAATTTAATAATGTTTAAGTAGTACAAGCAATTTTATATTGGTGGGCAAAACCAGCTAGTTTTCTCATGGATATAAAGACTAAAGAGTACTTATAAAATAATATAATCCACATTATATTAAATTATAGATATTTAATTTTTATAAAGAAGAGAAGGAAAAAAAAACGTTAAGAGTTAAAGATCGTAAACAAATATTGCGTAATTTAATATTTTGGTGTCCTA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 27

Page 28: Introduction

GEN 3040

GACATAAGAAAACGATATCGATCCATATTTTAAGAATCAATAAAATAAGTAAAATCATGAAAATTAAAGTTTAAATTTGAATTGTAAAAAAAATGTATGTACGAAATTGCTGAGGACATATAAATTGACTGAATGCGATGATTATAAAATATTAAAAAGATTATATAGATATTAAATATAATAGTTGAGTTGTACACAAATTAATTAGGATATTATTATTATTATTAAAGAAAAAAAGAACAGTATTAGTGCATGGTTACTTGATGAGTTGTTAATCCCCCTCCCTATAGGCTATATATATATAATTTATTATTTTGGCATAGAAAAAAACAACTTTTTTTTTGGAAGTATTTGGTGTCGACTTCAATAATATGGACAATTAATTAATTGTGATGTGAAGTGATTAGTAGTTTAATATGTATTTTGATAAGTAACTTCAAAGAGTACTTAGTCTTCCATGAGAAACACACAAATTAAATAATACAATATCATTATCATGCAAGCGTCTACAAAATGGACTTCTTCGACCCTATCAAATTAAACTAATTCATTATAACAAGTTATTATTTTTATCGAGCCTTAGGGTGTGTTCTGAAAAATATTTTTTTTCAAAAAACGTTTTCTAATCTTTTCATATATCCTGTTGGTCAATATATATATATATATATATATATATATATATATATATAACACAAATTTCTTGAAAGCTAGAAAACATATAAGAAGTAATTAAATAAATTAACTAACCAAATTTCATCGGTGGCATTTCAAATTCATTGTCTCATTTTCATCCATCCGATGCTTCCAACCCCTCAACCCTTGACGACCATTCCACCTCTCGAATTTCGTTATTCTAAGTTATAAATGCTTTTATAAAATAATATAAAAAGTCACTCACTTATCAAATACTAGCAAATAAATAATTATAAAAAACTCATTTATGTTTTTTAAGAAAATGTGGAAAAGCATTTTCTACCACCTTTAGTCATAATATAATTTGGCATGGAAGAATTATAGAATATGAGACTAATGTATCTCAATTTGCTTTCCCACAAAATGTTCAAGCTAAGCAAGAAACTGAAGTTTAAAGTATATTATATAATTAATTAAAGTTCCAATATTATTATTGACTTTATTCACAATACATATTTCTTCAAAGATAGACCTAACACGTAAGCTACTAAAAGTCAAGTCATGTTTGCTGACTAGATAAAATAGAAGTTAAGGTTTGAGAGATTTTTTGTTCGAGCCTTCGAGCAGTTATCTAGACATCATTTTATCCTTAAATATTAAATTTTTTTATGCAAATTTAAGTTTAATCAAACTCTAATGTAGGTACTACGTACTAAATATTAATAAAAAATCAAACTAAAAGAGTTCAAGTTGCTGAAAAAAATGAAATTGAAGATTAGATTTGATTTTTCTTATATCAAATCAATATTTTATTTTAATAATCAAAAATTGAATTGAGAATATATATCAGGTATAATATAACTTTCGTAAGCTAAGTCATGATAAACATATGTATGAAAGACGTATTTTGCATTCATATGATATATGTAAACAAGTTTAATTGGTACAGTAATCTAAATCTCACTTCGTGAATTTCATTTATATGTTGTTGTAATAGACAAACTGATTGTTGAAAAAAGATAAAGACAATAAGAAATAAAAATATATAGAGAAAGTATTAACAAGTAGCACCAGTGGTCTAGTGGTAGAATAGTACCTGCCACGGCACAGACCCGGGTTCGATTCCCGGCTGGTGCATTTCAGAGCAATGATGTTTACTTGCACAAATTGCGAACGATTGGGTCGTCACCTACATCTGATTTATAAGGCAGATAATGTGCATCTGAGCTCTATTTTTTTTATTCGTTTTCGTTTTTCACCTGAATTTAATATATATTGGAGATTGATTATATTTAGAAGCGCTCCGTTTTTCTTTTCCATCCCAGCCTGAGTTGCTTCGTTTTTATTTCCACCCCACTGAGTTGCTTCCTCATCTGTGGTCATATATTATATTTGATTGTTTGTAGAAAAAAAAAAGAGTTTAAATTTTATGTGTTTTAAAAAAACTAATAAGAAAAGTGAATAATGTTTTGAACTCCAATTTATTTAATTCAACCACAACTCCAAAGTTAATGTATACTGAAATCTTCTTTAAAAAGTCAACTTTTTTTGTTGTTGTTATGATAAAATGAGGCAAAAATATAAATGAATTTAGAATTTTATGGCACAATAAACCTACAACTATAATATATATATATACATTATAGTAATATGAAAAATCAGATTCTACAAAAAGAATTTCTTCTTCTAACAGTGTCATTCGACAGTTTATATGCACTTCGATTATACATTATACACACTTCGACTATATATGCATGTAAATATTTTGTTTTTTGCTAATTGATGTCCTTTACTGGTCCCTATAAGACAACTAGTGTTACGATACCCCACTGATGCACAAGAGCAGTGATGTTTACAAATTGTGAACGATTGGGTCGTCACTTACATTTGATTTATAAGATGAGATAATGTGTATCTGAGCTTTACTTTTTTGCTCCTTTTTATCTTGGTGAAAATTTATTTTAATATATATATAATTTTTGGATAAAAGTATTGAATTTCCGTAAATCTGTAACTTTCACGTGTTTTCTTTTACCTAACAATAATAACCATCTCAGTACTAAAACGATAGTGAAGTGAAGTTGTTCTATCATAAAATTAAGAATCGATAAAAGAGTATATTGTGTTATTCTCAAATTTGAAAATAACAATGTACGTTATTATCCATTTTTCAGTTTTTATCACAACCTTTCAAATTATGTTGTTCAAACTTTTCAAAATATGTCAGACAAGTACATATTGAATTCTTCAAAAATATCATATTTTTAAAGAATTAACACGAATACAACAATAGCTTTCATTGTTAAATTTGACTATTTAAACAATATAATTTTTAAGTACATCATAATAATCATTACGTATAAAATAACTTACTCAAAATCAACATGAGAGTGAAGGCTTGGTAGATTATCAACTATAAACTAAGGGCCCGTTTGGCCATAAATTTTCCAAATAATATTTGGGAAAAATTTGGCAAATAATGTTTGTCCATACAATTTGTCATTATTTGGCAAATATTTTTGGTAAATATTCCAAATTTTCAAATACTAGTTTTTTTTAGTATTTGGGCCAAATATTATTATTTGGGATATTTTAAAAATTAAAATTTTACCCCAATCTTTTATCTTTTACAAAAACACCCTCTCTAGTATTTGTTTGCGTTGTATTACATCATTTTTTACGTGAACACCAAAATAGTGATGAAATATTTAGTGAATATTAAATAATGATATGATTGTTGATGAAAATTATTAAAAATTGGCTTTAAGTAACAAAATCATGTACTTTATCTACTTCACGATGTATGGAATAATTCTTGTTATACTCACTCCAAATTATTACATTGCTTCAGTGTCATGGACATTATTTGTTATTGTTGTAACTAACATTCAATTGACTTGTAATACAAACTTTTAGTTAGTTTTGATAGTTTTTAAAACTTGTGTGTATAAATCATATTTTTCTAAAAAGGTGAAATATATTTCCCAAATTTTATGGCCAAACACATGGTGAAATTTCACCCAAATTTTCACTCAAATAATATTTGCCAAAAATATTTGGAAATCTATGACCAAACGCTAGCTAAATATGTCCACTCAAAGATCAGCAAAAAAAGAGTATATAATAAGTTTTAAATTGAATATTTGTGATTAAAGACACTTTTATAAGGTGTTAGCATCCTCATTGTAAGAAAAATTTTGCGCGAATAATTCATTTTTTACTGAAATAATTTTATCACTAAGAAGATTTTTAATCGTCTTTATGATTTACGATACTTATTCCTTATACAATTATTTGAGAAATGATTCTTTCCCTCCGTGATCGATATTAATTTTGCTTTTGTTATAGATGTAACAG

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 28

Page 29: Introduction

GEN 3040

TCTATAAACTTTTTAAAATTTTTATCAATTCTATTTTATTTATTTGGCAAGAAGCAATAATTATTCTTGATATGGTTTAATTTCAAAGAGGAAAATTGATAAATTGAATGTGAATCTTCTATGTTTAACTTTAGGATCATCTTATTTATTGAATGTTAGTGGATTTTTTCAATTTTAAATAACAAAGTTGATGGATAATATTTTAAATAGAAAAATTATATTTAAATAACTTGATTTTTATATAGTTAATACGATATAGTATTTATTGTACCGTCTATTATACATACAATAATATATGAATTCATATATACCTTAATATAAATATTTTTAAAATATCGCGCACCTAATACTATATTTATTTATTTCCTTCTATGGCCATTAAATGATGACATTATATTATGCACGACTATAATTATTGGAATTTGATTCTTAATTCATGAAATTATGACTTTGAGCATTGTTTTTAAATTTTTTGAGAACCAAGTATTTGAAAATCATAATTTTTTTTATTAATTTAATTATTTACAAACTATGTAACAGGGAAATAAAATTTAATTCTTCAAAATAATAACAATTTTTTCAAATGAGAGTAAAGACTTTTCTTATTTGAAGTATTATCTAAACATACATTATTTCGTTTTTCTTTTCCCATTTAACAAACAATTATTTACAAAAAATTATAATTTGAAGATTGGTAAGAAGATTCCGTCAACAAAATTTAATTTGCAACAACTTTTGTTATTGATATGAAATGAATATTTTCCTATTTAAACTATGTGCAAATGTATATCTAGATAATTTTAAATTTCCATTAATATCAATATTTGACAGAAAAATTATAATAAAATTTGTGTTGATTTTTTTTAGGTTTATACATAAAATAATTTATTTGTTTTAGTAAATTTAAAGAATATATTTGTGTAATAACTTTAGTAAGTCGTGATAAACATATGTATGAAAGATATATCTTGCATTCGTTTTAATATAAATATTGAATATTACACAATTGAAATATGCAAACTAGTTTAATTAGTACGAATAGTATAGGTTTCATTTTGTGAGATTTCATTCAATATAGTGTTGTTAGGAAATTGATTGTTGAAAAATGATAAATACATAAAATATAATTATACGAGAAAGTTCAAACAAGTAGCACTAGTGGTCTAGTACTAGAATAGTACCCTGCCATGGTACAGACCTGGGTTTGATTCCCGACTGGTGCATTTCAGAGCAATGGCATTTACTTGCACAAATTGTGAACGATTGGGTCGTCATGGTACATCTGATTTATAATATGATATAATGTGTATCTGAGCTCTGTAACTTTTACTCTTTTTTTTCACGACAATAATAATAACAATTACACCCAAAAGGATAGTGAGGTGAGATTGTCTTATCACAAAACTAAGGATCAAAAAAAAAATTATATTGTACAAATTTGAAAATAGTAATGTATATTTGAGTTATTTGATTGTCCATTTTTTAGTTTTTGTCACAACCTTTTAGTTTATGTTGTTTTAATTTTTTAAAAAATGTTAGACAAGTATATGTTGAATCCTCTAAAAATAATATATTTTTAAAGAATAAACATGAATACCGAAATAGTTTTTGTTGTTAAATTTAACTATTCAACCAATTAAACTTTCAAGTACATCATAATAATCTTTATGCATAAAATAACTTATTAAGATGCACATGAGAGCGAAGATTATCAACTATAGCCCATACATGTCTTCTCAAAGATGAGTAAAAAGGAGAAATGCAACATCAGAGTATATATACAATAGTACTTAAAGAAGTTTCAAATTGAATTTTGGTGACGAGAGACGTCTTTGTGAGCTGTTAAACTTCATTGTAGGAAGAATTTTGCGTGAATAATTCAATCTTCATTGAGATAATTGAATATCTATTAAGATTTTTTTTATCATCTTTATAATTTACGATTCTCACACTTATGCAATTTAAATAACGATTCTTCCCCTCCGTAGTCGATATTAATGGTGCTTTTCTCTCTGATTAATTTGTGCACTTAATAATTTTTTTGGCAAGAAGCAATGGTTATTCTTGATATGGTTTAATTTTAAATAGGGAAATTGATTAATGTGAATCTTTAATGTTTTAATTTTGGGATCATCGTATTTATAGAAAGTTGGTGATTTTTTTTGTTTTAGTTAACAAAGTTGGTGGATGATACTTTAAACGTGAATTTCACATTTAATTAGCTTGATTTTCACATTATTAATATTATATAATATTCATTGTACATTCAATAATATATAAATTGACGCATAATTCAATAGACTATTAAAAAATAGCATGTATAAAATATATATTTCATTAACTATAGTGATTGGAATTGTGAGTCTTATATAGTTCATAAAATTGCGTTTTCGAATTTTTCGAAAACCAACATGTATTTTGAAAATAATAAAAAAGTAAATTAATTTTATTTTTTTTACAAGCTACTATAAACTAAGAAGGAAATAAGCTTAATCATGTGAATAATAACACTATTTTTCAAATAGGAGTAAAGACATTTCTCATTTATATTATCTAAACATAATTTTTTTCATCTTTCTTTTCTTATTTAACAAACACGTCCAACAAAATTAAACAAAACTATTTATGAAAAATCATAATTTAAAGGTTGCTAGTGATTTTTGGAAATAAAATTATCTAATCATTTTTTTTCTTCAAAAAATCTCAAACTTAAAATTATAAGAAACAAAAAAATTAAAAAGAATAATTCGGGATAAATAAATGATTACTTTTTCAATTTCATATATTTGTTGTCGATGAAATGAAATGAAATGGAAGTTTTCCTATACCAAAAATAATTTTAAATTTCCATTAATATCAATATTTGATTAAGCATGAAAATAAAAATTAAAATTTGTATTGAAGATTTTTTTTTTTGATTTTATATAAAAATAATTTTTTAAAAGTAATTTAAAGAATATATTTTTGCCACGTGTTTTTCCACTGGTAATTGCCTAACATAACCTTTCCATATTATAGTCTTATCACGGGAGCTTATTGACCCTCACACTAGACAAAATCCGTCAGTTCAAGTGTGCCGTTTCACCTGAAATCGTCAGATTCCGGCGAGATGGCCGTCGAAGATTCGCCGGATTGGCCTCCTCCAGGATGGACAGAGGATGTCAAAGTATCAAAAGGAAGAAAAATCAAGGTAGAAATTGGACGTATTGAGCTAAATTTTACTCATCATTTCTTCTACATTGATGAATCTTTATAGTACGTAATTAGTCGATTGTATGGTCAAATACGAACAGATATATGTAAAAATTAGGATATTGTTGCTGTTATTGTTTTTTTTTGGGGATATTGTGTATAAATCCAGATGTGTTTGCGGTAATTTTTCACGGGGTCATCGGAAATTCGGATATTGTTGTTATTGTTTTTTACTATTCGTTTTTGAGCAACGGCTGATATGATGGGACAGCTCTGCAGCTCAATTTCGTTGTCATTTCTTTGAATTCGATATAAAGTTTAGTCATTTTTGGAGGTAAGTTAAAGGAATTTCGATTTGAAGTTGATAATAAGACGATTCTGTAGTTAAATCCAGATGGATTTATGGTGCTTTTTGATGGGGTTGTTCAAATTTGGGATATCAGCTTCGTAATATTTTTTAGTGCGGGATCACTTTGACAAACTACTTGCATGTAGAGTCTTGATATTGAATGGTTTATATGTTGTGTAACTTGAGTATTGTTCTGCTCTGTATTGCAAATTCAGAGTTTTAATTCAATTTTCCGCTTTCTAGCTTTTAGTGTTGAAGCTTTAGGTGGGCGAAGTGGGATTCAACTTCTTTTGTTCTTTCGTAAAACTAAGTTTATGAGGTGAGTACAGCAATGGTGTCTCAGGACCTGTGTGGACATAGCTTAAACATGGTAGAAAAGGATTCATATATGCATATCAACTAGTCATTATAAGGCCAGAATCATTTTAGTTTGGTTAGGAAGTTATATG

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 29

Page 30: Introduction

GEN 3040

TCAGAGTAGAGGTGTAGCACGGTAGCAGTAAGCGCAAGGGTTAGAGAACCTGCAGTTGTACAAAACCCCTACTTTGCCTTTAAAAGTTTCCTCAGTCCCTGAATTCTTTCTCGATGAGTTTCATAAAGAACTTCTCTTTGGAATACTTCATGTAATTGTGTTCCCGTCTTGTCTTGGCTAATGTTGATGTATGCAGTGATGTGTAACATTAGAATACAATAATATTCCCAGTGAAATCTCACTTGTGGGGAGCCGTAACATTAGAATCTGCACTATAAAAGGTTGTATCACATATTGTTTTCATTTATCTTTTCAGTACTATACAAATGGGGAGACTGGGAAAAAGTTTTATTCCAAGAAGGAAGTAGCTCGGTACTTGAAGACGAAAGATACTTGCGATGATGTAACTCAAGCAATGAATATTCAAGGTAAGAGCTTCTCTGAGAACAATGTTAGTGATATGGATAATCAAGATGGGAGCTTTGCCAAGGACAATGGTAGTCAAATGGATAATCAAGATAATTGTTGCTCTGAGAACAATGTTAGTCAGGTGAGTATTGAAGGTGACTAAAGCAACTCTTACGTTCTACTGTTTTGCTATGTGATTCTATCTTCTAAGGTCTGTCTCGTGTTATTGTCAGACTGTTGGGAAACCAAATAATTCTCATGAATGGTTACCTCCAGGGTGGATAGTTGAATTGAAGACTCGTAAGTCTGGTTCACATGCTGGTTTATCTTACAAGGTATGAAAATGGAAATATACCGACCTCAGAAATGCTTGTGGTGATGATTTGTTATAATATGCGTCTCTATTTTGACATTAGTGTGACCTTTGGTAAGATCAGCATTGTGTCAATTTTGGTGTTGTATCAGCACGTCTTTTTCCATGATTCCATGCTTAAAGTAAACTGATAACCGTGCTCACTCAAATTCCATGTAGATGACAAAGACTGTTGCATACAAATGATCATATGCTGGTTTTGGTCCTTCTGTTAGTTTGTCTCAAACACAAAATTCCAATAGTATTTGTTAGTTGAGGTAATGATGAGCATGCAGATTCTAGGGGTTGAGTGGTGCATAATATAGAGAAAAGGTCCAAATTTACCCCTTTACATACAAATAGAGGGGTAAATTTGGCATTTACTCATTTTTTTTACAAATATTTTCCTTTTATAAAAAAAAGAAGAGAAGAAGTCCAAATTTACCCCTCTACCATAAAAAAATATCTTAATTGTACTCTCGGTTATACTTTGGGGCAATTCATACCTGATACCTTTACCACTAATAACATGTCTTGTTTTTGCCCTTGATTTTAAGGACACTCCAGCATGGGCTGGGTGGACTCCACATGCCAATATACTTTTGGAAAATGTTCAAATTCACTCTCTACTTTTGACTAGGGTTAAAATTACCCTTCCTTATACATCAGACTGGAGTTCCTCTGTTAGGGTCAAGGGGAAAAAAAAAGAAACGAGATTTTCATCCTAAAAAAACACACATGAGGATTTGCTAACAACGGGGTGAATGGTCATTAGCCCTCACACATAAAGAAGGGTAAGCTACACTGTCAGCTTCATAAAAGTCAGTCCCATGTTTCTATAATTAGAAATCACTTAACTTCAAAAATTTTCTTCTTAACCTTAATGAAATGATTTGTAACTTAACTTTAAATTTTTCCATTTACCCTTAATGCGATAATTTATAGACATACAAATGATTCTTTTTTCACGAAAAAAGTGTCTAAGGTTTGTTTTGTACCACAAATTTCAAAAGTCTACATATCTTTCTTTAACTTTGTGCCAAGTCAAAGATGTGTCACATAAATTGGAACAAAGGGAGTAGTATTTTAGAACTTTGTTAATAGCTGTATAATGTCTTTTAGCTTAATCAAATTTGGTCCTGTATGTGCATCAAAACCTCAGACCAGTTTAGTCATCCAGCCAACTCCTGATTCAGTAATCTGCTGCATTTTTACAGGGTTTGATAATAAGGAAATACTATAGAGAAAACTCAAATTGAAGAAGCTTCATGACTTAACTTCAATTATACAAATTTTGACTTCTTTTGTCATAGGAACTTCAACCAAGTATTCCTAAAAAACTTCTTTAGGGGCTTCTGGTACCTTGGCTGTAGTCCCCCTTTTCTGTGCAACAGTTTTTTTGCTGCAACACCTAAGCAGCAGTTGTTCTAGTTCTCGACTTCTCTGGAAGAAGCATAGCTTTAGAGCAGGAGTGATGCTACTTTGCTGATAAAATGATCCAATCATACCTCATCTTTAGTTTACTAGTTTACATCCTTGTTTCTTTTGTTGTTCTTTCAGGTTTACATTGATCCATCAACTGGAAGCAAATTTTATTCAAAGCCTGAAGTATCTAAATATCTCAAAACTATGAAGCAGAACAACATCGCGGGGGAAGGACAAACAGTAGGGTCTCTCCCTTTTAACTTTCTCACCATGATAAAATCAGTTTATTATTTTCCTTAATTTTTTGGATCATATGGAGCCACATATTAAAGGTGGTGCAGTCAAAACCAAAATAAAGAAACGGAAGTACTTTGGATGAGATTTGATCTCTTTATTGTTATCTTGATCTGCGTAATTCTAATAATCTGTTTCTAAGGATATATTTAAGAGAAAAGGGAAGTTGCTTGTATTTATTTTGGCAGTGGCCTGTTGGAAATGACAAAAGTACTAAAAAGAGTTGCACCATGCATTTGTAACTAGTTTGAAATAAAAATCTAGCTTTGGGCAGACTGACTCATTTTTTGGACATTCCTAATAGGAAAGAGTACTTGAGTAGGTAAAAGTTTGATGGAGGAGGCAAGTTCCAGAAAAGAACTTGATGGTAGCTTTTGTAGCCTTAGGTTATTACATATTTTCCACATTTCATGTAGAAGAGAATTTAAAATGCCTTACTTAAGCCGAGGGTCGATCAGAAAAAATCTCTTTACCCGCACAAGGTAGGGGTGAAGTCTGCGTACACACCACCCTCACCAGATCCGACTTTTGAGCCCAGACCTCTTACCTACCCTGGGTATTTTTGTTGTTGTTGGGTTACTACAGCATCTTCATGTGTAATTATGTTTTTGTGTCAGCATGGAGATGGTGAAATATCTACTACAAATCAGAAAAAATCAGAGGATCCCAAAATTATCAGGGGTGGGGGTCAGACTCGCAAATCCAAAAGGCTGAAATCTGACACCGACCAGGCCTCGGTAAAGACTATTTTTTTCTGTAATGTACTTATGATACTGCTTACAGATTTCCTTCTCAAAAAAAGACACTGCTCACTGATTTGATTGTTAGCTATGTAGTTCACCATGGGTTACCGATTTCTCTAGTAACAAATAGGTGGCAGTTGAAAGCATTTCACTGGGTGTTGCATTTAAGAACTGCAACTTATTCCCGTCTTATTATGTTTTTTCTCAGCCTGGCATTGAGGAGGGGTCATTTTCCAAGCTGAAAGCATCTCAGGATTTTGAAGAAGTTGGGGAACAGAGTTGCACAACCAAAAGGAAGAAGTCTGGCATTACCAGGACTACAAAGACTTATCCTTCTGTAATATACTTAAGATACTTCTGAAAAAAATTGATTCTTTAATTCACTAGTAATATCTAAGTTGCTGTTGAAAGCAATTCAACTGGTGTAGTCTTGAGATGTTGCCAGTATTCTCATGTAATAATGTTCTTGTTTCAGCATGGTGCTGGAGAGGTCTTCATTTTAAATGAGAAAACATCTCTGGGTTCGAAAGAAAATGGGGCACAAAGTCCCCAATCCAAAAGACAGAAACCTGGCAGTGGCTGTCCGTCTGCAAAGAGTGTTTCTTCTGTAATACCTCAGATGCTTCTGATAGATATGATCTATAGCTGTTCTATTCACTACGGATTGCTAATTTTTGTAGTAATATTTAGGTTGCAGTTGACTGTGATTCAGCAGATGAATTACCCCCTGGCTGGAAAAAAGAAAGTAGGAAAAATGATCGTGGAACA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 30

Page 31: Introduction

GEN 3040

AGGAATTACCTGGTATTTCTCTTTTTTCCTCAGCCCCTCCCCCACCCCTCTTACTATCTATCATCTATGTATGTGTGTGTATATACAGGAATTTGCGAACCTACTACCTGAATGTAGTACTTAGCACCATAACCATTTTTCATTTTCCCAATATGAAAAGGGAATTGATCTATGGTTCGAGTAATTTTAATTTCGTTCATCAATAGAAAAGGGGAAGTTAGAAAGGATTTTTTGAATCCATACGTATTTAGAATATGTGCATCTTTTCATAACAAAGATAAACTTTTTGGAAATATTGTCTACCAAGGTCTTTATTGTTCATCTTCAACTCCAACAGACTTTTCCCAATTGAAAGAAACCCATCTCATTTACTCATTTCGTCTCCCCAAAGACCACCTCATGCTAAAATGGGTTCGTTATCCCTTCCATGGTCAATCTTGGATGTGCTAATTGAAAGGCTAGAAACTAATCGAAGCCGTAACAGGAATTCCATTGAAGTAGCAGGGAGCTGGGAAAAGATTGTGGGGAGTGGGGTTGGGGAGAAGAGACTTCAACTTTTTTGGTTAAAATAAACGTCCCTTTTTTAATGAGATAGTACAGACAATTTGGGTAAATTAAAGTGGATTGTAAACCTTTGCACAACCCATATTGTACATACAAGTATAATCTGTTTAAGGGGGACATTACGTTCTGTTTTGCACCTTTATGCAGTATTTTAGCATGCATGAGAAAGGGTGCTGTGGAAACAGGGTTAGGGGACTCCTTGTGGGACTGATGTGGAATGGTGTGACAGTAGATATATATTCAAAGTTGAGAATTTTATGTCGGCTTGGTCATTGCAGCTCTACACAGATCCAGTACATGGATATAAATTTCACTCCAAAAAGGAAGTTCTCCGATATCTACAAACTGGAGATGCTACTAGCTGTGCTAGAAGACCTACGAAAAGGAATGTAGCTTCAACTACAAAGGATGATTCTGTAAGTCACATCTTTCTGGATCCCCCCTCCCTGTATACCCATCACTGCTGTTATTTGTACTTGTTTTATAGCCCTTTTTCGTTTAAGTGTTCAGCTTGCATACAGAACCTTGTTGGTCATCTTTGTAATGTTGCTTCGCCCTTCAGTGAGATGCACCAGTCTCAACCCTTTAGTAGGAAAACACATTCCATCCTGTAAGAGTAGTGTATTTAGTGCCTTCTAAAGCAGGCAGGCTCGTTGAAGTTGTGAAATTTTAGTCGCACTTGAAAAATCCAAGTTTGAAGAGAATAATTTCACATCTTATTATGCAAATTGAAGGATGTGTAGACAACGAGGGCATTCGGCTAATCTTATAAAATAGTTAAACTGGCTTGTAAGCACCTTTTGGCTTATTTATGCATTAGATAGCAGTCAAAGTGTTTATAAGCCTTCCAAAATTAGCCAAAAACTGTAAGTTGCTCATTCCCAACTTATGGCTTTTCAATTTGTAAGTACTTTTGGTTTGACTAAAGTTTTATGATTTTATCCTTGATATATTCTCTGATTCACCAAGTACCTTGATAAGATGGTCTTTTGTTGTTCGTCCTTTCCCCTACAAAATATATATTTTAATATATGAAATATATTTGAGGTAGAGGGTCTTCAGAAAACAGCCTCTCTACCTCCTCGAGGTAGGGGTAGTGTCTGCTTATACTCTACCCTCCCCAGACCCTTCTTATGAGACTACAATGGGTTTGTTGTTGTTGATCTTTTGTAAAAAGGTTGTGGATATTTTAGTCATTTTAAAAAAAGACCATCTTATCAACACTTTTTTACCAAACACATTAGCTGTTTATTATCTTTCAGCACTTCTATCCAAATACGTTTGTTTTAAAAAAAATTCCTTCAGCACTTAAAAATACTTTGCAGCACTTGAAGCTTATCAACTATCTACAATCTGCTAATCCAAACAAACTCAATAACTTGCAAGAAATTTAGCTAGTGACTTGTTTGGTGGATGGCAAAGGAATGGGAGAATACCTTATGATTCTAGATCTGAACCATCATCCAAATGCTATTTATTATCTTTTTCCTTGTAAGTCCATTGAATGAGGAGGAATCTGTCTGGTATTGCATATCAAATTTAATCATTCAAGTGTGAAATCGAAGTCGAATGGAAGAAGGGCATCCAAATGTGCCCAAGGTGTTTTTTTTTTTTGTTTTTGGTGAGCAAAAGGCAGAGATAATCAACCATCTTTTTTTACATTGTAAAGTGGTTAGCCAGCTATGGAACCTGTTTATTAGTTTCGGAGGCATAAGATGGACTATACCAGGAAGGGCAGGTTAGGCTCTAGTAAGCTAGAACAATGAAGGGAGTGGTAGCACAGACAAAAACATATGGAAAGTTGTCCCAGCAGTGATATGGTAGACCATTTAGAACGAGAGCAATTCAAGATGTTTTGGAAGTATCAGTAGTCCTTAAGATTAAAATGAACTGCAATATCACTTTTTGCTATTGGTGTTTCTGAGTATATAGATGATCCTCTAGCAATTGTAGACATTCTATGATCCTTGTAAGATGAAATAGGACCTGGGATAGGGCTTTCTTTAAGTTGCTGAGCCTATTCAGTTGTATGTAGCTCTGTGATGTAAATTTTTGGGAACCCAACCTTAGTGCTGAAGATTTATTTACATAATTGTTACCATTGTCAAAAAAATAAAATAATCAAAGTTGAAAGACTTTTCACCGTATTAACATTTTGCAGATATGGGTATATTCTTTCTGTCTAACTGGTCGATCTTGAAATTCTTCCTCCTTGTTCTTGACTAGCCTTATTTTAGTTTCATGAAAAACTGTAATTGGTCTTGTTCCTTAGTAGACATAGTTTTTGAAAAAATTGTTGTTTATGAGCTTAGTTTGCTTATTATCAAGACTAGTGGATGTTGATTTACCCTAATGGGAGGGATCTCTCATGTTTATAAATTTATTTCTCTTGATAGCCCACAACAGATGATGCCAGCTTGAAGAGAGTAGGAGGCTCTATGACAGGAAGACAGCTTTTTTCCACTGACGAATTGAAAGGTGATAATCTGTAGTTGGAAATATTGCTCTGTAGTCCCTATCTGAGAGGATGTGCATCGTTTATTCAGCGTCTGGTCATTTGTTCTTGTCATTTCAGTTCCCTTCATTTTATTTTTCTTTGTCAATATGTTCACCTGCTCTCTCTCTAGATGTTGGTGATTGTTTCTCAATTGTGAAACTGGGATTACTGATGTAGGTGGGGAAAGTTTTGGTACTTGCTCACCTGCACAGCAAGTTGAGAGTTCGAAGAAACAGCCCGACTGCAGTGTGTCTGATCCTAATGCCATCATCACATCCATTTTAGCTGAGATAAATGAGAATCATTCCTTTGAAAATGTAGTTGGAGATGCTGAAATCGAAACTGTAAGTGTTGGTGTTAAACAGCCATCAGCTGTTTCCACTGAGTCGGATCTCCTCTCGGAGAAGCAACTACTAGAAAATGAGCAGGAAAAACATAGTTCTAAAGCAACACAGCAGTCAAAAAAATCCAGAAAAACAGAGTCACTTAACCTTGGTCGTCGGGTCTCCAAAAGACTTGCAGGTCAAAATGCAGAAGCGGCGGCTGATTTGGATCTTGGTGAACGTGCTCTTCCAGCTGTAGCTGATAAATCTGCGAGCCTCTCCGTGAATGCCTGCAGCCAGGAATTGCTCCAGGACTCCGATCCTACACCAGAAACTGGCAACTCTGATCAGGCTTCTTTGAATGGAGACCCTTCATTGGATGATTTACCCCCAGGCTGGAAAAAAGAAATCAAAATTACTAAAAAAGCTAATGGGATAAGAAAAGACCCGGTACACTCTCTCTCTTCTTGCTTGCACATCATCCTATTCCAGCAGACTATTTCGCATCATCATGCCGTGGCAGGTTATACTTGAGAAAGGCTGTGTTATGTTAGCAGGAGTAGGAGCTGCTTATGTGAAGAATGCTGCATAAGTAGATTTACATTC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 31

Page 32: Introduction

GEN 3040

AAAAGATTTTAGTGCGGATGTTAAATTGATTCTTTTATTGCAGTTATACATTGATCCAGTGGATGGCTATGTATTTCGCTCCAAGAAGGATGTTTTCCGCTATTTACAAACTGGAGACATCAGTAGTTGTGCTATAAGACCTGTCAAAAGGGATCTAGATGCTGCAATGACGGATAGTTCTGTAAGTCATCATATTCCTGGTTCCACCCACCACAAAACACACACACACACCCCTCTGTTAGTTCTTTTTGTTGGATAAGTTCTTTTATCTACGTTACTGCACCTTCTGTTTTCAGTTTGATCAGTTAACGTTATCAATGATCTTTTCTTGTAAGTTCTTTTAAATGAGAAATATAATCATTACAAATTCTAGTTATAATGATATAGAAGGTTAAAAAGGAACAGCTAGATACCTCATGGTTCCCAATTCTGAACCACCATCAATGGAAATTGACATTGTCATTTCTCTTGTAAGTTTTTTAATGGGGAAATCTTTATATCAAGTCTTTTGCATATCAAATATAACTATTCCAAGTGTAGAATTTAAGTTGATGTTGTCATTAGAAAAGAATTTATGTTGAATGTCTTTTCATTGTATTAACATTTCAGTACATGTCTTAACACCAAATGATGTTTGAAGCAAAAAAAAAAAAACACTATATGATAATGGAATAATCTTTTTTTTTATGAAGTAGTTGCATTAATGGCATCAAGCAAATGCAACAAATATATTGATTACAAAAGAGAGTTTGTTGAATAAGTTACATGTAGTCCTATTTTAGGATAAGGAGCTAACTTAAGTCATAAAAATGGTCAGAGGTGTAGATAGGGGTATCTATAGGAGATAGATTCACCAATTAAATAAACATGAGACTTCTACCCATAAGGGAGTGATTTGGAGTTGTTATTCCATCAACACATCTGTGGCTCCTTTCTGTACAAACACACCAAGAAATGTTTGCATGGATCATTTTCCAGATTTTCCTGATTGATTTACCAACTTTCCCCCCTGCTGTACTCCAGCTTGGCAGTTTCCACTGCCTGTCCAAGGTTGAGCCCTGGGATTTGATGTTGGACTTAAAAAGCCGAGATCTCCATCATTCATAGGCATACTTAATGTTCTCCGGCATGGCCCTGAAATTTCCACAATTGTTTTGACTAGTCCTTTAATTGGGACTGTAATCACAAACTATGTGTTTCATCGTTTGTCATGAGTTTAAACTTTGACAAGCTAATTCTTTGAACTCAGGGCTAATGTATGTTGATTTACTTTGAACAAGAGAATTCTCTAACGGTTGAAAATTCTCTCTTTTGGTAGCCCACAACAGTTGACACCAATTCGAAGAAATTAGTGTGCTCTGTAACCGAAAGGCAACCTTTTGCTGCCAAGGAATCCAGAGGTGATACTTCTTTAGTTAATATTCCCTAACATTCTAACTATTAGAGGTGATATATACTTAACTAAATGTTGGGGCCATCAACAGCAGTCGTTTCAATTTTTCATTCTAATCTTCTCCTTGATTTTGTCTTATCGACTTTTTAAATATACATTTTTTTCTCAATTCTGAAACTGGGATTGCTGATGTAGGTCGAAAACGTTCAGTTACTTGTTCACCAAAAGTACAAGCTGAGAGTTCCAAGAAACAGCCTGAAAGCAGTGCATCCAATGCAAACACTATTATTACATCATCTGAAGCTGATATTATTGTGAAGCAAAATGCTGAAATAAGTCTAGACACAGAGCCAGAAATAAGAGACTCAAGTGCAGACACAAAAGTCATAGATGCTGAAAGTAACGCTATCAAAAGGTTATCAGCTGTCTCAACCCCTCAGTCAGATCTACCCTCAGAGCAGCAATTACCAGAAAATGAGCCCGAAAAGCATATTAGTAAAGAAAAACCGGAGCAGTCAAGAAGATCCAGAGCTAAGAAACCACTCACACCTGGTCGGCGGATCTCAAAAAGACTTGTGGGCCACAGCCCAGAACCGGTGGCTGATTTGGATCTAGGTGAACGTGCTTTTCGAGCTGTTGTTAAGAAATCTGCCTCTTTGGGCATCCCCCTGAATGTCTCTGGTCAGGAATTTTCCCAGAACAAAGATCCTACAGTAGGAACTGGCAATTCTGATCAGGCTCCTCTGAGCAGAGAAGCTTCAGGCCTTGAATTTCCACCAGACTTGGGAAAGGAAATGCTTACCCAGAATGAAGATCTGACAGAAGGAACTGGCAATTCTGATCAGGCTTCTCTGAAGATAGAAGCTTCAGGGGATGGATTATCGCCAGGCTTGGAAAAGGAAATAATTTCCCAGAACAAAGATCCGACCGTAGGAACTGGCAATTCTGATGAGGCTTCTCTGAGCAAAGAAGCTTCAGGAGATGGATTATCACCAGGCTTGGAAAAGGAAATACTTTCCCAGAACAAAGACCCGACAGTAGGAACTGGCAATGCTGATCAGGCTTCTCTGAGCAGAAAAGCTTCAGGGGATGAATTATCACCAGGCTTGAAAAAGGAAATACTTACCCATTATAAAGACCCGACAGTAGGATCTGGCAATTCTGATCAGGCTTCTTTGAGAGCTTCAGGGGATGAATTATCACCAGGCTTGAAAAAGGAAATACTTACCCATTATAAAGACCCGACAGTAGGATCTGGCAATTCTGATCAGGCTTATCTGAGCAGAAAAGCTTCACTAGATGAAATACCACCAGGCTGGGAAAAGGAAATGCTTGGACAGAACAAATATTCTACTGTAGGAACTGGCAATTCTGATCAGGCTTCTCTGAGCAGAAAACTTTCACCAGATGAAATACCCCCAGGTTGGGAGAATCAAATACCCCCAGGTTGGAAGTATGAAATACCTGCCCAGAACAAAGATCCTATAATAGGGACTAGCAATTCTGATCAAGCTTCTCTGAGCAGAGAAAAAGCCTCGCTGGATGAAATACCCCCAGGTTGGGAAAAGGAAATACTTGCCTACAACAAAGATCCTGCAATAAGAACTGGCAATTCTGATCAGGCTTCTCTCATCAGAGAATTAGCTTCAGCAGATGAATTCCCACCCGGATGGGAAAAGGAAATGCTTACCAGGAACAAACATCCCACAGTAGGAACTGGCAATTCTAATTCACTTGATGATATACCTCCAGGCTGGGAAAAGGAACTACTTGTCCAGCAAAAAGATCATACAGTAGGAACTGGCAATTCTTATCAGTCTTTTCTGAGCAAAGAAGCTTCAGTGGATGGATTACCACCAGGCTGGAAAACAGAATTCAGAATAAGAAAAAATGCTAGTGTGATAAAAAAGGACCCGGTATTCTCTTTCTTCCTCAGCATTTATGTCTTCCCTTTCCTCCCGGTGCACTTCTCTCCCCCTTCTTTTCTCATGCACATCATCCTACTCTAGTAGAGTCTATTTACCACATACTAATTCATAAAAAAAAGAGTCTGGTTTACACCATTAGGGCATCTCCTACCCATAGCAAATAGTGTTGATACTATTTTAGTGTAAACCATCTCCAACCCACTACACTAAATTTTATACTAAAAAAGAATATACTCCTTATAATCTTCTCTCTCTTCAATTTTTTATTATAATTCATATTTTCAATTTTTTTAGTACAAATAATAATAAAATAATCAAAAAGAGATTTGGTGTTGATGAATAGTGTTGCACCAAATTTTGTGCAGCACTATTATCACACTAAAATAGTGTAAGAATTGGTGTCGGGTTGGAGCACTAAAACACAAAATTTTACATCAAAGTAGGATTTGGTGTAAAATTTGGTGTTTGGTTGGAGACGATCTTATGCAGATATGTGATATATGCATGAGAAAGGGTGCTTCTGGAGAAGGGTCATATAGCTGCTTATGTGGCAGACATGGAATGCTGCATTCTTATTTTGCAGTGAGAATTTTAAGTTTACTTGGACATTGCAGTATTACACGGATCCAGTGCATGGATATGTATTTCGTTCCAAAAAGGATGTTATGC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 32

Page 33: Introduction

GEN 3040

GCTATCTGCAAACTGGAGACATCAGAAGTTGTGCTGTGAAGCCTACCAAAAGGGATCCGGGTTCGACAATGAAGGATAATTCTGTAAGTCTGCACCTTCTGGATCTCCCACCCGTGTTCCCCATCAAATCTATTTGTGCTATATGTTCGATAACTCCTGTTATCCATGTTACTGCTTTTCAGCTTGCGTAATAATCTTGTTGGTCGTATTTGTATGAGCCTTTTACTAGGAAGATATATTTGTTTCTTTAAGACCAGTGGTAAAGTTGGTGTCTTGTGTAAAACTTTGTTTTACGTATTGGAGGTGTTGAAGCACCTCTTCTTACACGGGTTTGAGTGGTCTGAGACTGAGCAACTGATTTAGAGAATACTCGTTGAGAAGATATATTGGAAAGCAATAACTCCAAGTTGAAATAGAATAATTCCACATCACTCTATTCTAATTACATATGTAAAAAGAAAAGCTTATACGAAACTACAGGTAAAGACTATGCCATAGAAGGGTAAGAGGAATGACTCTTACCTTATACTTCCCAAATCTGGACCACAGTTCTGAGCTTTATCCCTGTTTCTATAAGTTCATTAAATGTGGAATTTCTTTATAAACTATAGCTGCAAAACGATGCGCTGGAAAGTTAGAAGAATGACTCAATACCTTAATGATTTCGAGATTTCAACCACCATTTAAGCATTTTACAAGTCCCTTTTTTCCTTGCGAGTTCTTAAACAAGGAGGAAAATCTTTTATGTGTGGCGTTTTGCTTTAAATTAACATTTCGCTGCATGTCTTTACTTTCTGTTCCTATCTGACAAAATAATCTGGAAATCCTTCACTTAATTCTTGATTGGCCCTTATTTGGTTCAATAGAGGACTGTTGTCACAAATCCTTAATGCATTTCACTGTTCATGACAAGATTTGAGTTCGGCAAGGTCATTTTTAATTTCTCGATACAAGTGTATTGATTTATCTTGAAAGGGAGGAAGTTCTGATGGTTGAAAGCTTCTTTCTTTTGACAGCCCTCAACACATGATACCAGTTCGAAGAAATTAAGGTGCTCTTTGACCGGAATACAACTTTCTGCCACTGATGAATCAAAAGGTGATGCATTTTAGTGGGAATTTTGCTCAAACTCCAATACTGTAGGAGATACAACATTATTCAGTATCGACAGTCATTTCAATTCCTCCAATTTAGTCTTGTCATTGATATACGTCCTCCTGAATTCTCTAAAAGTTGATTGTTATTCTCATTCTGAAATGGGAGTTGCTGATGCAGGTAGTAAGTGTTCTGTTACTTGTTCAATGGCACTGCAAGCTGAGAATTCCAGTGAACATCCTAAAAGCAGTATGTTCAACCTCAACACCAGTATTTCATCTATTGTAGCCGATATAACAGAAAAACATTCATATGAAAATCTAAATGAAGATACTGAAATAAGACTAGATGTGGAACCAAAAATAAGAGACTCAGTTGGAAACACAAAAGCTGCAGCTGCTGAAACTGTTGATGTTAGAAGGTCATTAGTTGTTTCCCCCCAGTTGGACTTCCTCCCAGAGCAGCAATTACGAGAAAATGAGAAGGAAAAGCATAGTAATAAGGGAGTGCCAGCAAGGTTGAGAAAAATCAGAAACAGCAAGTCATCTACTCCAGTTCGTCGTGTCTCAAAGAGACTTTCACGCCACAACCCAGAAATGGTGACTGATTTAAATCTAGGTGAACGTGCTCTTCATTCTGTGGTTAACAGTTCAGCCTCTTCAGGCCTTCCCATGAATGTCTCTGGTGGGCAATCGGCCCAGCAAACTGATCTAGCAACTGGGGATTCTGATCAGCCCTCTCTAAGCAGAAAAGCTTCATCACGAGATGATCCTTTAGAAGTTGTGAAGTGTCTTTCAGAGGGCCAAGCGTTTAAAGAGAATATTGATGAGAGAATATGGCGAGATTCTCAGCCAGCTGGAATATCATATGAAAGAGGAAAGGTTCACTTGGAACAGCATGCGCTTAACGAAAGGCCTACAGCCAAGCCGGCAAAAGAAGAAGAAGACGGTCAGAATAGATTATTGAACAATTCTCAACTAGCTGAATCAGCATCAGGAAGAAGGGTATTCCCTGCAGAGGACCGGTCTGTCTCAGAAAGGTCTACTGTAGAAGAGGTAAGTGAAAAACGAAATGGTGAGTACAAGCAATGGCAATATCCACAACTAGCTGAACCATCATGGAGAAGTGTGGATCTTAATGTGGAAAACCAGCAATGGCAAGATGTACAACTAGCTCCATCACAGAGATTCGTGGATCTTAATATGGAAAACCAAGAGTGGCAAGATTTACGTCTACCTCCATCACAGAGAAGTGTGGATCTTAATGTGGGAAACCAGCAGTGGCAAGCCTCACAATTAGCTGAACCATCACACAGAAATCTAGATCTTAATGTGGATAGCCAGCCTGTTAAAGAAAAGCAAACAGGAGAGCTGGTAACTGAGAATCAAGATGAACAGAACAGAAGATTGCATGCGCAACTAGCTTCTTATCCATTTGGGGACTATTGGTCAGACCCATGCATGGAGTTTGCATTCAAGACCCTTACAGGTGCACTACCAGTAGAAGATACCCTGACCTTTCAGGGATCTACCCATCAAGAATACAATACTTCCTACACGCAGGCCGATGATGGATGTTTTGAACTGCCATTATTCAACACTTCCAGTTTTTACTTGAATGATGCTCCAAACCACTGTGCTCCATCGGTGGAGCACGTCGTCAAAGAGCAACCGCCAATAAATCAGACCTTTTTGCCCACTGGACATAATAGCATACCAGGCCACAGTAGTGTTGTTTCTCAAAATCCAGGTTTGACCTTTTTGCCCAATGGCAATAACAGCATACCAGGTTGCAGTAGTGTTGTTTCTCAAAATCCAGGTCTGAATACCCAGGCAAAGGACTACCAGTCTAAATTTAAATCTCACAGGTGAATGAGAGCACATAAAGAGAGGGAAGTGTTAGGTGTATTAATGGATGAACTAATGACGCGGTGCTGCTCCTGTGAATGCATTTTTATGCAGCTATATCCTCTGTTTGTAAACATTTCTCTGATGCGGTTTCTGGAGTTTGGCGTATCTATAATTGATTGAAGATTGACCACAGGTGATGCTTCATCAATTGTCTGGAGGTGACAGATAAATTAGGTTTGTTCGAACCAGATAGCTGGTCGAAATGTAGCTTTAACTTTCAGAGTGCTTAGATCCCACGATCCATTGGACTTTAGACGATCTGTTTCAAAAGACATTATAGACAATTGCTTGCTATATGTCTGTATAATTTAGGTTGGACCTCTAAGCTTTTATTACACGTAAATTCGTCTCTGTTTTCTCTCCACCTCTATGACTGGATGTTAAAAGGCATCCATCCAAAAGATGCTTCTTCAAACAGGTTACTTATTTGTGAAAAGATTAGAAATGTGCACTCAAACAGAGTGCAGTAGTATGCACCTTTTTGAGACCCATGTTCATATTCCAGTAGGTACCATAAAAAACTCATCTATCCATTCTTTAGGTGAAATAGTCGTGGCGAGTTGGATCGAACACCATCATATGTTTTGAAAATGAGAAGAAATAAAGAGCGTTAAGGACAAAGTTAGAGTGATTTCTTCACATCATATGTTTTGAAAATGAGAAGAAATAAAGAGCGTTAAGGACAAAGTTAGAGTGATTTCTTCACAATAATTTCTAGATATTTGACTTGATTATCTCATAACTTGACATTGTTGTTCTGCATAATAACAAACCTTACATTGTAGCATATGATAAAATAATTTAACTTGAAAAATGAAAAGAAAATCCCAACATTTTTATTACAGTAACCCGTATAGACTCAAAATAGCTCTTTTCTTTTTTACTATATGAAATCCACACTTTTAACACTTGAAAATGTTATAACGGGTAAGATAAGTTCATAATTTAAGCAAGAACAAATTGATCAGTCGAGTAATGCTTCACA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 33

Page 34: Introduction

GEN 3040

TCAATAAGAAGACTCTTTCCATCCACCAAGGTTCTCTTCTTCATACTCCCTCTCAGAATATTCCTACAAAAACATAGCAAAATATATCATGCACATAGAATACCTATTATACTTGCATGTTACGAGTTCCTTTATTCATATAGCCAATCTGAACTTGTTAGCTCGGGATTGAGGCATAACCTTTTTTTTTTTTTTTTTTATAAATCAATAAAATTTTAGAAAGTAACTTGCCTCAGGAGTAAAGGTGAGACCAACTTTAATTTCTCCACAATATTCCTCATCTTTAACAACATTATAACATGTTATTGGGATGCTCCCTTCTGAAAAAACTGGTTCAATTGGTATCCTGCAAGTAACATATTATTGAACATATCTCAAAACCAATACTAGCAAACTCAGATTATGTTTGCATATATATATATATATATATATACAGAAATTGTATATACAGAAATTGTATAAATCTTAGATTCGTCTCTGATAGGTAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGGAAATGCTTATTCGACTCCATGAACTAATAATCTCAACTGTTTTAGGCAAATGCTTATTCGACTCCATGAACTAATAATCTCAACCGTTTAAACCATGGTTTGTATAACACCACTAGAAAGTAGATGTATACTACAACAACCACAAAAATAACTATATATTCAGTGTAATTTCACAAGTTGGATATTGGGAAGGTAGGATATCGATGTACGCTGAACATACCCCTACCTTTGCAGGTTGTTCACGAAGGACCCTGAACTCAAAAGAAAAGAAAAGAAAAAGCATGGTGCAAGTAAATAAAACGACAAAATAGCAAGATACAAATCAAATGAAGCAACTGATTGTAATAATCATTAAAGAAAGGAAACTACGCGATATACAAAATACTACTACTAAAAAGGAGAAGTAGAGACTAGGCTAGCCCTCTTCCTCTCCAATAACAAGTGCAACAACACTCAGGCTCACTAACTACTAACCTTCTACCCTAATTTTCGACCTCCACGTCTTCTTATCTAAAGTCATATACGTCTATTTATCTCAACATATACAAGTCATGGATGCATGTACTATCAAACTCAAACTATGTTAATATAGAGAAAATACGTATAATCTTAGATTCGCCTCTAATTAGATATACTTACTTAGCTTCCCCAACAATATCGTCCTCGCCGATAGAATCACTGTCCATTATCTTCAAGAAGAGTTCTTCAGCATCCTCAGAAATGGAGAAAACAAATGTTTCATTCCATTCTGGTTCACATCCCTTCCCTATGAAAATTTAACATTATAAATATCGATTGATCAAACTTCATTTCATTTTTCTTAAAAAAAACATAGGTAACTAATAACTAAGTAATAATTAACCAAAGATTATTAACAAATGTACCTGATGCAACAGTACTCTTTTTCTCCTGAGATCTATAAGTTAAGATCACATATGGATCCATATCATCTGTATCACACAAAATAACATTTTTTAAATTAAAAAATAAAATAAAATACAGAGTATCAATCAAATTAGCCAAGATGATTATCGATTTTTGATTCATATTCCTCGATTCTATTTTATTTGACATCGTTTGACTAGGCATAGACCTAAATGTCAACTTCACATACTTTGATTCAAAATTTAAAGTTTACCACGAAGTCAAATTAATATACATTGATAAGGTTTGGCCATGAATTTTGACAAATATCTGTTTGATCATGAAAATTTGATCCGATCTTCAAATAATTCTCTATTTTCCTAAATAAAACGCACATTCAAACAGGTGTTGTGTGAACACATAAATTACATGTAGATCAACCTCTAATTCACATAAGTAACAATCGAAAATTAGGTTGATTAAGAAAACATCAAATGTAATCGAAAGAAAAAAGAGATCTATACATTAATCTGATCAGATCAAAACAAATAAATTAGAAAAAAAAACAAAAAACTTACTGAGAAAATCAGTGTTTTCAAGGCCTTTAGCTCCAACAAGTTGAACTACAAGTTTTCCTCGTACCATTTTCAATCAAACTCTTAGCAGATTCCCAGGAAAAAAAAACAGATTCTCGCACGAATTTTCGGACAGAGATCGTTTTCTTTTTTTTTTTTTTTGTTTCGATGAAGAATATGTTGAGCTGATGTGTATTTATATGTGAACAAAAGACCGGTTTAAAGTGGGGTTAGACCGGTGAGTGTAAAGTGAACCGGACGGTTTAGGCGAAAATCACATTTTAGACCAAAAAAGGCAAAAAAGAAAATCACATTTTACGCACCTAAATTATTTCGAGTATGAAAAACATATTTAAACTAATAATAATAAGTTTGAGAAATATACTTCAAGTAATATGTGATGTCGTGTGTATTACGTGTATACACACATTTGATTATGTTAGATTGCGTTGGCGAAAAAAAAGCCACGTGGATTATTTTTTTTTTTGGATCAAATGCGTCGGAGTGTGCATTTGGTCTGGTTAGAATTGAATTGTGTTTTTTTTTCTTCAAATTAATGAGTGGTAATATTGTAGAAATGAATCAATAGAATCTTCTTCATTTTAAGATTTTAGGTTTCAATTATTCGTTTTTGTGACTGAAAAAAGGCTAATTCAGCAGCTTTTTGTTCAATTTCTCGTGCAGTCAAATTCTCATCAGTACAGGTCAAGAGAATGGACCCCTGGCCTCGGATGTCCATATAAAGAACACGACAAACATAAATATCCTCTTTCACTTCTATTCTAACGAACTGAATATCTTTTATAAGGAATCGGAGTACTTAGGAACATGTTTTATACATTTTCAATAATTTAAATATGATATTAAAAAAAAAAGATTGTTCAAATATGTTTTGACATTTTTAGCTTTTATTTTAATAAAATAGATTTAAAAAAAAGTATCTTTGAACTACGTGAAATTCAACAAATGTCTTCTTCAAAAATGCCATCCTATTAATATTCTTGTCCAAACATATCCCTACCATGAATATTTTAGTCAAAAAATATCCTATTACTACTTTTGAGTAAAAAATATCTTTTTAAAGTATTGTTTCTTTTCTGTTTAAACAAATTCCCTTTTTAAGAGATATTTGATATATACCCTTTGGGCCTTGATGGTTGATGAGGTGATCCAAGAAATATTTAGACTATCATTTAATGCTTGTTTAACAAACGAATATTAAGAAAGTGATTTAACATTTTTTCTTTATAAGATTTACCGTTTGTTTCAAAAATAAATTTCGATTTCACCGTTTAATAGATTTGGTAAAAAAAGTGATGATCGTAAATTTTTTTTTATGATTTATTAATATCACTCCCTTTTAGTGATATATCATACATTTCTTATTTTTTTAAAATGGATATTATGTGCAGATTATTATCTCAAGTGGACATATTATGACCAAGCAAAATTATTAGGTTTTGTATATGTTCATTGTACATAATAATTGTATATGCTTATAGCCGCCTATCTAGCTATATCTTTATAAAAAAATATAAATTATGGTTTTATTAATATATCAAGTGTCTATATATACATATATATATATATATATTTATAATATTGTCCATAGTGTAAAAAGCCGCCATACCCCTATATAAACCCCTAATTTATTGTGTAAATATATCAACAAACCAAATATTCTTCTTT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 34

Page 35: Introduction

GEN 3040

TATTCTAAAATGAGTCATTTGACAACTTGTTTAGTGTTCTTTCTCCTTGCCTTTGTGACTTACACTTATGCTTCCGGTGTATTTGAGGTCCATAACAACTGTCCATACACCGTATGGGCAGCGGCGACTCCCGTAGGAGGTGGCCGACGTCTCGAGAGAGGTCAAAGTTGGTGGTTTTGGGCTCCACCGGGCACCAAAATGGCACGTATTTGGGGTCGTACTAATTGCAATTTTGATGGTGCTGGTAGAGGTTGGTGCCAGACCGGTGATTGTGGTGGAGTCTTGGAATGCAAAGGATGGGGTAAACCACCAAACACCTTAGCTGAATACGCTTTGAATCAATTTAGCAACTTAGATTTCTGGGACATTTCTGTTATTGATGGATTCAACATCCCTATGTCTTTCGGCCCAACTAACCCTGGGCCGGGAAAATGTCATCCAATTCAATGTGTTGCTAATATAAACGGTGAATGCCCTGGTTCACTTAGGGTACCCGGAGGATGTAACAACCCTTGTACTACATTCGGAGGACAACAATATTGTTGCACTCAAGGTCCATGTGGTCCTACCGATTTGTCAAGATTTTTCAAACAAAGATGTCCCGATGCCTATAGTTATCCTCAAGACGATCCAACAAGTACTTTTACTTGTCAGAGTTGGACTACAGACTACAAGGTTATGTTTTGTCCTTATGGCTCTACTCACAATGAAACAACAAATTTCCCATTGGAGATTCCTACAAATACTCTTGAAGTGGCTTAATTAAGTAGAAATCTTGAGTTGAGCCAATCTCTTTTTTGTTAAATTTGTTTTGAAGATGGACATTGTCTTTAATTTGCCTTGTACACATTTCTTTTACATTTGTTTGTTTGAAGTGGCTAAGATCTGAATAACAATCTATGATCGATGCTTTGTCTTATATATGGTATTTTAATTTTAAATTGAAATTATATCTGGAGTACAGTTTTTTTCTTAAAATTTTTAATTATAATGAAAAAAAATTAGTCTAGTTAATATATTAAAGTACCTAGATCTGAAAATATAAAATTGTAGGGACAACTCGTTACATGTCATAATATAAGTCAAACAGATGGAAATATTTTTTTATTGGCGGCTTTTGGAACATTTTCAAGTATACTATAGTATATGAAAATTCATTAAAATTTTAATAAGTGTTTAGATTTGAATTTATAAGTTTAAAGAAATAATAAATTCAATATTAATTAAAAAGCTTAAAGATTCGAACATTAAGTTAAACTGATAATTAAATGAGTCGATATACAAAAAAGAAAAAGAAAAAAGAAGAAGATAATTTTATATAAAAGTCAACTTTTAAAAATCACTTACCCCATTAATATCTTCCTTCTAAAAGACGTTATCCTTTAGTATCTGAATTCATAACTTGAAATAGCAAAAAAGGGCTTTCACTTTCAATTGCGATTCAATGCCTAAGCTAAGAATTCAACATTATTTAAATATTGATATTGTATTAAAATTCATTACTTCCAAAACTCTTTTATTATAAAAAAATAAAATAAAATACAATTGGGCGTTCGTATTGCTTAAATAATTGCAATGTTACTTAAAAAATTCTATTAAAACAATTTAAGTTCTTTTATTCTGCCCAAGCAGTTACACGTTGTTGATTTCAGACTCCTCCGGTAGAGCCAACAACCATTCAGTGAATTGTCTTGTGTTACTAAGTTTCAAGTCGTTCACTGAATCCTTCACGATGTTTTGGTGGCAGGTACTAGTACATCTACCGATCAAAATCAAATCCAAAAACTCTTGCATGTTCCAAAATTAAACTCAAATATACAATCATCTCATTTAAACTCAAGAATATCTTATTATGAACCAACAAACAAAGACGACCATGATGACAACGATAAAAAATCAATACTCCCTCCGTCCAGAAATATTTGTCATGTTACGCTTATCGAAAGTCAATTTTGATTGATTTCTAAAGGTAAATTAAATCACATTAATTCGATACTTTAAACCAAAAAAATTAAATATTCTAAAACAATATGAAAAGTACTATAAATTACAATCTTTTCATATTAATATGATAAAAAAATGTATCTTCAAATGTTAGTCAAAGTTTTTATAGTTTGAGTCTATAAATAGAAACCATAACAAACAATATCGGACGGAGGAAACATAAAACAAAAATCAATATCGAACAAATTATTCAATCAATAACAAAAATCATTGCTAGTTCTATATCTAGTAACCAATTAGGGATAGATTATTCAAAACTTTGCATATTAGAGATAGATTAGTAACGAAATTCATAACTAATTTCAAAATTCATTTTTCTCGTGTGAAACAAGACCTACATAGCTTGTATCGAATAAAAAAAAAATTCAAACTTTTAAGGTTCCACTCATCTCTACGCCTACCCATTTGCTCAACTTTTTTGACTGTAGAATGCATGCACATTTTCAATTAATTATTATGTCCACTTGATAAATATGCTATAAATAATTTAATATAAAAATCCACTATATATATACGACGGAAAAAAATAATCAAGGGAAAAAAGGCAATTGTCTGTAAACAAAAATTACAAATTTGTAATTCAAAATTTTTAAGTTATTTTATTTATGAAAACATAACCAATAAAAATTATGAAGAGAATTCTCTACTAACTAAATTTCTCATGAAAAAATGTTCAAAAAAAATTAGGAAATTTTCCACAATTTTAGGTTTCATGCCCTGCCATGCATTTTTCCGATTTGTTGTATATACTAAACTAAATAATTTAAAATTAATATATAGTTAAACACAAAACTAATAATACGTATATATCACTTAATTAAAAAATATCTTCTCTTTTTTTTTTAATCATTATAAATATCATAATGTGTTAATGCATAAATTTTCAAAATAGCAAAAAGTTGAACTGTGAAATATGACTGGAGTGAACAAAATATATTACTTATAATAGGCTGTACTCATATTATATTGGCGGCTTTTTTTTTCAAGTGGAGATATTTTGGATACAATTTTTTTTTTTAATTTTCTATATATGTTTATAACAAGTGCCAATATTTATATGATTAATGTCCATAGTGAAAAAAAGCCGCCATACCACTATATAAACCCCTAGCTCGATATGCAAATATATCCTACAACTTCTTATACTAAAATGAGTCACTTGACAACTTGTTTAGTGTTTTTTCTCCTTGCCTTTGTGACTTACACTTATGGTTCCGGTGTATTTGAGGTCCATAACAACTGCCCCTACACCGTTTGGGCGGCAGCGGTCCCCGTAGGAGGTGGTCGACGTCTCGAGAGAGGTCAGAGTTGGTGGTTTTGGGCCCCACCTGGCACTAAAATGGCACGTATTTGGGGTCGTACTAATTGCAACTTTGATGGTGCTGGAAGAGGTTGGTGCCAGACTGGTGATTGTGGTGGAGTCCTGGATTGCAAAGGATGGGGTAAACCACCAAACACCTTAGCTGAATACGCTTTGAACCAGTTTGGTAACCTAGATTTTTGGGATATTTCTGTTATTGATGGATTCAACATCCCTATGTCTTTTGGCCCAACTAAGCCTGGCCCTGGAAAATGTCATCCAATTCAATGCACAGCCAATATAAACGGTGAATGCCCTGGTTCACTTAGGGTACCTGGAGGATGTAACAACCCTTGTACAACATTCGGAGGACAACAATATTGTTGTAATCACGGTCCATGTGGTCCTACTGAATTGTCAAGATTTTTCAAACAAAGATGTCCTGATGCCTATAGTTACCCTCAAGACGATCCAACTAGTACTTTTACTTGTCAGAGTTGGACTACAGACTACAAGATAATGTTCTGTCCTTATGGCTCTACTCACAATGAAACAACAAATTTCCCATTGGAGATGCCTACAAGTACTCTTGAAGTTGCTTAATAAGTAGATTCATCATGAGTGTAGCCACCTCTTTCTTGTTCAAAATGGTTGAGTTTTGTTCATTACTTGTCTTTTGTATTATC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 35

Page 36: Introduction

GEN 3040

CACATTTCTTTTAGTTTGTTATCTTGAGGTGGCTAAGATTTCAATAACATATTGTACTTTAATTTGTCTTACACATGATGTTCAAAATTCTCACACAAGAGTGCATTTGGAGTATCGTTTAATTTTGTCTTTTTAATTCCTAATTATCTAAGTCTTAAACCTTAGACAAAAAAAGGAATGAAAAATAGTATTGATAACATTCTACCGTAATCAGATCAAAAATATAAAATTGTAGGGACAAGATAAACTATTATAATTACTATATAGGATAGCTATCTTTATAGGACATAATAATTAGTCGAGGTGGAAATCTTTTTTGTAGGCGGCTTTTAGAATATTTCATGCAAAGGTTAAGGTGGAGCCACGTAGAATTAAGGGGTTTATTCAATTTTTTAAATAGATAATTATTAATTTTATTTATATATAGTTAAAATTATTTTTAATGTATATACAGTGATTAATTAATGAATGAATTTTTTTTTTCTTTAAATTGTTAGTGTGTTTATTTTCTCATATTTCAAAATCCTCTAATGAAAATTATTACTCCGCTATTAATCATGCTCTTGATATTCAACATTTGTATTGAGCTTGATAAATTTAAATTTGCATTCGAAAAATTTCACTTTGAAGGCAAAACACTTATTATATATAATGAAAAGAAATTTACTCTCATGGCTCAAACTCGATATATCAAATTAAAAATTGAAAACTAACATCCCACCACAGCCCCTTTGCATATTTTCAGCACGTTATGGTGCATATTCCGTGACTTTTATGCAATCACTCCTGATAATAAGAAGCAAAGTCCTTAGCACCATAATATTCAATTCATTTTAAAATATTTATCATGTTTTATTTGAAATAACTGATCATTAACAAAATATGTTCAAATAATTTCACTCTTTTTTTTCTTAAAACTTAACTTAATAATTTAAATTCACATTACGTAAACTTTTTAACTCCACGCCATTATTATAAAAAAATATGTCATGTGTCAAGTGGATATATAATTAAAAGAAAAACGTATGTGTATCTACACTCAAAAAAGTCAAGAAAATGGGGGTATATGGAGATATTGGTGAGTGGGGTTTTTTATGTTACTTTGGTGGTTAAATAATTATTCATCATTACTTACTATATTGACAAAAGAATATTATTTTCAAGAGTAATGTTGTTAATTGTTTTAGGACAACTTTCGCATATAGCAAATAAAAAATTTATATTTGTGTGTTATAGCAAAATTTGTATAATTACGCTCCATAGCAAACATAAAACTGTATAATTCACTATAATATACAATTGTATAATTTGCTGGCCTATTTCGCCGCAATTGTATAATTCGCTATCATATTTCACTGCAATTGTATAATGCATATTGTATAATTCTCTGCAATATTTGTATAAAATTTATTTTGCATACAATTGAATCGAAGTAAAATGTATGTATATTGCATAATTATAAGTGTATAGGAAAAAAATATATGTTTTTCTCTCGCTTTATACAAAAACATAAACACAATTTATACACTTCTGTTGTATAAAGCAAGAGAAAATTGTATTTCACTGCAATTGTATAATTCGTTCGCCTTTTTCGCTGCAATATTTGTATAAAATTTGCATTTGTCTACAATTGAATTGAAGTAAAATATTTGTAAATTGTATAATTAAGTGTATAACACGAAGATATATGTTTTTGCATGTGTATATACAGTTTTCTCTCGCTTATACAAAACAGAAACACAATTTATACACTTTTGTGTATAAAGAGAGAGAGGCGACTGACAAACTTTGGCTAACGTTTGCTATGGTGCACAATTAAATCAAATCATAGCTACTCCATTTATTTTAGGTTATTAGTTTGCTATTATATACAATTATCCCATTATTTTATCCCAATACAATATTTAGTCATTTCATTCCATCAATTTCGGTGAAGTTCAAATCAATCGTTGTTTTAAAAGAGATTAGTATTGATAAAGTCACGTTCTACAATATTTTTTATTTTTTTGAGTGGCGATAAATTATTAATATTATTTTTGACAATTCATTTTATTCTTTAGAGTGCGATATAATCAACTTGTTAATCTATTATATCAAGTTAACAAGTTATTAGAATCTTTTTCACTCTTAATCCTCTAAAATATGTAAAACATGTTAATGATAAAAAAAAAATAGTGAATTTTCTACGTTTATAAATATAATTTTTTTTAATAAAAATATAAATTTTGCTCAATTTAATCTTTTGGCATTGTGTCAAGTAGCACGTGCAGCTAACATCATTTAGCTTGAAAATGTTCTAAGAGCCGCCTATAAACAAATATTTCCATCTACTTTTGACTTTGCCCTAAAATTGTGGATAATAATTTCTAATAAAATGGGATTATTTTATTTTATATTATTTTAAGGTACTAGTTACTTTTGAGATTAATTTATTTTATTTTTATATTTTGTGATGGAACTTTTATCATATAAAAAAATGAGATAAAATAATCTCGTGAGATGTTCTTATTTATCTTATCCAGTATATTATAGTTTTTATGTTTTTAAACTGACTGATATTATTCTTTTATTTGACAAGTTAATACGTAAATAACAAAATATACATAAAATAATATTCGAAACAACATATTTGTGATGATGTGACTTTTATTCATGTAATTATATTAGGTTTATTACAAGCCAATTATATAACTCGACTACTTCAAGCAATTTAAAAGAAAGTGACTCAATTTTACTTAGCCTCTTCATCACTTGAGGGCATCTCCAAGGGGAAATTTGGGCTAGTAACACCATTAGGACAAAAAACAACCCTATAATTTGTACTACCACTAGGGCAAGTAAATGTGCTAGTAGGATCATCTTGTGGGTAGCTATACGCATCAGGACATCTTTGTTTGAAAAATCTCGACAAATCAGTAGGGCCACATGGACCTTGTGTGCAACAATATTGTTGTCCTCCGAACGTGGTACAAGGATTGTTACATCCTCCGGGTACCCTAAGTGAACCAGGACATTCACCATTTATATTAGCCGTACAATGAATTGCATGGCATTTCCCTCCACTAGGATTGGTCGGGGCGAAAGTCATTGGAATATTAAATCCATCGACTAAAGAAATGTCCCAGAAATCTAGGTTGCTAAACTGGTCCAAGGCGTACTCGGCCAGGGTGTTTGGTGGTTTGCCCCACCCGGTACATTGCAAGACCCCACCACAATCACCAGTCTGACATGAACCTCTACCAGCACCATCAAAGTTGCAATTCGTACGACCCCATATACGTGCCATCTTAGTGCCCCTCGGTGCATTGATGACCCATGTTTGGCCTCGATCAAGACGTCGACCACCGCCTATTGGGGTCGACGCCGCCCAGACGGTGTATGGACAGTTGTTGCGTACCTCGAAAGTGGCAGCATAAGTGTAAGTCACAAAAGCAAGAAGGAAGAAAACAAAAGAAGATCTCAAGTAGGCCATGTTTGTGGAACTATTTTTTTAAATGTAAGTTGTTGGATTTAGTGACAAATTGTTAGGGTGTTTATATAGGGGTGTGGCGGCTTTTGAAACTATGTATATTAATCACAAATATCAATATCACAATTGACTAATCAAACGAATAATATTGAAAAAATACTTAAGTAAAGGTTGAGGTGGCTCAACATATAAGAGCCGCCTAGTAATTACTCCATTACTTTATATAAAAATAAATATTTAATTATTATTTTTTTCAATTTCTTATTTAATATTAAGGTATTTTTGTCCTGATTAAATTTAAATTTACAAATTAATAGCATTTTTTATAAGATTTTTTTATTTTCAAGATTTACAAATTTTCATTTCTAATTAAGAATCGAGAAATTTCGGTCCGCTACAATCCATATATGGATAACATAGGGACAAATTTATTTGGTATAACGTGATAAGACTTTTTCTTAACTTAATTAAAAGAAGTAACCATTATTTTTCTTTCATTGACAGAGTTTCTTTTGTACTCC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 36

Page 37: Introduction

GEN 3040

CATTAAATTATTTTTCATTTTGTGTGATGTTTAAAATCATAAATAAAAAAAAAATAATTTTATTAATTCACTTTTTTAAAAAAGTTAAAAATTTATAAATAAATATGAACACTTTCAAAAGAGTCAATTACAAGAATAATATAAGAAAAATAAATTAATATTTTTTTATTTACTAAAATAATTAACTAATATAAAATCAATGGAGTACTTATCTTGTCATTTTTTATTTCGTGTTCAAGTTTCACGCGTATTATTTGGAGGCGAAATTTATGGATCGCAATCAATAAAATAAGGGAAGTTAATCTAATTAGGTAGCATCTAAAATCTCAATTAAAAAAACGAAAAATTACATTAATGACGTATAAAAACTTCAAATAATTACATGTTTTATATTCAATTCAATTCAAGAAATATTATTCTCTTAAAATATAATACAGCCCATACCCTTTCCCATTGATTAAGGTTGATTATTTTTGCTTAATACATATTAAATCAGTATTATATATTATATATTGTATTGATATCATTAAAATTTATAATTTCATTTAATTGATACTAAATCAAATATATGTATATATATATATATTGCCTATGTATTATCACTGGCAACCTAAGATCTATTTGTTATTAATTTTCGTCGCAAATTTAATTATCATTAATTATGGAGTTAATTACTTGCAGTGACATTATGGTCGCCACTAAAAATATTCAGTGTCGATTTAAAGAGGCTCATATACATGACAGCGTGAAATGATTCTCGAGGCATTATCAATACTTTTTTGTTTGTTTATTGGTCTAATTTAGGCTTAATACTTTTCTAAAAGTTATTTTTTTTTTAAAAAAAAAGAACGATAATTCAAATGCACTGTTGTTTTATACATCTAAGACAAAAGTACACATGAGATGTAACTCTTATTCCGGTCTTTAAGTTTGTTACGAACTCTTATCAATAAAGTAAAAGTGAATATTTAAAAAACTGAGCTATTAAGATTTTTCATTACGGTAGAGATCAGTTGACTAATTGAGGCGATTTAAAATGAAGAAACTCAAATTCACTTGGCCACTTCATCAGTACTTGCAGGCATCTCCAAGGGGAAATTTGGATCAGCAACACCATTAGGACAAAAGACAACCCTATAGTTTGTACTACCACCAGGGCAAGTAAATGTGCTTGTAGGATCGTCTTGTGGGTAGCTATAAGCATCGGGACATCTTTTCTTGAAAAATTTGGACAACTCTGTAGGGCCACATGGACCTTGGGTGCAACAATATTGTTGTCCTCCAAACGTGGTACAAGGGTTGTTACATCCTCCAGGTACCTTAAGGGCGCGAGGACATTCACCATTTATATTGGCCGTGCAATGAATTGCGTGACATTTTCCTCCACTAGGTTTTGTTGGGGCAAAAGTCATTGGAATATTAAATCCATCGACTAAAGAAATGTCCCAAAAATCTAAGTTGCTAAACTGGTCCAAGGCGTACTCGGCTAGGGTGTTTGGGGGTTTGCCCCATCCGGTACACTGTAAGACTCCACCACAATCACCAGTCTGACATGTACCTCTGCCTGCAGCATTAAAGTTGCAACCAGTACGACCCCATATACGTGCCATCTTAGTTCCCCTCGGAGCATTGATGACCCATGTTTGGCCTCGATTGAGACGTCGACCACCGCCTATCGGAGTCGATGCCGCCCAAACGGTGTATGGACAGTTGTTGCGTACCTCAATAGTGGCAGCATAAGTATAAGTCACACAAAGAAGGAAGAAGAGAACAAAAGAAGATGTCAAGTAGCCCATGTTTGTTGAAATGTATATATGAGGATAACTTATTTGAAAGCACTTTATATAGGGGTGTGGCGGCTTTTGGCACTATGTATATTAATCATAAATAACAATATCATATTTGACAAAAAACGAATAATATTAAAATACGATATAGTAAAGTGGAGGTGAAAAAAGAAAAAATGTATAAGAGCCGCCTAATAAATAATAATTTTATAAATATAGTACAAGTTCACTTTATTTGGTGACAACTTTGTAACGTGACGGAACTTATTTCTCAAAATAAATATTATTTAAAAAAAAAAAACAGCATCTTTCGTGACCAAAGTTTCTTTCTATATTTATCAATTCCATTTTTTTTACTTTGACAGTATTTTGTATACGTTTAAACATAAAAATTAGGTTTTTCATATTACTCGTATGGATCGATTAATTTCAAATATACAAATCAATTCGATTAACTATTATATACAGTGGAAATGATGTTTATGTGACTCGTAATTGATATTTTAAGCGATCAGAATATTTTAATGCATATCTATTAAGAAAATACCTATTGATGCAAAAGTCAACAAAATTATTTAATTTATTTATTTTACGATCGAAGCATGAATACATTTACTAATTAAAGTAAATTGGGAGGAATTGATGGACGAGTCATCAAGCTTATCAAATGTCAATCAACATTAAAATAATGTTAGTTTATGGCTGCTTTTAAATGAACAAGTGGATCATGTACAATTGAGTAAAAAATATCTCCTATAAATATATATATCCCTCTAAAACCAAATAACATAACACACAAATATAAACTTTAGATTCTTAAAAACAAAGCTGAATTATATGGAGGCAAATAAGTCTATGGTGAAGTTGGTTGCTTTCTTGATAATTCTTGTATCATCATGTAATTAATTGACCATTTATTTCTACTTTATTAAATTTATTCACATTTCTTATTTTTATTTGTTAAAAGAAATTGATTAATTATGTAAATTATTTATGAACAGGCTTTCAATCTCTCACTGCTCAAGATTTGGAAATCGAAGTTAGCGATGGCTTAAATGTATTGCAAGTACATGATGTGTCTCAATCTTTTTGTCCAGGTTTGGTTTTTATCTCTACCGTCCTTTTTGGATATATATACACGTACGACGTCCAACTAAATCTAAATAAATTTTTGGATCGAATCTTAGGTCGATTATCAGGGTAATATCTATAGATTCGAGTGTTGGGGTTGAGGTCATATCCCAATTTAAATGTCATGTTTCGAGTTGGGAGTTAGGGTCAGGTACTAGGTTAAATCCCATGCTCGGATACTGAGTCTGTTGTCGAGTTATGTCCTAATTCACGTGTCGAAATCGAGTTCCGAATGAATTTTCGATCAGAAACTATTTTCCTACTCTCTAGCCAAATATAAAAGATATTTTCTAAAAAATATTTTCCTTCGTAGCAAACACACCATAATTAATTATTTTTTTCTCATATAAAATAACTTTTGATTCATAGTTTTTTAGTCAATGCTCTTTTATAAATAATTAGTTGTATTTTTTTTTTATAAATTATAGGTGTGACAAAGGAAAGTTGGCCAGAACTTCTTGGAACACCAGCTAAGTTTGCAAAGCAAATAATTCAAAAGGAAAATCCAAAATTAACAAATGTTGAAACTCTACTGAATGGTTCAGCTTTTACAGAAGATTTGAGATGCAATAGAGTTCGTCTTTTTGTAAATTTATTGGACATTGTTGTACAAACTCCCAAAGTTGGTTGAACAAAATTAATTGATGTTATATCATATGTATCTAGCCTCCACAAAAATAAATTGGAGATGTATGGTTAAAATTTCCACTATATTTGGTGATAAATAAATGTGGCTTTTAATATTAATATGATTTTTGTTATATATACAATTACTTCTACTATATACCAGTGAAATCTCATTAGCATTGGGAAGGTGTTACGTACGCAACCTTATCTTATGCTAACTATCATCAGGGCGGTCTTAACGCTATGCAACTAAAGGGGGGTGGGGGGTATTTCTAAGAAAAAATTAATTATAAATGTTATTTTAAAAATAATTATATGTACTATGAGTGGTATGATTTCTACAAGGAAATTGCATATATAATGAGTATATCATCTCAATTGTGACTTTGATGAAAAGTTCTGACTTTTAAGAAAATTTTGTCTTTTATGAAAGATAGTGACATTT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 37

Page 38: Introduction

GEN 3040

GTTTTATCGAAAAAGGGTGGTGATTTTCGAAAAGAGTTTGTTTTTAGACTTTTAGGAGTCGCCACTTAATTTTTAAGAAAAATCAAGAAAACTTCGTTTCTAAACAACTTAAACAGAAAAAATTGTTTTGAAACATTTTAAGGGTTCGGGATTCCTATTAGTATCTTAGGAAGGTGTAAGGCACCCAAAACACTCCGTTAACTTACGGTTTTCCAATGATTAACTATTTGATTATTTTTATTTTTAACCTTTGCAAATTTATTTGAACTTTTAACATGACTTACTTAGCCAAACTTAGCCAATTTTGAAATATTTATTGTTTATCTTGCCAGTTTACGAAACCATTTCAAAGTTAAGCTTACTCAAGGTTTTATCTAAACAAGCCTTACGAATTTGAAACAATTTATCATTTAGAAGTTAGTTTTAAAAATTTTTAGGTTTGATAAAATAACAGGCGTTCGATGAGGGTTTGGAGTTTCCTAATCCTAAGCTAACGGCATTGAAAACAAAATAAGATAAGCAAATAGTAAAGAGAAGGATAGAGAGAGAGAGATTTGGGTCCAACTCCGGGTTCTGTTATCCTTGACCGAATGTTGGACCCACCTGCCTTTGGGTTTTTTAGCCCATTTCTGTACCTGTCCTAATCTAAACATACAAACATAATATAGAAAACAAAAAAACGACAAGAGGGCTTAGGCCCATTACAAAATATGCAAAGTAAAATAAAACGAAAGAAAAGTCTTTTAGTCCATCGTCTTCAATCTCCATGACCCTTGTCTTTCTACACTTGCTCGTTTTGATGAATTGACTCGATAAAGTTCAGAGTCTTCAAGACTCTCTCGAAATGCGAGGAGGATAATGGGATTCCCATAATGGACTCACGGGGAGTTTTCACATGTCACAAAAAAAACGATAAAATGAAGATTGGCATTTTTAAAACTACAATTAGAATAAGCAAGGTTCTAAAGCATAACAAGGATATCTTTATAATGCAATACTAATTAAATCTTCATGAGAAGCAAAGCCAACATCCAACAACCTTAATGGTACTTAGAGATATGCAAATCAGTCAACCTAAATGACACTACATAACGATACGTAAACTGTGCGGCGAAACCCCAAGTGTAGTGAAACCTTCACACAAAATTGTTGGTAATTTCCAAATTTTATATTTAAAGAAAGGACTTAGCATGGGATAAATAACGAATACAAATAGAGATGCATCATAAATTGAGTATCAATTAGCCGTTTTGCGCTTCAAACATTTGAAACAAACTTGAACCAATCATAAGAGGACAGCCTCATCAAATAAAACAGGATCCCGATCACCTTATCACCCAAACATCTAACACACTAAGCGAAAGGACCATTCAATCGACCTACTACATGGCTGTGACCTCGGACAGCTAACACAAGTGGCCACCAAGATAAAGATGGCCTTTCAAACCACAGATTTCCAAAATCCACACCACCCATACGGCATACAAAACGAAACTTTCATGCAAGACGCCTGACAAACATGATATCTAAACAAGATAACAAGTTTGGGAGATAAATACATTATAGAGAATAAACAACCCTCATATATTCTTAAAGCGATACTCGAACCAAAAGCTTGAAGAAGGTTCGATTGAAAAGAAGCTTACAACTACAACTCCCGCGAGATAGACATCATGAGCAAGCTTGGACGAGACAACAAGGGAAAATTACACTCTAAACATACATACATGCATACAAACAATCCAAGAGCCTGCTTTAACGAATAGACATCGATACTCGGATTAATGAGTCATTTCACAGAAGCTCTTGCCAATATGTCTTAACAATCTCATTCCAATCATGTGACGAATAAATAGTGTTCTACGAGATTAAGCTAGCATATATTGAAAGTTCTAAATGGCAAGCAAAGCAGCGACAGTGTTAACAAACCACAAAGACAAGCTCAACCGACCATGTATATTCACATTTTCGAGTATCTCAACATGTTCGACCACAAACTAAGAAAGGGCCAGCAAGTCGATTAACAAACATCACATGTCATACGGCGAAGGCTCCTCGAAACAAATACAAAGTCATGACTAACTCGTGAAGACAAACAGAGAAGCAATGTCGACGAACCATCACATTGGGACAGAATCACAACGTAAGACATATTCCAGACTTAAACAACCAAACGAAATATGATACCTCATGTTGACAGCACCACAAACCACGACTCAACTCCATAAAGAAAATTTAAAATAATAAAGAAAAATGGGACATGTACCTTTCTCGGAGCAGCGGAACAGGGATGACGGCGAGCAGACGACGATGACTTCCCCACGCGACCCACGGCTAGACGTTTATCTCGTTTGTTCTTCGTATTGGAGTCCTATTCCCGAAAAATCCCGACTGTTTTGCTACACGAACAGGACAGGAAGAAATCTTCTTCAAAGTCTTCCTAAACCAGAATTTCTAACCAAATTTTCTAACTTCTTTTGGTTATCTCAGAAACAATGGAATAAACCCAGAATTTCAAACAACGTTCCCAACCCAAAATTTCCTAACAAGAGACGGAAGAAGCAGAGTCTTTTTTTTTCGAAAATTTTCTCAACCAAAATCCCTAACCCTTTTCTGTGTCTTTCTTTTCTTTTTCAAAAATCTCAAATATCTAACATTCTTTCAACTCCCAAAATTCCCAGCGCTCCAAGTTCTGCTTTTCCGAATTTTTCGATGAAAAAATCTCTCACCCCTTTTCTGATGAAGAAAAGGGGTATTTATAGAGGAGAAGAAATAGCCCCCCTTTTCTCCTTTTGAAATCGAAACTAAATCCCGTTTCGGTCAAACTTCACATGGCACAGGCGTGTTGGAGAGGATGAAATTGACACCTCGCAGACCTACTCGACAGCGTATGGGTCTCTGCGCTTCCGTTTGGTTCACGCGAAGAGAAAGATGGAAGGAGAAGAGCGTGGGGTTGTTGGAGTCTCTCTGTTTTTCGCGTGATTTCTGGGGAATTATGGGTTTTTGTCGAGAGGGGAAGGAGAGAGCGAGCGTGGGGGAGACGCGGGGAGAGGATAAGAGGAAGGGAGAGAGCGAGGAGAGAGCGTGGGGAGGGGGTCGCGTGGCTGCGCATCTGGGAATTGTTTGGGAATTTTGGCTTTCTGTTTTGCTGGGTTCGCGTGGGGTGAGGGGAGAAGAAGGAAAGGGGAACGGGTCGGGTCGGGTAGGATGGGTTCTGGGTCGGGTAGGATGGGTTATGGTGAATTGGGTTGGGCTGGAGAATAAAAGAATTGGGTGTTGGGATTAATTTGTTGGGCTTGTTTTAAAAATGTGTGTTGGGTTGGTATTTTTTAAAAGAAATAAAAGGGGATGTTGGACTGGGTTGAAGAGGATGGGCGTCTGGGGGGAATTGGGCCTTGAGTCTGTGGGCTGTTGGGAATATTGGGTGTAAGGAATTTAGGCAAAATGGGTTGAGGACGAAAGTGGGCCTCGGAATAAGAGTGGGCTGATGGGTAAAGGAATGGGTTCAAGATATACATATATACATATACATATATGTATATACATATATGTATATATGTACATATATTTATTTTCGCAAGGTTTATAAAACTAACTAACTCATACCGACGCGGGTCAAAATTGGGTGTCAACAACTGTCCCTCATTTTACTTGGGTGGATGCAAGCAACTCGAGGAAAATGAAGTTGACCGAACCAATTTTGACCAACTCCACTCGCCTTCAGGAAGAAGGATTTGTCAAAGAGTTTAGGAATTATGGCCGAACCCTTGCAAACGGGTCTCCTACATACCTCAGGCTATATGAGAATTCAGGCCACTTGTAGTTCGATAAGCTTGATTTACCGCACGATTTCAAAGAATCAAAAGCCACCTTGACACCGAATTATGAAGGTATCGAAATGCTCGAGAATAGAATTCGAGAGCGAATGTTGGAATACAGCTGAGTTTTCAACATTGAATCCGCCTACATACCCTTAAACCTTCGGAATCA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 38

Page 39: Introduction

GEN 3040

GGCTGTGTGTAGTTCGACTCGATCGTTGGAAAAGGAAAACTATTATGCTAGCTAACCTTCGGTTTTGGACGAGTGACACATTAAACGAGTATGAAAAAGAGGCTTGTAACCTCTTAGCCATGAATGTGGCGCGCTTCACATCCATAATTAAGAACTTACACGGACATAATTTCCAAATCTTTTGGCGTATTGTCACTCAGATTGGAAACTTGCTTCCGATAAAACAAAACTTTCGGAAGCGAGACAGAAATTGACAGGACTAGAGAAAACTCGTATGCCTCGAGAGGGTGTTTCGATTGTGGTGGAAGTCGCTCCTCTGCTCGCGTCCTAAGAGTTTGACACTCCTCTTTGGACTGTGTCCCAGTTCGCTGCTAAGAAAAAGTTCCACGTTGTGCTGCATCCTTTTTGATGTCGTCCGCACCTGTGGTTTTTGGAGAATTCATCCTTTTGGATGCTCTCGACAAAGACTGAGATAAAACTCGTCAGCTTAAAAATGCAATTAGGATGGGAGACAGTGTCTTTTACCATGTCGACACTTCATTGTTCTCCTCCATGTTCGACGTCGACTCGATCGGTCTGACGTCCAGCAAATAAAGCTTATGCCTTGTGTTTTGCAATATCATCTTCAGTGTATTCCATACTTGCTGTCAAAATAAATAAATAAATGCTGATGCGATATTGATGTTTCTTTTGTTGTCATCTTCGATGCTCTCCATGATGTTTATTTTTATAAGAAATAACATAATGCAGTCGAGGCCAATGGATAAGTATTGCTCTTTCTTTATCCAAGTACGTCTTCTTGCGGGTCATGAATATCTTCCCGCGATACGTAAATTCTCGCGAGATATGAATCTTCTCGCGATGCATAACTTTCAGCGAAGAATAAAATCTTCTTGCGATGTGCAAACTTTGACGAGGCATGAATTCTTCTCGTCGTGCATAAATTTTGACGAGGCATGAAATCTTCTCGTCGTGCATAATTATTGACTCGCGATGCATGATTTCCGCGATGCATGATTCCGCGATGCATGAATTCCAGCGAGGCATAAATTCTTCTCGCGATGTATAAATTTTGACGAGGCATGAAATCTTCTCGTCATGCATGATTATTGACTCGCGATGCATGATTCCGCGATGCATGAATTCCAGCGAGGCATAAATTCTTCTCGCGATGTATAAATTTTGACGAGGCATGAAATCTTCTCGTCATGCATGATTATTGACTCGCGATGCATGATTCCGCGATGCATGAATTCCAGCGAGGCATAAATTCTTCTCGCGATGTATAAATTTTGACGAGGCGTGAAATCTTCTCGTCATGCATGATTATTGACTCGCGATGCATGATTCCGCGATGCATGAATTCCAGCGAGGCATAAATTCTTCTCGCGATGTATAAATTTTGACGAGGCATGAAATCTTCTCGTCATGCATGATTATTGACTCGCGATGCATGATTTCCGCGATGCATGATTCCGCGATGCATGAATTCCAGCGAGGCATAAATTCTTCTCGCGATGTATAAATTTTGACGAGGCATGAAATCTTCTCGTCATGCATGATTATTGACTCGCGATGCATGATTCCCGCGATGCATGATTCCGCGATGCATGAATTCCAGCGAGGCATAAATTCTTCTCGCGATGTATAAATTTCGACGAGGCATGAAATCTTCTCGTCATGCATGATTATTGACTCGCGATGCATGATTTTCGCGATGCATGACTCCGCGATGCATGAATTCCAGCGAGGCATAAATTCTTCTCGCGATGTATAAATTTCGACGAGGCATGAAATCTTCTCGTCATGCATGATTATTTATATTGCCCCCGTGAATAATAACGGCAAGACGACCTCCAAATAATATATCGACTTGATCGATAATGATAAAAATAATAATTGCATAGCTGTCTCTTCTGAGGTAATGCATTGTCTTGATCGACAATAATCATCTTGCTTTGATAACGCGATGCAAATAGCGTCTTCTATAGAAATCGTGAAATCTTCGCTAAGATTCCTTTTTGATCCACCTTTGAATCTGGCCAGGCACTTTGTAAATTTCATGTATTGGGGAACGGCTTGAAATAAACAAACACATTCACAAGCATCTTGGCATAAAGATATGAAATGCTTCTTGCTTTCAATAAAATCACCGGCTTTGGAAATAGTTTTCGCCTTCTCCGTATTCATCAGTCGGAAGAATTCGGCCGATTTCGAAGTGAAGCTATAAACCTGCATCCACAGAAAAATTGTCAGTTTTAAACATACTGATCTGTGTCGATCCCTTCGATTGTTTGCTCCTTGTCTTGACGTCCTCGGTTGATTTCCCGATTTCTCGACTCTTGATAGATAAGAAAATATCTTATGCCAAAACAGATGAAATATAAAAGGGAAATTTTTAAGAGAAGCAAGAAATTTTTTAAGAAATAGTATCTCGAAAAAGTCAATGCCAAGTCATGTCATGTCAGGCTTAACGAGCTTCTTTGAGTCCGACGCTTGTCGAATGAATTTCAGAGGCATTCCAGACTTGTGGGGAAGTCCCATGACGAAATTTTGAGGCCATCCTCAAAATTTCTGTCCCAGTTTAGTATCTTGTTCATCTTTCCATTGTAACCGAACAGATCTCGGAATTTTTGAGATCTTCTAAAAAATTCTGTCCCAGTCGCAAATTCGAAATGTCTTTAGTTTGATCCGCTGAAGAGAAGGTTCCTTCTGGAGAATTTTTGAGTCCCTCTCAAAATTCTGTCCCAGTTTCTATTGTAGAAAGGGGGGAATAGAATTTACGAGATATATGACCGAGCCCTTGTGGCGCCTACGTATCCCGTTGAGGCAGGAATCAGGTCAAACGTAGTTCCCCTTTAAAGGTTAACAAGAATAAAATTTACAAAGAAACCGACCGAAGCCGACAAAGGCCGCCTACGTATCTCATTCTTGAGAATTCAGGTCGAACGTAGTTCAGGTCGTAGAATTTGGTCTTCTGTTTTGACTTGTTAAAGAAGTCTTACTTCAAATGAAATTTTGAGATCCTCTCAAAAATTCTGTCCCAGTTTCCATTATGGAGAGAAATGGAAATCTTCTGGGTATATGACCGAGCCGTTGTGGCGCCTACGTATCCCGTTGAGGCAGGAATCAGGTCAAACGTAGTTCTAACGTAAGGAATAATCAAGGGGAATAAATGGGGTGACCGAAGCCGACATAGGCCGCCTACGTATCTCATTCTCGAGAATTCAGGTCAGACGTAGTTCGTTACAAGAGGGATAAGAATTCAAATTCGAACCAATACAAGCAAACAGAAGATTACAAGTAAATGGTGGGAAATGCAAAACGAACTTCACACGTAATATCTCTTGACAGCATCTGAGTTGATAGGTTTGGGCCATACAGCGCCATCCATCTCTGACAAGACTAAAGCACCTCCAGATAGTACTTTGCGGACCATGTAAGGACCTTGCCAGTTTGGTGCGAACTTTCCTTTGTACTCGTCTTGATGAGGAAAAATACGCTTAAGAACCAACTGACCAACTTTAAAATTTCTGGCTCTTACTCTTTTGTGAAAAGCGCGAGTCATTCTTTGTCTATACAACTGGCCATGGCAAACGGCAACCATCCTCTTTTCATCAATCAAGGCTAGTTGATCAATCCGTTTGCTAACCCACTCAGCATTACTTAGCTCAGCTTCTTGGATGATCCTCAGTGACGGTATCTCGACTTCGACAGGTACGACTGCTTCTGTTCCGTATACTAGCAAGTATGGAGTAGCCCCAGTTGATGTTCTGACCGTCGTTCGATAACCCAGTAGAGCATATGGCAACATTTCATGCCAACCCCGCTGCTTGTCAATCATTTTCCCCAGAATCTTCTTGATGTTCTTGTTGGCGGCCTCTACAGCTCCGTTCATTTGAGGGCGATAAGCAGTTGACCTTCGGTGAATAATCTTAAATTGTTCACATATCTCTTTCATCAGATGACTGTTGAGATTTGCACCGTTAT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 39

Page 40: Introduction

GEN 3040

CAGTGATGATGGATTCTGGAACTCCAAATCGGCAGATCAGATTGTTGCGGACAAAGTCGGCCACCACTTTCTTGGTTACCGATTTGTAAGAGGCTGCTTCCACCCATTTGGTGAAATAATCGATGGCAACCAAAATAAATCTGTGTCCATTGGAAGCGACCGGTTCTATAGGGCCGATGACATCCATTCCCCAAGCTACAAATGGCCAAGGTGAGCTCATAGCATTGAGTTCGTGAGGTGGCACCCTTATCAAATCTCCGTGCACTTGACATTTATGGCATTTTTGCACGAATTTGCAACAGTCGTTCTCCATAGTCATCCAGAAGTAACCGGCTCGAAGGACTTTTCTTGCCAAGGTAAGCCCGTTCATATGTGTGCCACAGACTCCAGCATGTATCTGTTCAATAAGCCTCACGGCTTCAGCAGCATCCACGCATCTCAAAAGACCCAAATCTGGAGTCCTCCTGTAAAGGACTTCTCCATTCAAAAAGAAATTGAGAGCCATACGACGTATCGACTTCTTTTGATTAGATGTGGCGTCTTCTGGATATGTCCCAGACTCCAAGTACCTCTTTATGTCGAAATACCAAGGCAAACCATCTGGTTCTGATTCCACGTGTGAACAATGGACTGGATGCTCCTTCAAATCTATATCCAGCGGGTCGATGTAATCAATATCCGGATGTTTGATCATTGAAGCGATGGTGGCAAGAGCATCAGCTAATTCATTCTGTATTCGGGGAGTATGTCTGAACTCGATCTTGCGAAACCTTTTACACAGGTTTTGCACATATTGTACATAAGGTACAATCTTTGGGTTCTTCACAGCCCATTCTCCTTGAACTTGATGGATCAAGAGGTCTGAATCTCCAATAACCAGTAACTCGTAAACATTCATGTCAACGGCCATTTTCAAACCGAGAATGCAAGCTTCATACTCAGCCATGTTGTTTGTGCAGTTGAACCGTAGCTTAGCCGCCATAGGATAGTGCTGACCAGATTCTGATACCAAGACTGCTCCAACACCTTTACCCTGGTGATTCACCGCTCCATCAAAGAATAATCTCCAACCTGGATAAACTTCAGAGATATCTTCACCCACAAATGACACTTCTTCATCGTGAAAATATGTCTTGAGAGGTTCATACTCTTCATCAACGGGATTTTCCGCAAGATGATCAGCCAAAGCTTGTGCTTTTATTGCCTTCTGAGTCACGTACACGATGTCAAACTCACTCAACAGCATTTGCCATTTAGCCAGCTTCCCGGTCGGCATCGCTTTCCGGAAAATATACTTCAATGGATCCATTCTGGAGATAAGATATGTAGTATACGACGACAAATAGTGCCTCAGCTTCTGGGCAAGCCATGTCAGAGCACAGCATGTTCTTTCCAGCAAAGTGTAACGAGACTCGTACGGAGTAAACTTCTTGCTGATGTAGTAGATAGCTTTTTCCTTCTTCCCTGTCTCGTCGTGTTGACCAAGTACACATCCAAAGACGTTATCCGAGACAGACAAATATAGCAACAAAGGACTCCCTTCTCGCGGAGGAACCAATACTGGTGGGTTAGACAAGTAGCTCTTGATAGCGTCGAAAGCGGTCTGGCACTCCTCAGTCCACTTAGTTGGGGCATCTTTCTTCAGCAACTTGAAGATAGGCTCACATACCACCGTCGATTGTGCTATAAAACGGCTGATATAGTTTAACCTCCCTAAGAAGCTCATCACCTCTTTTCTCGTCTTCGGCGGAGGTAACTCCTGAATTGCTTTGATCTTGGAAGGGTCGAGCTCAATACCTCTTCTGCTGACTATAAACCCTAACAATTTCCCAGCTGGGACTCCAAAAGCACACTTGGCGGGATTTAGCTTCAAGTTGTACCTACGCAAACGTTCAAAGAATTTTCGCAGGTGTGTCAAATGATCCGAACTCTCGCGGGATTTAATTATGACGTCGTCCACGTACACTTCAATTTCCTTGTGCATCATGTCGTGGAAGATAGTCGTCATGGCTCTCATGTAAGTGGCACCGGCGTTTTTGAGCCCAAACGGCATCACCCTGTAGTGATATACCCCCCAAGGTGTGATGAAGGCCGTTTTCTCCGCGTCTTCTTCATCCATCAGAATCTGGTGATAACCCGCGTAACAATCCACAAATGACTGCATCTCATGTTTCGCACAGTTATCAATCAGAATATGGATATTTGGCAACGGGAAATTATCCTTTGGACTAGCCTTGTTGAGATCTCTGTAATCAACGCAAATCCTGATTTTTCCATCTTTCTTGGCGACCGGAACGACATTCGCCAACCAGGTGGGATATTGCGTTACTTCTACCAACCGGGACTCTATCTGCTTGGTGATTTCCTCCTTAATCTTCAAACTCAACTCAGGCTTGAATTTCCGAGTCTTTTGCTTCACTGGCTCGAACCCTGGGTTGATAGGCAGCTTATGAGATACGACATCGGTACTCAGCCCCTGCATGTCACTGACCTTCCAAACAAACACATCGCTGTATTCGGCAAGCAAATGAATCAGGCTCTCCTTCTGAGTTTCATTCAGGTGAGTGCTGATCTTAACCTCTTTGACACATTCTGAATCTCCCAAATTTACCGTCTCTGTTTCCTCCAGATTCGGTTTATGTTGATTCTCGAACTGTCGAAACTCTTCTACTACATAGTCTGGTTCCCCACTTTCTTCATCGTAGTCGTCAGCCTCATCGTCATTTACCTCATTTTGTTCGTTTAGCTCATGACATGACATGACGTTGGCAGGTTTGTAACTTACGTTACTGAAATCATAAACAGACAAAATTAGAAAAGTAAAACATAACAAGCAATCGAATAAACAAAGTCAAAATAAGGAGGCTTTCATTTAAATTGAATCAAGATGCAAAGTTGGGTCATGGCACAAGGCCACGTCGCCCTTTAAACAATCATAACAAAATTCAAAATCAAGAAAATGCAGAAATGGTGGGTCTCGGGCCTCTTCCGAACGACCACCGATTTTGGCATGCACAAAATAATTCCTTTCTACCCAAAAGTCCGGGACATCAGGATTGGTGTGGACGTCCAATTCTTCAGCATCTCCCCCGGTTCAGCATCGCGAAAGCCAGCTGGCTCGGCCTCCTCCTCTATGACGGCATTGATCTCCTTGAACAGGTCACAGATTCCTTCCCCGTCGTCTTCAGGCTCGGCATGCTCCCGAATTGGAAAGGAGTGATAGAGATGCGGGATCGGCTTAGTCAACTCTTGATCCCTTCTCCTCTTCATCTTCATATCATCATCTGTGGGGATGTACCCCAAACCATACTTTGATCCCTGAGCAAGGACCGGAACAGGCTCAATAATTCCTTGGGAATCTCTTCCCAATCCGAAACCTGGCTCGAACCCATTTTGCAACATCACCGTGGCTATCATTTTGTACACGGTTGGCATGGAGTTGAGGGCCAAATCTTCGTTGGTGGCATTCACCAGCTCCACCGTGTAAAAGTCTGTGCCTTGCGGCATCTCATCAATGACCGACACCTGCTTGCCCGAGCGACTTCCCTCGCCGTGAATCACTAACTCTTCATTTTTCCACACAAGCTTCATCATCTGATGGAGAGTAGAGGGGACGGCTCCAGCCATGTGGATAAACGGCCTTCCCAAAAGAAGGTTGTAGCTAGTGTCTATGTCCAATACTTGGAATTGCGCACTGAACTCTGCGGGGCCCATTTGAAGGGTCAAAGTCACAGCCCCCAAGGTGTCTCTTTGCACACCATCAAATGCCCTTACATTGACCTGATTCTGCTCCAGTTTCCCAAGGTCAAAATTTAGTTGCCTCAACGTCGTCAATGGGCAAATATTCAAACCGGACCCATCATCTACCAAAACACAGTTGACAACCTTTCCACGACATATCACGGTAATGTGTAGCGCTTTGTTGTGGGATCTTCCTTCGACTGGCAACTCATCGTCACAAAAGCTGATCCGGTGCCCCCGAATGACTTG

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 40

Page 41: Introduction

GEN 3040

ATGAATCATAGAAGCCACGTTATCACTACTTGTACCTGAGGGTACGTATGTATCATCAAGAGCTTTCATTAAGGCCTGCCTGTGGGACTGAGAGCTCATCAGCAAGGCCCACACGGAGATCTGAGCCGGAGTCTTCTCTAAATGTTTGACGATGGAATAGTCCTTCGGTTGCATCCTTCTCCAGAATTCTTCTGCTTCCCCCTCGCTTATCGGCCTCTTAGCATGGTCCTTCTTCTGTCCTCCGAAGAAACCTCATCAGGAGTGTAGCACCTTCCGGACCTGGTCATGCCTTGGGCCACGGCAGTTTCAATGACAAATTTGGGCTTAACGAGGGTTTTCGAAGGCACCAAGGCAACAGCCTTTGCAGGTGTCAGAATAACAAACTTCCTCCTCTCTTTGACACTTAAAGAAGCGACAGCCCTTTCCAAGTCCTCATTCATAATAGGGGTAATCATCTTTGTTCCACACCCGTCTTCATCCGTTTCAATCATATTAAGGTTGTCACCTCCATGATTCGGCAACGGATTTGTGTTGACGTTTGGCACCGCCGGTTGAAGAGAAACTACCTCCTGATCGATCAGGTCTTGGATTTTGTGCTTAAGGTTGATGCAATCTTCCGTGTCATGTCCAACACGATTGGAATGATAAGCACACCTTTGATCTGGCCTGTAGAACTTTGAGTTGACATCCCCGGGTTTGGGCCCCACAGGGTGGATGTATCCATCTGCGGCTAGCCTCTCAAATAGTTTGGTCCGGCTTTCAGCAAGTATGGTAAAGTTCCTTGAAGGCCTTTTTTCAGATCTCGGACGGGAAGTATCATAATTGCTTTGCCGAGGGGGAGGCACCCGCTGATAATTTTGGGTATTTGTACGAGGGGCTTGGCTCCGGGGATAAGGGTTTGCCTGGAAATTTGGCATTGGAGCTTGGTAATTCGGGAGGGGGTCCTGGTATAATGGAGCTTGGACTTGATAAGTGGGCGGAGGTGGTCGGTAGCTCGACTGTACGTTGGCACAGTTGGGAGCAATATTCTGATAAGGGGATGGGATCTGGTATGGTTGAGGGTAATTTGGGAATATGGGAGAAAAATTTTGGAGATTAGAGGGTGGACCTTAGTACGACGAAATTTGAGCATTCTGATAGGTGGGGATAGTATTGTGGTTATTAGGATGATTGGATTGTGGGTAGTAGGATCGGTGAGACTTTGGGGAAGGCCTGGAACGATCTTGGGAATGTGACGGGTTTTTAGGGGTTTTTCTTCCCCCATACGAGACAGCGGCAACTTCCTCTCTTCGCTTTCTTATCAATCCTGAAGACCCAGGCGACGCAGATACACGGGCTATCTTCCCCGATTTTAGACCATCTTTGATAGTCTCACCGACTTTGACTATCTCAGCGAATTTAGCTCCGACCAGCAACATGATTCTATCATAGTACTCAGGCTCCTGTACCCGCACGAACACTTCCACAATCTCTTTCTCGGTCATGGGTGGCCTTACCCTCGCCGCTTCTTTCCTCCACCTATAGGCAAATTCCCTATAGCTTTCAGTTGGTTTCTGCTTCATCTTCTCTAGAGAATACCGATCGGGCACTATTTCAACATTGTAGGCGAATCGGTCAATGAAGTCCTTAGCCAATGCATTCCAACTGGGCCACTGCCTGGTTTCATGTGAAGTAAACCACTCGAGGGCCTCTCCACACAGACTTCGGCTGAAAAGCCGCATCAACAAGGCCTCATCCCTGCCAACTCCCACGAGCTGGTCACAATACGCTCTCAAATGCGCCATAGGATTGCCCACTCCTCCGAAGGTGTCAAACTTCGGAATTTTAAAACCTTCGGGAAGGTTCAAATCTGGATGGATGCACAATTCTGCGTAGCTAAGTCCGACGGCGTCTGGGATGCACTGCAGCTCTTTCATAGCCCTTTTGATTTCTTCCTTTATATCGATCTTCACTTCTTCTTTCGCCTTCCATTCTCTTTCCTGTTCTTCATAGTGATCCAACTCAGAACCGTGCACCTCGTGCTCGGCAGGAACGGGAACTTGGAAGGTGGCTCTTTTTGGTAGGGGTGGAGCTATAGAGGGACTTGGAACGTTTTGGACGGTCTGGTAGTTTTGTGGGTAGGCCTGAGGATTGGTGTTCTGACTCAAATGCGGATGAAATGCTTGGGTATTGAAGGCAGTTTGAGTTTGGTTTTGATTTTGTGGCGGTGGGGCAGTTTGAGTATTCTGAGGATGGGGTGGTATTTGAGGGTGGTTATTTGGAAGAGGTGAAGGAGTTTGGTAAGATGCAGAAGCGTATTGAGGGTTTTGGGTTGTGAGGTCAATCACAGACGGATTATGCACAGGTGTAGAAGGCTGGTTCTGAGTTGGATCTATGTTTGATGAAAGGAAGTAGACCGGAGGTCTTGCATCAGCAATATTAGTACCAAATCCAGGTGGAGGCGCATCCTGTCTCCGTTGCATTTCAACTTTCATTTCAGCAATCTGTTGCATCAGTTGCATGATCAATTCGTTCTGTTCCACTACAGTGGGCTGAGCCACCACAACGTCTGTAAGACGCCCTTCGTCATTGTTGTCTCCCATAACTATTTTTCCTTTGTCTGAATTTGATCGAGGGAAGGAATCTGTAGGACCTTTTGACCTGGTGAAGTAAGAATGATCTGCCAGCTTTATCGACACAGATCAGTAAAAGGTACCTGAGAGGTAGAAACAACTCAAAGGCAAATTTGTTAGTGCATATCCAGAGTACAAAATGCATAATATCGCACATAAAACAGATAAGCACACAAGTCGTTTTGTTTGAGGATATCTTAACCCACGAATAAGGACGGATATATATTTTGAACAAACAGGAATTTGTTTGTTTGGAGATATCTTGGGAAATCTGAATCACGTTGAGCTATGGTCTTCTTGCAGCTGCTTTGCGACGTCTGTTGATCTTATCTTCGTATCTCCGAAACATGGAGGAGAATGAAAATTTTCTGTGATAGGGGCCGGGCCGGGAAGGCTGCCTACGTATCTCACGGGGAGAATTCAGGCCCAACGTAGTTCGGATTTTAGGATGAAAATTTTCCATTCATTCTTTCGTTGTGATTACAAGGGAATTGAAAAACAACATTGAGATTTCTAGAAGACCACATTTGGAGATTTCTTGTCTTGTGGCTGCGCCATTCCTTTCCTTTCATGCAGTCGAAAATGGAGGCTTGTAGACGATTTCTTCTTCATCATCCTCCTCAATCACTTCATGGCCGGGGCCACTGTTGCCTGTTCCCTGATCACGGGCGAGGTATTTTCCGCCCTTTGGGTCGCTATGGGTTGCCAATTCCTCGTCGAGCGCCGTGCTTCTGTCCAACAACACAGCAGTCTCGATATCTATCTTCGCCTCTCTCGCCTTTCCTTCGTGTATCTCCAAACGAAGCAAGTTCAACCTTTTCCTTAGGCTTTCGGTTTCCATCTCAACCTCCTTCTTTCTCTTTATCTGTGACGCCAGATCTTCATCCAATGTTGTGACCGCAGCTTTGTAGATCGCCGTGGTCGCGTCTACTTTGAGTCCTTCTTCCTTAACCTTCTGAATCTCTCGAGTGACTCGCTCGATCTTTCTTTGCAATTCCTCCTCAGTGAGCTCCGTCTGGACGTTCTTGCCTTTGTCCATAGCGGTGATTCGACTTTCCTGAGATGATAGAGCGGTATTTAGGATTTATGGTATGGATTTTGTGTGAGAAACTTGGGACTCGGGAATTTTGGTCGGACATTTGTAATGTTGGCTTCTGTATACTCTTTTGAATTTTGGCTAGGAGTGGGTTTACGATATCCTCAATCAAAGCAACATAACATATAAACATTTAAACAAATATATCGTGTCCTAAACATGCATGTGACCCTTTTGTGCCAAGGGTAGGCCTAGCGATCTGAAGGATGTATTGCGCCATTTAAATAAGTAATTGTATCCCCGAAACAAATTTCACTAGATGTCATAGCGTTCATTACACAA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 41

Page 42: Introduction

GEN 3040

TCAAAATTAAACGGGATATATCTTGGGAAGGGGATGTAGTGCAAATACACATATAAAATCAATCCCACGCAGGGGAGTTAAAAGGTCGATGATCCAGCTCCTCTTCTGGCCTTCTCCACTTCTTCTCGGATAATGCATTTGCTGGTGAACACGTATGGCTCCGCGAACGCAGCTTTGGAAGATGTTGGTACGGCTTGTAAGGTGTTCATCTGCTCGTCCAAAATAGTGTCCAAACGCATTAGACAAGTCCTTGTGTCAGCTAATTCTTGTGACATTACGGCTAAAACTCTCTGATTTTCTAATTCCCGGAGTTGGTACGCTTGCTCCCTTTGTTGAAATTCCTGTTCTTTCTTATAGAATTCCTGCTCCCTCCGGTCCCACTCTTGCTGAAAATGGTACGCGTGAATCTGAAGCCTTGTCTCCACGTCTTCGATTTTCCGACTTGTGCGTGGTGTCGGATCCACTACACCATGCAAATTATTATGCAACCATGTCTTATATTCGTCGACATACCCAGCTCCATACCTATCTTCGGTGACCTCCCACTTCATGTTGACACGTCCGGCCCATTCCTGGAGGATAATTTTAGCGAAAGGTATTTTGTCCTTCCCGTTGTAATCAACAATGAAAGAGCTGGCATCTCCTTGGGCGGGTATGATTTGGACTTTCCCAAATTGCCGCATTACTCGTGTCGGGTTGTAAGGACGCAATCCTCTGATGCCGGGAAGGACTAAAAATGGCAATTTGGCCCCTTGGGCTGCTATCATCTGAGACCTGAAGCTCGGTAATACCCATTGTATGTCTCCTTCTTTCAAATTGTAAAACACCTTTGCCCACGCATCCCTGGTGGTGTGAATTGGAAATTTACGGATACTCAAGTAGAACTCGTAATTATTGAGAGGACTTGTTTGACTTGGTTCGGCCAAACCATGTAGCTGGGGGTTGTCTCTTTTGCATAGATGCTCCGTCAACCACCATTGAAGTAAGATGTTGCAACCTTGGAAAAACGGAAAACCATTCTTGCACAGGCTCAAAGATCGGTAAATGTCGGACAGGATGACTGGGGCCAAACAGCAATATTTCTTTTCTTCAGATTTTCTACCGACTCCTCTGAACATGGCATTCACTACCATTATTACCCGGGTGTCGATCTCTTTTCCTTCGTCTTTAGGGAACACCATAGTTCCCAAGAGGCATGCGGAGAAAGCGATAGCTTGAGTTTGCCTCCAAGTACTGTGATCATGGAATTCGTCCGGGAACAATCTAAAATAATTGTCGTGCCCCCACCTCTCGAAAAACCTCATCAGTGGAATGTTGTGAGTCTCTAGCCAATTCGCATTCACTAGGGATAATATGTCTCTTAGCTCAGTCTCGGTTGGCCTATCTGGCAGGAGAATGTGATGATCGGGACGCTTTTTCCTTTTGCCACACGTCCCAATCGAGTCAAGGCAATCTTTGATCTCTTCTAATGTGGGTTTTATTTCGACATCTCCAAATCGAAAGACCATCTTCTTATCGTCCCAGAATCGGGTGATGACTTCGATAAACGTGGGCCATACCTTGAAGTCCATTATCGAAGGTAGGTGACCCAAGTAAATCGAGATCTCTCTCTTCTGACACTCCTTAAGATTTTCCCACCATACTCGAAGAATGTCGGGTGCCTGGGTGACCATTTTGAAGTGAAGGAACTGACTTGACATCTACAAAAACATACAACTAGTTAATTTTCAATCCCCCCGATACAACAACTCGTTAAACACGCAATACATCCTTCAAATTTAAACAACAAACATGAGCGTCCTATCGAGCCGCAGGCTCTTTGGACTCAAGGTGGACTTTTCTAATGGGCCCGACACTCCTTGGGTGTCGGGTCATCTTAGGCCAATGCGCTATTTTCGGGATTTGGTTTCGCTCTATCCTAGTTTGACTAAGAGTGGGTTGTATTTAGAGTGGAATGGGTAACCCAAGCGGACTACTTGGGCTGGAACAATTAACGATCGTTGACTATCAAGTAATCAACTACACTCGTCAGGGTGCCCCGAGAAGATTTTTGGTTAACGAACGACTGCGAACCCGCATAATCGTCCTTTAGTGGAAATGTAAAAAAGTGTCGTTTGACGGAAGTATGTATGCTATGTATGCAGCAGTTTAAAAATCTAAACAATTTAAACATTTAAACAGTTAATCACAAATAATCAAATACGTTAATCTTAAGGTTAGAATCCTCCAAGTCCCCAGTGGAGTCGCCATTTCTGTTTTATCGAAAAAGGGTGGTGATTTTCGAAAAGAGTTTGTTTTTAGACTTTTAGGAGTCGCCACTTAATTTTTAAGAAAAATCAAGAAAACTTCGTTTCTAAACAACTTAAACAGAAAAAATTGTTTTGAAACATTTTAAGGGTTCGGGATTCCTATTAGTATCTTAGGAAGGTGTAAGGCACCCAAAACACTCCGTTAACTTACGGTTTTCCAATGATTAACTATTTGATTATTTTTATTTTTAACCTTTGCAAATTTATTTGAACTTTTAACATGACTTACTTAGCCAAACTTAGCCAATTTTGAAATATTTATTGTTTATCTTGCCAGTTTACGAAACCATTTCAAAGTTAAGCTTACTCAAGGTTTTATCTAAACAAGCCTTACGAATTTGAAACAATTTATCATTTAGAAGTTAGTTTTAAAATTTTTTAGGTTTGATAAAATAACAGGCGTTCGATGAGGGTTTGGAGTTTCCTAATCCTAAGCTAACGGCATTGAAAACAAAATAAGATAAGCAAATAGTAAAGAGAAGGATAGAGAGAGAGAGATTTGGGTCCAACTCCGGGTTCTGTTATCCTTGACCGAATGTTGGACCCACCTGCCTTTGGGTTTTTTAGCCCATTTCTGTACCTGTCCTAATCTAAACATACAAACATAATATAGAAAACAAAAAAACGACAAGAGGGCTTAGGCCCATTACAAAATATGCAAAGTAAAATAAAACGAAAGAAAAGTCTTTTAGTCCATCGTCTTCAATCTCCATGACCCTTGTCTTTCTACACTTGCTCGTTTTGATGAATTGACTCGATAAAGTTCAGAGTCTTCAAGACTCTCTCGAAATGCGAGGAGGATAATAAGATTCCCATAATGGACTCACGGGGAGTTTTCACATGTCACAAAAAAAACGATAAAATGAAGATTGGCATTTTTAAAACTACAATTAGAATAAGCAAGGTTCTAAAGCATAACAAGGATATCTTTATAATGCAATACTAATTAAATCTTCATGAGAAGCAAAGCCAACATCCAACAACCTTAATGGTACTTAGAGATATGCAAATCAGTCAACCTAAATGACACTACATAACGATACATAAACTGTGCGGCGAAACCCCAAGTGTAGTGAAACCTTCACACAAAATTGTTGGTAATTTCCAAATTTTATATTTAAAGAAAGGACTTAGCATGGGATAAATAACGAATACAAATAGAGATGCATCATAAATTGAGTATCAATTAGCCGTTTTGCGCTTCAAACATTTGAAACAAACTTGAACCAATCATAAGAGGACAGCCTCATCAAATAAAACAGGATCCCGATCACCTTATCACCCAAACATCTAACACACTAAGCGAAAGGACCATTCAATCGACCTACTACATGGCTGTGACCTCGGACAGCTAACACAAGTGGCCACCAAGATAAAGATGGCCTTTCAAACCACAGATTTCCAAAATCCACACCACCCATACGGCATACAAAACGAAACTTTCATGCAAGACGCCTGACAAACATGATATCTAAACAAGATAACAAGTTTGGGAGATAAATACATTATAGAGAATAAACAACCCTCATATATTCTTAAAGCGATACTCGAACCAAAAGCTTGAAGAAGGTTCGATTGAAAAGAAGCTTACAACTACAACTCCCGCGAGATAGACATCATGAGCAAGCTTGGACGAGACAACAAGGGAAAATTACACTCTAAACATACATACATGCATACA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 42

Page 43: Introduction

GEN 3040

AACAATCCAAGAGCCTGCTTTAACGAATAGACATCGATACTCGGATTAATGAGTCATTTCACAGAAGCTCTTGCCAATATGTCTTAACAATCTCATTCCAATCATGTGACGAATAAATAGTGTTCTACGAGATTAAGCTAGCATATATTGAAAGTTCTAAATGGCAAGCAAAGCAGCGACAGTGTTAACAAACCACAAAGACAAGCTCAACCGACCATGTATATTCACATTTTCGAGTATCTCAACATGTTCGACCACAAACTAAGAAAGGGCCAGCAAGTCGATTAACAAACATCACATGTCATACGGCGAAGGCTCCTCGAAACAAATACAAAGTCATGACTAACTCGTGAAGACAAACAGAGAAGCAATGTCGACGAACCATCACATTGGGACAGAATCACAACGTAAGACATATTCCAGACTTAAACAACCAAACGAAATATGATACCTCATGTTGACAGCACCACAAACCACGACTCAACTCCATAAAGAAAATTTAAAATAATAAAGAAAAATGGGACATGTACCTTTCTCGGAGCAGCGGAACAGGGATGACGGCGAGCAGACGACGATGACTTCCCCACGCGACCCACGGCTAGACGTTTATCTCGTTTGTTCTTCGTATTGGAGTCCTATTCCCGAAAAATCCCGACTGTTTTGCTACACGAACAGGACAGGAAGAAATCTTCTTCAAAGTCTTCCTAAACCAGAATTTCTAACCAAATTTTCTAACTTCTTTTGGTTATCTCAGAAACAATGGAATAAACCCAGAATTTCAAACAACGTTCCCAACCCAAAATTTCCTAACAAGAGACGGAAGAAGCAGAGTCTTTTTTTTTCGAAAATTTTCTCAACCAAAATCCCTAACCCTTTTCTGTGTCTTTCTTTTCTTTTTCAAAAATCTCAAATATCTAACATTCTTTCAACTCCCAAAATTCCCAGCGCTCCAAGTTCTGCTTTTCCGAATTTTTCGATGAAAAAATCTCTCACCCCTTTTCTGATGAAGAAAAGGGGTATTTATAGAGGAGAAGAAATAGCCCCCCTTTTCTCCTTTTGAAATCGAAACTAAATCCCGTTTCGGTCAAACTTCACATGGCACAGGCGTGTTGGAGAGGATGAAATTGACACCTCGCAGACCTACTCGACAGCGTATGGGTCTCTGCGCTTCCGTTTGGTTCACGCGAAGAGAAAGATGGAAGGAGAAGAGCGTGGGGTTGTTGGAGTCTCTCTGTTTTTCGCGTGATTTCTGGGGAATTATGGGTTTTTGTCGAGAGGGGAAGGAGAGAGCGAGCGTGGGGGAGACGCGGGGAGAGGATAAGAGGAAGGGAGAGAGCGAGGAGAGAGCGTGGGGAGGGGGTCGCGTGGCTGCGCATCTGGGAATTGTTTGGGAATTTTGGCTTTCTGTTTTGCTGGGTTCGCGTGGGGTGAGGGGAGAAGAAGGAAAGGGCAACGGGTCGGGTCGGGTAGGATGGGTTCTGGGTCGGGTAGGATGGGTTATGGTGAATTGGGTTGGGCTGGAGAATAAAAGAATTGGGTGTTGGGATTAATTTGTTGGGCTTGTTTTAAAAATGTGTGTTGGGTTGGTATTTTTTAAAAGAAATAAAAGGGGATGTTGGACTGGGTTGAAGAGGATGGGCGTCTGGGGGGAATTGGGCCTTGAGTCTGTGGGCTGTTGGGAATATTGGGTGTAAGGAATTTAGGCAAAATGGGTTGAGGACGAAAGTGGGCCTCGGAATAAGAGTGGGCTGATGGGTAAAGGAATGGGTTCAAGATATACATATATACATATACATATATGTATATACATATATGTATATATGTACATATATTTATTTTCGCAAGGTTTATAAAACTAACTAACTCATACCGACGCGGGTCAAAATTGGGTGTCAACAACATTTTTAAAGAGTTGTGACTTTTATGAAAAATTGTGACTTTAATAAAGAGTCGCGATTTTTATAAAGGTTGTGACCTTTCTAAAGGAATGCAACTTTTTTAAATAGTTGTAAACTTTTTGATAACACACAAAATACATTTGTTCACACTTCTTTGTTGTCTATAAATAGAGGGATTTTCTTTTATTTTAAAACAACAAAAATTCTTAAACCTCCTCCTCTTCTTCTTCTTTTTGCACAATTAAATAGTTATGTATTTTGCTCATGTTGAGTGACTCACTGAAACTATTTCTTATGTAACAGATCCTAGTATGTAATTTTATAGTCACTAATTTATGCTTATGTTATTAAGGATAAATGATAAGATTTGATAATTTTCATTTTCCTTTACATTATTAATATTTGTTATTACATAATCTTGTATGTAAGATAATATATATAGTGTTTTAGTTTTAATGATCGATAGATTTTTAGTATTATTTTATAAATTAATGAAATACACGTCTTATTTTTCCCTTATTTTCAATATTCCCTTTTATTTCTAAAAACCTCTAAAATTCCTTATTTATTTAATGTAACTGAGTTACATATTAATGTATCTAAATCAATATTTAATATATTTGAGCTGCATATTATGTATTCGGTTTATTTTATAATGTATCTGAGCTAGTTCTTAATGTATTCGCGCTGCATATTAATGCATCCAAAAACTAGTATGTAATGTATCCAAGCTACATATTATGTATCCTATTTATGTTATGATGTACCCGAGCTACATGTTATGTATCCGAGCTACATGTTATGTATCCGATTTATGTTATAATGTATCCGAGATACTCTTTAATGTATCTGAGCTACATATTAATGTATCCGATTTGTGTTACTTTTAAGAGATTAATGTAATTACAAACTAAAGAGGGATAGATTGTAACTTCACATTAAAACTCTGTGATTTCTAAAATTTATCCAAAATCTGTAATATTAAATGCACGAATAGTTTTACTAGTATATATATAGCATATCAAACTATGTTTTCACTTTAAGTTGAAAGGGTTTGTGTATATATATTCCACACGCATATATACAAACTTATTATCAATATTATTTTGGTAATTTTTCATTCATTTTTTTCGCAACCAAAATATTCTTTTTATGGGCAAAAAACTACTTTATAATTAGTTCCTCCAAGACAAGTAAAATATTAGTTTTAGCATCCTTAGGGTAAGCATCATAACACCTTTCCTTAAAAACACTTACACTGTTGGATCACATTTACCGGAATAACAACAATATTGATCGGTCTTAAAAATCGTACAAGGATTGTTATAACATCCTCTGCGAGTTTTTAGCACTCATGGACAAATCATGTTAATATCAACTCTACACCTAGTGCCACGAGCACGCCCATTCGATATAGGGCTAAATTCTATTGACACGTTGAATCTAGCGATGAAATAGACATTCAAAAAATTAAAAAAATATATCATCTTAAAACTTCCACTACATAAATTGTGAAAATTTGTAACTCTTTCAATACAGAATTAAATGGATGGAATGATTACAAGGAAAATTTCTATAAGGGAGAAATTTAATAGGTAGCCACTTATATATTTGATAAGTGTAAGACATCATATTGAAGGAGCAATAGAGAAAATTCATCACATGAAATCATGATATTAATTATGGCTATATTATATTACATACTGTCATTTAATCTAAATTACCAAACTGCAATGATGATTACTTCTTTAATTTCTGGAGATTCAAAGTTAATTAAAAAAACTCTGCTAATGAAATTAAGAAAACAGGACTGTTGGTTCATTGGTTAAGTTTGACTATCGATTTAGCGTGTTATAAATAAAAAATACACAAAAACCTAACTGAACCAATCAATAACCACATGGACAATATCCAATACGAGTTATGAACCTGCCAATTTATTAGCACTAGGCATGTTCAGGGATGGTCGACAGTGTTAAAAAAAGACATGTGTCATAGGCCTTCAAATTTAAATTATAACAGTCGTAAATTATTATTTTTTAAAAAAAGTTTAACATTTCATGAAAAATGAAGAAATCATTAGAAGAATTTT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 43

Page 44: Introduction

GEN 3040

GACATATTTGGAATACTATATCTCATGAAAAATAATTAAAAATAAGAAATGATTATACTATCACATGAAAACTAGACGAAATCAATTATATAAAAATTATTAACAATTTAAGTTTAAAACCCCATTTAATTTTTACTTTAGAACGTAATACGCTTGAGCCGCCCCCTCGATTTAACCAGGACGTAAAAGTACTAACGATATTATTGTTTATTCATTATGATAAAACTATATTCAGTTGAAGGGAGACATACATAACACACGTTGGAGATTATTATCACATTAAGGCAACATAGTTTAGCAGACCGTTTAAGATGCTTATTATTTTGGTAATTTTTTCATACCATTCAATTTGTAGCAAAATATGCTTTATTGTGGCATCAATTATGGGCAAAAAACAACCCTATAATTTGTACTACCACTAGGGCAAGTGAATAAGCTAGTAGGATCATCTTGAGGGTAGCTATACGCATTAGGGCACCTTTGTTTGAAAAATCTCGAAAATTTTGTAGGACCACATGGGCCTTGAGTACAACAATATTGTTGTCCTCCGAACGTGGTACAAGGGTTGTTACATCCTCCGGGTACCCTAAGTGGACTAGGGCATTCACCGTTTATATTAGCCGTGCAATGAATTGCATGGCATTTTCCTCCACTAGGATTGGTTGGGGCGAAAGTCATCGGTATATTAAATCCGTCAACCAAAGAAATGTCCCAGAAATCTAGGTTGCTGAATTGGTCCAAAGCATATTCAGCTAGGGTGTTTGGGGGTTTGCCCCACCCGGTACAGTGTAGGACCCCACCACAATCACCAGTTTGGCATGAACCTCTACCAGCACCATTAAAATTGCAATTAGTACGACCCCATATACGTGCCATCTTAGTGCCCCTCGGCGCATTGATGACCCATGTTTGGCCTCGATCGAGACGTCGACCACCGCCTATCGGGGTCGATGCTGCCCAAACGGTGTAGGGACAATTGTTGCGTACCTCGATAGTGGCAGCATAAGTGTAAGTCACTAAAGCAAGGAAGAAGAAAATAAAAGAAGATGTCAAGTAGCCCATGTTTGTGTACATATTTTTGTTTATTTTTGATATATAGTTACATATTATTGGGTAGTTTATATAGGGGTGTGACAACTTTTGGCACCTTGCACATTTAGAATTCCACTATATAAGTTGTGGAAATTTTTACCATAAAGAATTAAATGGATGGAATGATCATATGGAAAATTCCTCTAAAGGAGAAATTTACAAGCCACTTAAATTGGAGGAGGAATAGAGAAAGTCATCACATGAAGTGATATTAATGACAATATTATATAATTGTCTTTTAATTTGTTAGCAACAATAAAAAATTCATAGCTAATTCCTAATCTGCAAAAATGATTAATTCTTTATTTTCTAGAGATTCAAAGTTAATTAATTATAAAACTCTCCCAATGAATTTAGAAAACAGGGAATGTTTTTTGGTTCGTAGGCAAGAAAGTTTTTGGAATTAAACACAACAAAAATAATCCTCATATCCTATTTTTAACTAATACTTTAATGTGTCTAGATTTTTATTAATATGTCTGCTTTATATTAAAATTTCATTTTTTTAAAAAATAATAGGAAAATTCTCAGACCGTCCATAAAGAAAAATCAAAATTAAATAGTTTCTTCTATTCAAAATTAAGATGAGTTGTCACCATACTTTTAGAATTTTTGATTTTTTTATGCAAGAATAAGACTTACGAGAACAAATGGTAATCAAAATAAGATAGGAAAAAGGGTCAAATTTCCTTTTTTTTTCGCCTAACTTATTTTATTAACTAATTCATATTTAGTTGTGTCTAAAACAGGTTTAACTGATAAAGGTTAAATAAAAAAAAATGTTACTACTGTGTTGATTTATCAATTTTGGGTTATTGAATTAACCGTCTTAGAATATTAGTTCAGCTTCCAATTTTAAAATTTTAGTTAATGATTAAATCAATAACCTAATAAAATTATATTAAAACTACTTTTTTTTATTTTTATTTCTATTTATATAATAATGACTTTATTTAAACTTCAAATACAAAACCTATTTTCTTTTGCATTTTATAATCAATGTATTGATGCTGATGTGTTTTGTGCACTTGTTAAAGTTGTGTTATTATATTATTTTAATATTAATATAATAAATGAAGCAGTGTTTTTGCTTGTAAATTTCTAGATTTCTATTTATGAAGTTCTGACTTAAACGTATCTAACGAATAATTAAATAAAAAATCAAGTATTTAAGGATCGATAGTTGATTATCTAAATTATTGATCATTGATTTAGCATGTTTATAAATAAAAAATCGATAAACTAAACCACAAAATCTAACCAGTAAACATTGAGACCATATCGAATACACCTACAAGTAATTCATTAATAAGTTTCAAATTCAATATACAACAACAAAAAAAAAAGGCAAGAAAATGTTGAAGTAAAGTAAATAACGACACACGATTATTAAAGATATTATTATTTGTTCATCAATGATCAAAATATATAAGTTGAAGGGAGTGTGTATACATTACACAAATAAGATATTATTATCTCACCCAACATAAATAGTTTAACAAACCTTTTAAGATACTTATTGTTTTGGTAATTTTTCATACCATATAATTTGTAGCAAAATATGCTTTATATTGTGGCATTAATTATGGGCAAAAAACAACCTTATAATTTGTACTATCACTAGGGCAAGCAAATGTGCTAGTAGCATCATCTTGTGGGTAGCTATAGGCATCAGGGCATCTCTGTTTGAAAAATTTCGACCACTCAGTAGGACTACATGGACCTTGGGTGCAACAATATTGTTGTCCTCCAAACGTGGTACAAGGATTGTTACATCCTCCGGTTACTTTAAGTTGACTAGGGCATTCACCGTTTATATTGGCGGTGCATTGAATTGAGTGGCATTTTCCTCCACTGGAATTGGTTGTGGCGAAAGTCATCGGTATATTAAATCCATCGACTAAAGAAATGTCCCAGAAATCTAGGTTGTTAAACTGGTTCAAGGCGTATTCGGTCAAGGTGTTTGGCGGTTTACCCCACCCGGTACACTGCAAGACTCCACCACAATCACCAGTCTCACAAGAACCTCTACCAGCACCGTCAAAGTTGCATTTAGTACGACCCCATATACGTGCCATCTTAGTGCCCCTCAGTGCATTGATGGTCCAAGTCTGGCCTTGATTGAGACGTTGAAAAATCTTTTTTTTTTCATGTTTCTTGTTTATTTTAAATTTTATATATTTGTTTGTTTGTTTTTTTCTCTGAAAGAACGTTTCTCTCTTCTTCTTTTTTTTATTATTTTATTTTATCATTATTTTTAGTATATTATTATTTTATAGTTTTGTCTTCTTCTTTTCTTTTTTTTTTTCCAATTATTTTGTACTAATTTTGTTATTCTTTTATCGTTTTTTTTAATTGTGTTGTTATTATTTGTTTTGAAACATTTTAAGGGTTCGGGATTCCTATTAGTATCTTAGGAAGGTGTAAGGCACCCAAAACACTCCGTTAACTTACGGTTTTCCAATGATTAACTATTTGATTATTTTTATTTTTAACCTTTGCAAATTTATTTGAACTTTTAACATGACTTACTTAGCCAAACTTAGCCAATTTTGAAATATTTATTGTTTATCTTGCCAGTTTACGAAACCATTTCAAAGTTAAGCTTACTCAAGGTTTTATCTAAACAAGCCTTACGAATTTGAAACAATTTATCATTTAGAAGTTAGTTTTAAAAATTTTTAGGTTTGATAAAATAACAGGCGTTCGATGAGGGTTTGGAGTTTCCTAATCCTAAGCTTACGGCATTGAAAACAAAATAAGATAAGCAAATAGTAAAGAGAAGGATAGAGAGAGAGAGATTTGGGTCCAACTCCGGGTTCTGTTATCCTTGACCGAATGTTGGACCCACCTGCCTTTGGGTTTTTTAGCCCATTTCTGTACCTGTCCTAATCTAAA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 44

Page 45: Introduction

GEN 3040

CATACAAACATAATATAGAAAACAAAAAAACGATATTTTTGTGAAGTTAACTTTTTAATAAATGATAAATATAATAAATTTCAAAAATCAAATATATTTGGACGACTACTTAAATGTTTCAGTTTATTTTTTGAGCCAAATTTTTATCAGTGTGAATTGAATTGATTCAAAATTTGAAAAAGAAAAATATTTTCCACCTCAATTTTTCTATATATTTTGTGTAAAAATGATTTTTTTTTCCCGAAGTTCTTAAAGCAAACTTGACAATATCACAAAATATTTAAATTTCATTACTTCAAAATGATCTAAATATTCCTGCTGAATATGATGTTAGTTTATAATGAAATTAGACGAGAAATTCCTGAATATGACTGCCCACGTGAATTTTAGTTTTTCTTTGACAAAGAAAAACATGGCAACTACTTATTAATTTTTTTAAATTGATCAAATATTTATTTTCAAAAATAATTAATTATTAAGAGTATAATTAAAAAATTATTATAATTTATACATTAATTTTACAAAATAACAAATATTACGAACTAACTATTTTATAATAAAAATGACATATAAAAAGATACGGAATAAAAATAATCACATATATCAATAATATTACATCGGAATAATGATGAGGAAAATTTTCCACGAGTCACTATTCCACGTCCGCATTTAATTCTTCATGAATATTTTTTGGTCTTTTTGGGAACCTTCTCCTAACTTTACTCGTATCGAATTTAACTAAATCTAAATTCTTATTAAAAAATTCTATATTAAAAATATAATATTTGTGATTTAAAATTGTGAACATCTAATTTTTTATCAAAATATAAAGAATCAAACACCAATTTTTTTAAAAAAAGGTTATATTTAAAATAAAATAAAATAATAATAATAAATTGTAAATTAATAAAACTAATAAAATCAATTATAGTTTATTAATAAAAAAAGAATTGATAAATAGAAGTTGGGAATTTAAGTTATTTGTTAATTATCTAGTTTAAAAATAATTTAAGTTATTTGTTCACCTTATCCTAATTATTCCCAACTTTCTAACTTATTTTACTAAAAAGTTTCATTCCATTAATTTTGATTTTTTTTTCTTCTGACGTTCATTTGACTTTTTTTTTAATTTTTTTTTTAAATTTAATTTTCTATTATTATTATTTATTTATTTTATATCCTTATCTCAACCCCAATTCCCTCAACAATATTTCAAACTAAATCCTAATCCAACTCTATTACTATCCATTTAATTAGCATCAAACTCTCTCAGTTGAGAAGAAAACAGAGAATACAGACAAAGAAGAGACACAGACACACAGATACAGAGAGAAATATAGGGGAAAAAAACGAGTTATTTCTTTTGAGAATTAGGCTAGAGTCAGAAATTTGGAAATTTGATCGTGATTCCGAATTTGTGGATACAGCTCGCTTTTTGTGGTATCAATTTTCGCAGCGAGCTAACTATAAATTCTCGCGTGATTTAAATAATACTACTTCTATGAAAAATTGATTTTAATAGCAATCACATGTATTATATTATTGTTACGTTATTACTATTATTATGTTTTTTTTAGATAGATTTTGCAATTTATGTTGCTTCATGATATTGAGTTGTTGAATTTAAATAGATTAATATATATTTTCGTCTATTTTCACTTTAGAAGATTTTTGTTGAATTAATTGTATACTTGTTAGATTAGGATATAAAAATAATAACTAATAAATAATTTTTTTTCAAAAAGAAAAACTAATCTAAGTAGAATTGTAATTAAAAGGTAAATAAATTTCATTTAGAGATTAATATTATAAACTAATAATAATAATAAAAATAATAATAATAATAATAACAGAAGTAAAAATTGAGAAATGATATAAAGAGAAAAGAGATTTTAAAAAGAAAAAGAAAAAAAAAGGGGCATGAAAATATAATAATAAAAAAAACATAACAAAATTTTTTAACAAAAACAAAAAAAAAAGAACGTAAATTAAAAAAAAATACAAAAACGTAAAAAATAAAAAGGGAGAAGTAAGAAATAAAAAATAGAAATTTAAAATTAATAAAATAAAACGTTAAAAAAAAACAAATAATAACAACACAATTAAAAAAAACGATAAAAGAATAACAAAATTAGTACAAAATAATTGGAAAAAAAAAAGAAAAGAAGAAGACAAAACTATAAAATAATAATATACTAAAAATAATGATAAAATAAAATAATAAAAAAAAGAAGAAGAGAGAAACGTTCTTTCAGAGAAAAAAAAAACAAACAAATATATAAAATTTAAAATAAACAAGAAACATGAAAAAAAAAAGATTTTTCAACGTCTCGATCAAGGCCAGACTTGGACCATCAATGCACTGAGGGGCACTAAGATGGCACGTATATGGGGTCGTACTAAATGCAACTTTGACGGTGCTGGTAGAGGTTCTTGTGAGACTGGTGATTGTGGTGGAGTCTTGCAGTGTACCGGGTGGGGTAAACCGCCAAACACCTTGGCCGAATACGCCTTGAACCAGTTTAACAACCTAGATTTCTGGGACATTTCTTTAGTCGATGGATTTAATCTGCAAATTAGAAATTCGGGTAAGGAATTCTGTTGACCGAAGGGAAGGTATTAGGCACCCCTCGAATCCCGTGGTTCTAGCACGGTCGCTTTATTAACTTTAATATGACCTGACTTAATTTTGGATATTATATTTTTTAAACTAATTATAAAATTGTTATTTATCCTCAAATTTATTTTAAACTTATTTCAAACTGGCTTATTTATTTTAAAGGGTGTTTATTTTTATTTTTATTTTTGTGTAATTTTAAAATTGAACATCCGTATATATTTTTTTTGCTTATATTTGTATGTTTTTAATTTTTAATGTTATAATAGGTAGAATATATATAGATATCTAATTTCTAAGAATTAGAGTTTGTGTAAAAAGGACATATTTAGAGTTCAAGTTAATTTAGGATTACATATTTTAGATTTTATCGAGACTCAATACAGATACCAGATATTAATTTCTACTGCAGGGTCTATTTGGGAGTGGCGCTTCTAACATGATTTTTTTCATATCCAAAGCTCGAATTCGAGATTTCTGATTAAGGTAAAGCATTCTCACTAGTGCACCACAACTCATGTTGGTCATATAAAATTTTTTAATTTTTGTGATAATATTATTTTCTTCTAAAAAGTCTCACTCAATTAAGAATATTCTTAACACTATATTTAGCTTCTTCTATTTTTCAATTATATATTAATATTTATTTGCACATATTAAATTCATAATTTAAATTGCAATTGACCGCATGCAAATCAAAGTCTTCGTCAAATACGACGAGTCGTACACATGCCTTAAAATTTGAGGGGCTTATTTTTTTAGCAATAATATACTTATAAGTAATTTCAAAAGTAAGTATCATCTATTTCAATTTTTAGTTTAACAGTATAAGATTTAAATGATTTTTTTATTTAATTTCAATTATCATATCAAGTCAAAAAATAAGATAAATAAATGGAAAATTGAAAGTACTATTTATTGAATAAAAATAAACTATTTCATGTGAAAGAAAAAAAATAACTTTCCATCACTTTTTAAGAAATATATTAGTAGCAACTTTGTATTCTAAAAAAAAATATAGAAGAATGGACTTTCTTTAAATTTTAAATAGTACTTATTTGAATTATAAAATAATCTCAATTAATGTTCTATTTTTTTTCCAAGAGATATATTATGGTTTAAAAATTAATCCATGATGTGGAAAAATGATCAGGAAATTTTCCCACAAACCACTATATAACTTCTTCTTTTCTGTACTTTTGAATTCGATTAAATTAAAAATTTTGTTAAAAAATTCAAATTGAAGATATAACTTTCAACTTTAATGAAAATTTTAGATCAACATTTAAATTTGAGATTACTGGAAAAAAAATTGGCATGTATTACTTATGTAATAATAGTTATTAACTATTTGTTTTAGAATTATCAAAATT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 45

Page 46: Introduction

GEN 3040

CCAGTATAATTAGGTGATTCTTTTTTGGGACATTGTGGGGTTTCAACTTTCAATTTACAAGACTAGCAAGACTTGCTGTTGATAGGTTAGAAAAGTCTTTGTAAGAAATGAATTGTATTATTTTTCAAAAAAATTATAATATAAAAGAAGATATAAATACATGTGTAATGATATATTATCTAATTAAAATAATAAGTATTTTAAAGTTAAAGTATATTACTAAATATAAAAAAGATATTTATCATTTATTCAAAACTGATTAAAGAAAATCACATAAATTAAAATGAAAGAAATACAGCCTTATTTTCTCAAATTAGTGGAAATAGTTAAAATTTGAATTTGAAATAATGACATTGCAAACCCGTATTGTATATATAATATATCAAAAAGCTATATACCTATAGGGAAATATAATTAATATATAAATTTTTCTCGTCAAGTAAAAAAGTGTTAATACATATCTTAATATATAGATATTCATTCAAATTCAAATGTCTTAATTTTAAGGAAAGTATTAGAAACAATTAAATGCTTACAACTAAACGAAACAAAAAATTCAATCTGTCGGTGGCATTTACTGACACTAAAAGGGAATAATTAGAAATGTTCATTTTTTAAAAAATAAAATAAAAATAGAAATCTCCTATAACCAAAAACGACATTTTCTATTAATTTACCCTTTATATATTTTTTATCACTTTTAAAACACAATCCTAATTTTTAGGGAAAAAATAAATTAATTCTTCATATCTTGATTTTTAAATTGACTAGGATTTTAAAATGAACTTTGTCTTTAATAATGTTGATAATTAATATGAGACAAAGTCGGAGAATACTTAATGGCTTTAGTCATGTTGCTCAGAAAGAAATGTTTAATTAGTTATATACTACGGAAAAGGGTTAAAATATTCTTAAACTATCAGAAATAGATTTACCCTTAAATTATATTTTGATTTAAAACTATTTTTCTATCAAATTATTGGGTCAAAAATACCCTTCTTATTTTTTTTACCCTTTTCCGTATATATTTTTAAAAAAGACATTTTGATGAAAATTTTTAGAAATTGGAGCAAAAAGCATTAGATGATCATAATAATACATATTGCAAATATTTGACTGCAAAGCTAAGTCTTTGTCAATAATAAATATGAAGCTATTGCAATTTGTGGTCCTCATGCATTGGAAATAGTATGCATAGCCTTTTAATTAAGATGTACATTGAGTTACATAGCTGGAAAGTCGTCGTTTATTAAGAGTGTTTTATTTATTAATGTGAATTTTTTAAATACAAATTTATATTTAATCGAGATTTAATATGAATAATTAAACACCAAAATTTAAATTTAAGAAGGTACAATAATTTACCCCTCCTTTTGTTTTTAACTAGCTAGATCGAATTGTATATATAGGTAATCATAATATTCAAATTGTGCAATATTTGACTGTGTATAGCTAAGTGGTCAATTTATAGTTGGTCGTCATCACTAGAACTTTATAGCCTTTTAACTCTTCTTTGAAAAATTAAATTGATTAGAATACTTTAAAGAAATTAATTTAATTTCATGTGAAAAAGTAATTTTAAGCCACTTAATTAATTTCATGTACTTAATAATGGGAACGAGGAATATAAAATTATATTGGGTGAAGCTTCCACAAAGAGAATCCTTAATATTATAATATTATATAATACTTTTTGTTCTGAATTAAATCTCGATGATTTTTAAAATAAAATCTAGCTCGCTTAATTAAGAGTGTTCATAATGTTAATCGTAACTTCTCTTACTTTTCAATTATATGGCAAATGTGAAGTTTTGCACAACTAGAAAGCAATCTCCTTCTCGAATTAAGTTTCATGTATGCATGACTTACCACTTTATTCCTCGAATAATTTTACATAGCTCATCACTAAAATCGTCGATTCATGTGTTGAAATGGATATTAATTATCTGTCACGTATAGTTTTCAAATATTTATGATCATCGAAGTCCATATATAATGCAAGCTCCTATTATTATTAATTTTTGTCTAGCTTTTCATAGAGGGTGAAAGACGGATCAGACTCAATACCATGCATCTTCATACTCATCTCGTCAAAAGCTATTAACTTAAACCTTGAGGGAAACTTAAAAGTGACCAAATCATCATAAAGATATTCTTTATTTTCCTTTTATTCATCACAAACTAAGGAACTACATAGTACTTCCATATTATCACTATATATATGATGAATAAAAACATCATATTTTTCCTTGACTTAGTAATTAATCCTCCTAGCAAGGAGAATAACATAGCCTTATTCAAATTATTAATACAAAAGGATATATAGATAGGCAGATAGATATTTAAGGACAGAATACAACCCTATAATTAGTACCAGCAGGGCAAGTGAAAGTACTAGTTTGATCATCCTTAGGGTAACTATATGCATCAGGACACCTCTCCTTAAAAAATCTCGAAAATTTAGTCGGACCACAGTTACCAGAGTTACAACAATATTGATCAGTCTTAAAAACAGTACAAGGGTTGTTACATCCTCCGGGAGCTTTCAATTCATTAGGACATTGTTGATTAATTTCCGCTTTACACGTTATGCCACGGGTGCACCCATTGGAAGTTGGACTAAATTCCATAGGAACATTAAATCCATCGACAAGAGAAATATCGAAAAAATCTTTATTTGCAAATTGGTTTAGGGCATATTCAGCTAATGTATTTGGTGGTACACCAAAAGATTTACATACAAGAAGACCATTACAATCTCCAGTTTGGCATTTTCCCTTACCGGATGCATCGAAATTGCATCCGGTACGGCCCCAAACTCGAGCTTGTTTAGTCCCCGGAGGGGCGTTTATCGTCCACGTTTGGCCACGATCGAGGCGTTTGCCGCCTCCGGCTGGGACTCCGGCAGCCCAAACGGTGTAGGGACAATTGTTACGAACTAAAATTGTGGCTGCATTGGCATAGGTGAAAAAAAGGGAAAAGAGAATAAAAATTGAAGTTTTCATATTTTTATTTTTTGTGTGACTACTAATAAATTGGGATGAGTGATTTTTTATGAGGGAGTAATAATAATTAGAAGGGTATTTATAGGGTAAAAATGGGGCACCAATATAGTTGGTAAGTAACTAATTTGTTTGAAATTTTAGTTTGTGATAATGTAATGTAATGTTTTGAGTAGACTAGTACATTATTCTATCGAGGGTGCAAGGTGTGGCTACTAGAAGTTGTAGAAAATGTAGCTAGGGTCTATTATTCAAGCTTGGATTTTATTTTTTACTTTTGAATAATATGTATATATATATATATTTATTTTTTTTACGTATCAGAACGAAGTTCGATTAAATTTAGATACGCGCTTTAAAGTTGTATATTGAGTATAAAACGTTTCCTAACAAAAATAACTATGTACTTAAGAGACTCGAACTCGAAACATCTGATTAAGAATTAATGAAATAGAATACTTATCACTTCACGGCCAAAAGCGAAAAAGAAAAATCAAAGTCAAGAACAATGAACTTAGAATTTGGATGGTACATAAATTCAAACTTTTTTAGATAATCATGCAAGTTTAAGGACCTCTATTTAGCAACAATCGAAAGGCTATGCACACATGAACATTATATTGCTTTGTGATTTACATAAGTGACTTTTTTCAGTATCCACATTTGATATTCATATTGAATTTTAACACTCTATAAATAATAAAAATCGTGATTGAATTCGATCGAAATCTTAAATTAAAAATTTGAATCAAAATTACTGAATTCTTATTCGTCAATATTTAATGATCTGTATTAGTGCCCCTCGAATCAATAAAGTTTAAAAGATGGTGACTTGTTAAGTGTACAAAAAAATCAAAAGAAAAATAGAGACTTGAAAGTAAGAAATATCAAAGTTTACTAAATATGAAAGAAAATTGTTAACTACCTTGTACAAAATTAGGTGGTGGAAAATTCGTCTTCAATCATATAAAGGCATTTTTATAAAAGT

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 46

Page 47: Introduction

GEN 3040

TATGTCATAGACAAATGACTGGCCAAAAAGTTATATATTTGCTAAAATAAAAGGTTTAAAATTAAATAATTTTTAAATATAAAATATATTAATCTTTATGAAGTAGATTAAAAAGAGATGTATTTATATTAAAAAGAATAAAGACTTTTAAAAAAATAATTATAAATAAGAACAAAACTCGAATAATATAGGAAATTTGTAGAAAGGATAATGTCTCCAAATAAAATATGGGACGACTCAAGCAAACAGTGCTACAATTCAAACGCAATTAACAAGTGAAATAAAAAATTTGTCAGAAATTTCAAAATCAACAAACGATTTATGAGGTTCAAAAATTTATAAGAGTTTCATCTTTCTTCTGCATTCTAATATCTCATTTCCAATAATACGTTTTAATACACTTTGTCTCAAATACAAATAGTAAATGAGTCAATATATAACGTAAAATAACGGCGATATACGATAAATATGGGAAATCTAAGCATGTGTGAACATAACACATAATAAGTCATCCATTATTTACTGATTTATGGTAAATTAAGTAAGGAAGAACGTGACATTTTATTATTATTTAATGAAAGAAAATACTTTGAGAATATATTAAACATGACTAAGAGAATTCAATATTTTATACATTTTAATTTATGGCACATTAATATAGTTTTTCCACTTCTGTAAAAAATATTTTAACTTAAATTACACTTTATCTAGTTTCTAAAGACGAAAATATTAATTTTCAAACAATAATTAGTCTAAACTGATTAATTGAGCTAAAAAGGTAAAAATTGATGAGATTGTATACCAAGAAGATTATTATAAAGTAGTTTACTATGAGAACATATTATTGCAAATGAGTGTATTGATACTTTAACCTAAATTAATGTGCCAAGCTAATGTATTTTTTATCAATAGTTTACTGGACTGATATAAAAAATAAAGGGATTTTTCCCCTCGAAATTTTGATCAAAATTTATTATAATCGTATTTTATTTATGTTATTTTATGTTTTAGTTTTTTCTATTCTTTTAATTTTAATAGCTTTTTATTTTAAAAAATTAATTTATTTTAAAATAAAAAATATATTTAATATTTAAAAAAAATTAATAAAATTTTCCGACCATAAATAAAATATCATATGCAACTTATAAATTAAATGAAAAAAAAAACTTATTTAAATGTAAGGAATATTTTTAAAATTTAGCTAAAAAGTTTCCCAAGTTACGAGGGAGGGAATATGGATGTTCATATTTTGGATAAAATTGATTCGGAAAGCTCAATCAAATTGATTAAACAAAATAATTTGTTTTTTATTTAATTTTACATTTTTTAAATTAGATAATAGTTGGCTTTCCTTAAAAAGCTCAAAAAAAAATTAATCAAATATATATTATGAACAATTTTAAATATTGTATATATGCTTTCGTCAAGATTTATATATATAAAAATTGTTCAACACAAGAACATCTATTTTATGTATCAATTTTATTTATGTGAGTTTTTTACGAGATTAGGAATGTCATTGCCACTTCCTAGTCAGGGACAAAGTGGTGAATTTCAACGCTTTGTTCACATTAAATTTAGTCTTTGAAAGTTTTGTAACGTCATCCGATCTAATTATTTTTTTGTTTGAATAACTTGTTAATTTTTTCTTCCCTTTTATATGAATCAGTTCTTTTATGTTTTATATATGATTAATAACTGAATAATCGAATAAAATCACATTGAAATCGATAACAATCAAGCTGATAAATGTATATTATATTAGGTTTGATTTGATTTTGATTTTGATCAATAACTAACCCTAATCGACTTGTGAACACTCATAGTTGATGATAGTAATGGTTGTAGTTGAGTTGAGGACGGTAGTTATGGACGTTAGATGATGGTGATAATTATGACTGAAAATAGATGAAACAGTGATGATTTGTTATCTCTATAAAATTCTTAAATAATAAACATAAGACTTCCATATAAATTTCAATCATTCCGACTCATTTAGACCCATTAAGTACGGTAAAGATCAACACGTGAGTAATTAAAAACTCATTCACAGTATTTCAAACTTGCATTTGTTAACATTAAACATTTTACAAAACACGTGAATTATTAACCATGCAAAAGAATTCCAAATTTGCAGAATTCGACCATCCTTCTGTGACTGAATTCAATAAAGTCATATCGATAACTTTTAAATCCGTTACTAACAAAAACTATTTGTATGCATATATACAAAAAAAAATTGGCTTATTTTTTTAGTCTAGTTGAAAAATATATTATCTCGAGATAAATTATTACCCTAAAGAGAGACGGAGCTAGAAATCGAGTAATATAAAAAAGAGGTAAATGAAACGGCGTTAAAAAAAAAAAGTCCTTCCTCAACCCCACAGCTTGAGTGTGTTTCTTCTATAACACACTACTTTTGTATATATATTTTCAAATCCAACAAATCCGTCAGTTACTCTTCAGAATCTTCTTCAGTTTCCCGCTACTCTCTCTCTCTACAAATTAATATCTTTAAACCCTAACTTCACTCAAAAACGGTCAGTTACATTCAATTATACTCTCTTTCTCTCTCTACAATTTAGGTACATTTTTTTTCTGATTTTTATTTAGATTTTACGAATTCGAAGCAATTTTAGCTGTTTTGTTCATCATTGTATGTTTTCAAGCTGTTTTATTTTGGATTTTGCGGTTTTTTGTTCGTTCGTTAGGTGTGAATCTGGTTGTTGTGAATTTTTGATCCTGTTCGATTGATTGATGGAATGTTAGTGTAAAGTTTTGATTTTTTTTAGTTAAGAGATGTAAAAAAGGGGATTTTTTTGTGTGTTTTATGCACCTGGGTGCATCTATGGAAAGGAATAGATATATGAAAATTGTATTGGAATGAGTGAGTTTGAGTGGTTAGGAATGATTTATATAGCGGATCTGTACTGGTTTGGGGTTCAGGTGTACTTCACTGAAACAGTTCATTTAATTTGACTACTCTGATTAGCTATATCTGTATCTAGGTTTTAGGTGGATTTTTATGTGTTTTATAAATTTGTTTTTGTATCCATTCCCTGTTTATTTCGAAGCTTTTCAAGTGTTTATCTGCATTTCGTGGCATCTTGTTCATTATTGTATGTTTTTAAGCTTTTTTGGGGGGAATATATTTTTTTTTGCTTATTTGTGTTTTTTATTAAGGGGTTAATTCTCTTCGTTTGGAAAATCTCGTGATGTTTGATTGCTGAAAAAAATGCTAGTTTAAAGTTTTGATCTTTGTGTTGAATTTTGAGTTTTTTTTGCTTAAAATTTGGACTTTTGTGTGTTCTAAGGTTCTTGAGGAGGTACTTTAGGGATATTCATGAAATTGAAATTATGAAGTTGATTTGTTTGGCTACTATGCAAAAATGATGTTCGTATTGCATAAATTGATTTATATTGGGATTGAGCTTGAGTTGATACAAATGATTTTGATAGAAGATCTTTACTAGTTTGGGGTCAAGGTGTGGTAATTGATTGATTAGTTCCTGAGGAGCACACACATCTTTATTGTCTTTCTCTTGCAGAGCTGTGAGATATTTGATCGTATTACTGGATTGTTGGGAATTCTTTCCATACTTTTGTTTGGGTGCCTTGAACATTTAGGATTTATGACATATGTTTGTTTTGTTGAAAAAGATTCTATTTTCTTACTTGGTGTAACTAATAGCTCTCGCGACTTCCATGTCTAGATAAAAGTAGAGGGTTATAAGTTGACAGCTAGTATAAACGTAGTCTAGCATTATACTGATTGTCGGGCACAACATGTTTACTGATGCGAATAGAGCAAAATGTGGTTTTAATTTCTATAACTGTTCTTGTTTTTCTTCATTAGCTTGTGCATTTGAGCATTTGTTCTTACTTCTTCTTTGTCGTAATTGTTTTGCAGTTTAAGGATATTCTTTGGCTGCAAGACATGGTACAAGTTAAAGAGGAAGCTCCAACTCTGGACATTGGTGGATTCATTTC

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 47

Page 48: Introduction

GEN 3040

TTGTTCATCTCTCTCAGATAGCAGTTATGAGGCTAGCACTCCGAGATGCTCCTCCGAGCCTGGGTATGCTGAAACCTTTGATTTTTTTCTGTCTTTGAACAGTTGGGCCTCAAATAAGGTCTTTCAAACAGATACGCTGATTGATGCAATTTTAGTTTTCACTTCAATTGTTTAATCCTATTTCAGTTCCAGTTGTCGAAGGAGCTCAGGTCCAACTAAGCGTTCTTCTCAGGCAGGCTGGACAGAAGAAGAGGTAAAAGGCTAGTCGCTAGAGCTTTTGTTGTACCTATATGTTGTTTGATTTCTAGTGCATTAATTCTCCATTGAAGTATCCTGCATACATGTATGATGGTTTTTATAACCTTGTGATGAGAGTGTGTCTCCGTGATAGTTAGGTACTTATAGAAGTATCTGTATGAGTTGTCATCTCCTTTTTCCTTTTGGGTTCTGATGCACGTGGATGCTTAGGATGTATAATTGAGCTAAGGAAAGGATAAAGATAGATTTTGTTACTCATAATATCAGTTTAATTTACTCATTTTTACTATTTCCATAGGCTCATTTTCTTTGATCTAGAAATTTCTGACATCTTTATTTCTTTCACAGGACAATCTGTTGACTGAAGTGGTGGAACGGTTCAAAGGAAGAAACTGGAAAAGAATTGGTAAAACTATTTACTCTGTCATGCTCTATACTTTTCAGCTAGGTTAACCACAAAGACCTTCACTTTTTGCTCCAAGTCACTTTATGACTTGGTTTCCATATTTTGAATACCTGACTATTTCATGGAGTAACTTATACTGTTGTATCTATGAGCAAAATTTGTCTGCTGAACATTTTGTGTGGTTTTATTGTGTTCTCTTTTGTTCTATAGCTGAGTGCTTGAATGGAAGGACTGATGTGCAGTGCTTGCATCGCTGGCAGAAGGTTCTGAATCCTGAGCTTGTAAAGGGTCCTTGGACAAAGGAGGTATTAAACAGTTCTACATCCATTGGTTTTCTACGAGTTGTATTCCAATTATCAATACTTTGATTATTAAAGACAACTGTTTTGTTCTATTTGGAATTTCAGGAGGATGACCTGATTATTGAGTTAGTTGAGAAACATGGCTGTAAGAAGTGGTCCTCTATTGCAAATTCTTTGCCTGGTCGCATTGGCAAGCAGTGTCGGGAAAGGTCAGTTTCTAACTCCGCCTAAATCTGTTGCTAGGCCACAGCTTCTATTTATACCTCCGACTAATTTACATGATCATAAACCCAGAATTTTTCAAAAGGCTTCCTGGAACATCTGTAGTTATGTTGCTTACAGAGTTATTCCTTGCCATCAGAACTCATTTTGTCTCACTTTTCTCTCGTGTTTTGCAGGAATTGTATCTTTTTGTCTTATTTTCACTTAGTCCTACTGGATTTTTTGTCCTGTTGAGGCCCTGCTCGTTTGTAGGAGCATTGTATTTCTTGTTATCTATTGCTTGATATATTGTGTGCGTCTTTTCAAAGCAATTAATCAATCAACTACTGCCCCAATTCGTAGTAGGGGTTGGCTATATGAAAAATCCATAATTTGGGGCACAACTCTTCTTTATAAGGATTCAAACTGGCTATGTTTTGCACATAGGCAGATTTCTCTTTTGTAAGTGAATAAGAAAAAGCATGAAAACTGGGAGCCTGCTAAATATTTGGTTACTGCAATTTAATGGTATATTATTCGTTGGATTTTGCAGTTAAAGCTTAAAAGGAGATCCACATTTCTATGTTCTGTTCCTTTTGAATAAGAAAATGACATTGGTTGATTAATTGCAAGTTATCATAGTTGTTTCAGTCTGCTATGTTCACTAACGTTGTGAGATCGTCCATGAATGTCACAGGTGGCATAATCATCTTGACCCAACAATAACAAAAGATGCCTGGACTGAACAGGAAGAAATGGTCCTATGCCACTATCACCAATTATACGGAAATAAGTGGGCTAAAATTGCGAAGTTTCTTCCTGGAAGGTATTCTTATAAAACTATTCTTCTATTATGTATATCAGTGTCTGATTTTGAGATTCCTTGGGCATTCGTAAAATGTGGATCAAGTAATTTGTTATATCTATCACCCTCTCATACCAATGGCAGATCGTTTGGAGTGCATGATTCTGAAAAGTCAGTATTAGTGTGGTATAACTTGTATGCCAAATCAAACACAACTTTTTGATTTTTAGGTTTCCTTGGTACATTGCTTACAGAGTGTATCAAAATCTTGGATACTACTCGAGTTTGTTCTAGGGAGATAAGTAGATAATTTTTATAATGAAGGCAGTGAAGGCATATGCAGCTTTCTCTTTACTTTTACTTCTTTTTATCTTATTCTCCAGTTTCACTTCCATCCAGACTTCTAAATCATTTTCTCCCAATATGGTTTCTTCTTTTACTTCTTGAGGTTCAAAACTATTCATGTCTGGAACTCTCTTAATATTTTATTTTGTCAATTTGACATGGAAAAAAAGTTGAAAGAATCATTTTGGGCCTAACTCCACCCCAATATCTAGTTCAAGAGGGAGGAATGCCCAAGACTATGTATGTAGATCAGCCATCTCTTTTTCATTCTATAGAGGGCTCTTAAACACTCTCGTCATGCCCAGGACTGAACATCAGGAGTATGGGCAATTCAAATAGGGGCCCAACATCGGGAAACAAGAATTGGGATGAGTTCTGGTATCACCAGGTAAAGAAATGGATCTTGGGACTAACTCAACCCCAAAAGCTAACTCATGAGTGGAGGATTTCCAAGGCCATATACGGAGACCACCTATCCCTCTTTCACCCTATGTGGAACTCTTCAACAATTTTAAACTTGTAGTAATAAGCTAAGTTTTTTTTTTTTGGTCAGATTGACCCTCGGCAAGCAAGTTAGGAGAACCAAATGGGGACAAAGTTAGTAATAGTATTATAACTTCTTTCTGCAGAAGCCTGGATATTCTAGGTATCGTAAGTGTACTTTTAAGGACTCAATCGTGCACTCTGTTTAATTCAGAGGATTGTCGTTCTTTTGTGCTAATAGCTGCAGGTACATGTGTCAATTTGTATCTATTAAGTCCAGTTTCTAATTTATCATGATTACTCTAAAATGATTTTTTTGCCTACGCTTGATGCGTTCTGCTTGAATGATTTTTGCTCAAGTATCATATAGGTACATATCTTTGATTAGTTCTCCTTTCTGATCTATTTTTCCTTTTGGCACTCTCTCAGGACTGATAATGCAATTAAAAATCATTGGAATTCCACTCTGAAGAAAAGATTGAATTTGAATTTACTAAGTAGGTTAGTGCCGGATATCAAAAGTGAGGGATCTCCGGACTTCTCAACTCACAAGAAAAATATGGAAATCAAGAAGCATCCAACACAGGCTCATAATGCAGAAACAGTCTTTTTGAGCGAGCAAACAGGAGTAGGTAATGCTGCTGATGCTTGCCCAACTGATTTGAGGATTGGATCTGCTAATTCTCCTCAAAATGGTTTGCATAAGGGTACTTCTTCATTGGGAACCTGTAAATCATCAGAACAGAAGGAGAGGGATCTGATAAAGCAATTTGGTGGAATACAATTTGTCAAGGCAGAAATTCTTCCAATAGGTGAGACAGATAAACAATGCCAATCCACTTTAAGTCTCACTAAAATATCATATCCTCTGTTGGACTCTTCTTTAGATGCTCGTTTGGATCCATCACACAACACCGGATGGAGTAGTTCTCAAGTTGAGGCTGTTCGTCCTACTTCTTTTGGGAGTATGTATGAATCTCCCAAAAGGTCTAGGTACGATGCTGCCAATGATCCTGACCATGATTTCTTGAGTTTGTCATTGGCTGCCTTCACCGAGGTTTCCTACGGTAACAAGAAAAACAGAACATATGATACACAATATTCTTTGGCTCTCAATCAGCCTGGCCGCTTGTGTTATGAACCACCAAAAGTAAAAGACCTGATGATTTCTTCAATGGATGAAAAACTTA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 48

Page 49: Introduction

GEN 3040

GTAAAGACAACTTTATCGGGCAACATCATGGCCATCCACTTTGCTCTACACCTCCTAGTCTTAAGTTAACAGTCTCTGCTAATGGTAGCAGTCCAGAATCTGTGTTAAGGAACTCCGCAATGAGCTACACAAAAACTCCTTCAATCATAAGGAAGAAGAATTC

Figure 10. The cladogram tree of the genetic relationship of the cloned sequence and five

others using ClustalW

Figure 11. The phylogram tree of the genetic relationship of the cloned sequence and five

others using ClustalW

Q3. Have both the cDNA-derived sequence and the genomic sequence of this gene been

determined?

Yes. Both the cDNA-derived sequence as well as the genomic sequence of the Nicotiana

tabacum osmotin gene were determined. The entry number of the genomic sequence in NCBI os

X95308 and the cDNA is M29279.

Q4. Are there likely to be any post-translation modifications of this gene product? What are

they/what is their purpose?

There are post-translation modifications of gene product and the they are the protein motifs at

specific sequence found in the amino acid. For this sequence, the sequence motifs were

determined by using Patmatmotif program

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 49

Page 50: Introduction

GEN 3040

(http://inn-temp.weizmann.ac.il/emboss/output/220946/outfile). The protein motifs found were

EGF_1, THIOLASE_3 and 2FE2S_FER_1.

######################################### Program: patmatmotifs# Rundate: Thu 19 Aug 2010 20:00:04# Commandline: patmatmotifs# -auto# -sequence /var/www/html/emboss/output/220946/.sequence# -nofull# -prune# -outfile outfile# -rformat2 dbmotif# Report_format: dbmotif# Report_file: outfile########################################

#=======================================## Sequence: from: 1 to: 1304# HitCount: 6## Full: No# Prune: Yes# Data_file: /usr/local/share/EMBOSS/data/PROSITE/prosite.lines##=======================================

Length = 12Start = position 1271 of sequenceEnd = position 1282 of sequence

Motif = EGF_1

GTCGACTCTAGAGGTCCCCGGT | | 1271 1282

Length = 14Start = position 1053 of sequenceEnd = position 1066 of sequence

Motif = THIOLASE_3

TTCTGAAGAACTACATGGGTCCGG | | 1053 1066

Length = 14Start = position 1016 of sequenceEnd = position 1029 of sequence

Motif = THIOLASE_3

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 50

Page 51: Introduction

GEN 3040

GGAGTAAATACAATATAAATTCTT | | 1016 1029

Length = 14Start = position 830 of sequenceEnd = position 843 of sequence

Motif = THIOLASE_3

CAGAGAAGAACAACACAGAAGATT | | 830 843

Length = 14Start = position 85 of sequenceEnd = position 98 of sequence

Motif = THIOLASE_3

AGGAAGAAAACAAAAGAAGATCTC | | 85 98

Length = 9Start = position 1244 of sequenceEnd = position 1252 of sequence

Motif = 2FE2S_FER_1

ACTACCAGAACATCAACTC | | 1244 1252

#---------------------------------------#---------------------------------------

Q5. How many amino acids are present in the mature protein?

By using the Expasy-Translate program, there were 246 amino acids were found in the sequence.

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 51

Page 52: Introduction

GEN 3040

Q6. What is the molecular weight and the isoelectric point of this molecule?

The molecular weight and the isoelectric point of the molecule were obtained from ExPASy

Proteomics Server by using Compute pI/Mw. The isoelectric point obtained is 5.09 and the

molecular weight is 109296.44.

Compute pI/Mw

Theoretical pI/Mw (average) for the user-entered sequence:

10 20 30 40 50 60 CGCGCGAGCG CCAACGGTGT ACGGGCAGTT GTTTCGGACC TCGATAGTTG CAGCATAAGT

70 80 90 100 110 120 ACAAGTCACA AAGGCAAGGA GGAAGAAAAC AAAAGAAGAT CTCAAGTATT CCATGTTTGT

130 140 150 160 170 180 GGACATTTTC TTTTTAACAA GTTGCGTTGT TGGATATAGT GACAAATTAT TTGGTGGTTT

190 200 210 220 230 240 ATATAGGAGT ATGGCGGCTT TTCGCACATA GACATTAATC ATATATATAT TTATTAACAT

250 260 270 280 290 300 TTGACTAGTA AATGGATAAT TCTCAAGGAT GCTTTAGTAA AGGTGGAGGC GGCTAAAAGT

310 320 330 340 350 360 ATAAGAGCCG CCTAATAAAC TAATAATTTT ATGAATATAG AAGTCAATTT ATTTGGTGAT

370 380 390 400 410 420 GATGCTGACC TATAAGCTCT GTAACGTGAC ACTTGTAACA ATATCTCTTT TAGTCAATTT

430 440 450 460 470 480 CTTGCTTTCT TTTATCTATA CCATTTTTTT AATTCACAAG AAATTGGCAG ATAGCCATCA

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 52

Page 53: Introduction

GEN 3040

490 500 510 520 530 540 TTTTGAAGCT ATTCAATATT AGTTGCTTTT TAACATGAAA ATAAACTTAA ACAAAAATAT

550 560 570 580 590 600 TTTTTCGCAA AAATAAGAAA AATATATAAA ATTTTTATTA GAGTTTAAAT ATTTATCAGT

610 620 630 640 650 660 TTGAACTTCA TATTTCATAC TCCCGAAAAT TTTTATGGAG TTTGCACAGA TCATAAATTT

670 680 690 700 710 720 TTCATGTAGT TTTAATACTA AATCTTGGAG TTTCACTTTT AATATTGATG TTATATTGTC

730 740 750 760 770 780 CCCTACTTAT TTTTTCTTCC TTCGTTTTCT TTTTGAGAAC TATTTCATCT ATTTATATAT

790 800 810 820 830 840 AGCAGCTCAT TAAGTAGATG GCGTTATTCT GGATGTGGAA TAAACAGAGA AGAACAACAC

850 860 870 880 890 900 AGAAGATTAA TGCCAATCGG GTAGACTTGA TCAACAATTT AACAAATAAG CTGATTGTAG

910 920 930 940 950 960 ATAACATAAA CTCATAAAGT TACGCCAATT CAATAGAGTT TGTTGAACAA TTGAACTAGC

970 980 990 1000 1010 1020 ATAAAATCAT GCCAACTGGA TAGAGTTTGT GAACACTTAA CAATATAGGT GGAGTAAATA

1030 1040 1050 1060 1070 1080 CAATATAAAT TCTTCCGATA GAAAAATTTC TGAAGAACTA CATGGGTCCG GAGATCATTT

1090 1100 1110 1120 1130 1140 TTTAAATATA TTATAAAATG GAGACAATAT TTAAAGACCA GTCCGTTATA GGCCGGCCAT

1150 1160 1170 1180 1190 1200 TAATTCTTGT CCATCAAATT TGCATCTGCT GTTGGTTTTC TGCTAGATCT ACTTAAAATT

1210 1220 1230 1240 1250 1260 TATAAAACAA TCAGTTTCGA AACAAAGTCT AGTGTAGTAC TACCAGAACA TCAACTCATA

1270 1280 1290 1300 TAAAAGTCGA CTCTAGAGGT CCCCGGTACC GAGCTCGAAT TCGA

Theoretical pI/Mw: 5.09 / 109296.44

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 53

Page 54: Introduction

GEN 3040

Q7. On an annotated printout of the promoter sequence highlight and list the features common to

the regulation of eukaryotic genes. In addition, using web-based promoter analyses too and your

knowledge of gene transcription/regulatory machinery and the references listed, locate and

record the position on a printout of the above file any sequence motifs in your promoter

sequence which may be important in regulating the expression of this gene?

A promoter is a part of the DNA that is essential in the transcription of a gene. Generally,

promoters are located on the same strand close to the gene they regulate. The RNA polymerase

must first attach to the DNA sequences close to the gene for transcription to occur. The

promoters contain specific DNA sequences and response elements which provide a binding site

for the RNA polymerase to attach as well as for the attachment of the transcription factor. The

location of the promoters is usually adjacent to the gene which is near to the transcriptional start

site. If the gene sequence encoded for the promoter goes wrong, the amino acid could not be

produced. As a result, the production of the protein will be affected and the wrong promoter

sequence will regulate the expression of the gene. The sequence motif or the DNA-binding

proteins in the promoter region regulates transcription, polymerases, histones as well as

nucleases. They are also capable of incorporate the domains that aid the binding process to the

nucleic acid such as the leucine zipper, zinc finger, and helix-turn-helix. The DNA-binding

proteins involve the chemical modifications such as methylation, acetylation and phophorylation

that could make the DNA more or less accessible to transcription factor. As a result, it affects the

rate of transcription process. The DNA-binding protein also regulates the expression of the

osmotin gene. The TATA box is essential for the initiation of transcription and bind to the

specific region.

The sequence of the TATA binding protein can be obtained by using fuzznuc (http://inn-

temp.weizmann.ac.il/emboss/output/470819/).

# Program: fuzznuc# Rundate: Thu 19 Aug 2010 20:29:38# Commandline: fuzznuc# -auto# -sequence /var/www/html/emboss/output/470819/.sequence# -pattern TATAAA# -outfile outfile# Report_format: seqtable# Report_file: outfile########################################

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 54

Page 55: Introduction

GEN 3040

#=======================================## Sequence: from: 1 to: 1304# HitCount: 5## Pattern_name Mismatch Pattern# pattern 0 TATAAA# # Complement: No##=======================================

Start End Strand Pattern_name Mismatch Sequence 565 570 + pattern . TATAAA 1024 1029 + pattern . TATAAA 1092 1097 + pattern . TATAAA 1201 1206 + pattern . TATAAA 1259 1264 + pattern . TATAAA

#---------------------------------------#---------------------------------------

#---------------------------------------# Total_sequences: 1# Total_length: 1304# Reported_sequences: 1# Reported_hitcount: 5#---------------------------------------

Q8. Why is EDTA included in the GTE buffer?

EDTA is responsible in the removal of the magnesium ions. The ions must be removed as it is

essential for the preservation of the cell envelope structure. EDTA in the GTE buffer works by

protecting the DNA from the enzymatic degradation in the chelation process of free divalent

cations and also capable in causing the bacterial to burst (Brown, 2006).

Q9. A manual sequencing autoradiogram of the Sanger dideoxy reactions often has a region

where there are terminated fragments in all four lanes with a fainter sequence continuing past

this point. Suggest a reason why this may occur. What might you expect to see in this region of

you sequenced this DNA fragment using dye labeled dideoxy nucleotide terminators?

There are terminated fragments in all four lanes with a fainter sequence continuing past this point

as it might be caused by falling of the template by the DNA polymerase. The phenomenon can

be explained by the secondary structure of DNA template, the hairpin loop that leads to the

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 55

Page 56: Introduction

GEN 3040

falling of DNA polymerase. As the replication happens, the sequencing fragments in the manual

sequencing which are visualized by radioactivity attached into the sequence fragments and

causes the premature fragments termination. As they have no labeled terminator, the premature

terminated fragments will not be able to be observed. The secondary structure will not always be

carried by the template molecules as DNA breathes in solution, the template molecules will be

able to overcome the hindrance and fragments in the regions by the aids of some DNA

polymerases. Thus, it will be terminated by the correct labeled nucleotide. The termination will

be appeared as a reduced peak in the automated sequencing file.

Q10. Perform a visual inspection of the following DNA sequence. Can you suggest an

origin/function for this sequence?

The presence of the CAAT box indicates that the short sequence is the promoter region with the

coding sequence. There are also a few sites for the restriction enzymes in the sequence.

Q11. You have a large linear fragment of dsDNA of unknown length. To determine the size, you

have digested the fragment with three different restriction enzymes that have six base pair

recognition sites and run the resulting fragments on the agarose gel. The three digests gave 5,6

and 7 fragments respectively. Having forgotten to run size markers you are forced to estimate the

length of the original fragment (assuming a strictly random sequence). What is your estimate of

the length in bp’s?

Assume that the four nucleotides occurred randomly, the possibility of the occurrence of the

recognition site for the restriction enzyme is 46=4096. Thus, for each 4096 bp, there will be a

restriction enzyme. The restriction enzyme digested 5,6 and 7 fragments with an average of 6.

Thus the sequence size will be 6 x 4096 = 24576bp.

Q12. The sequence is 5’CAATTTCACACAGGAAACAGCTACGACCATG 3’.

Q13. The experiment can performed following the procedures outlined follows: the gene of

interest is inserted into the plasmid after it is isolated. The gene of interest is ligated into the

plasmid as the vector for further step, the amplification. The plasmid is cut by using the same

restriction enzymes used to cut the gene of interest. The EcoRI and SalI restriction enzymes aids

the identification of the insert into the plasmid. The DNA segment will be ligated into the LacZ

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 56

Page 57: Introduction

GEN 3040

portion. The transformed plasmid will be grown on the X-gal plate. Colonies with white color

indicate that the β-galactosidase enzyme is not produced as the lacZ gene is disrupted. Contrast

to the normal/undisrupted lacZ gene, the production of β-galactosidase caused the cells to turn

into blue color colonies.

The white colonies will be isolated and extracted where the successful transformed

recombinant cloned will be cloned and digested by using the restriction enzymes such as EcoRI,

BglII and HindIII to produce different fragment sizes. The different fragment sizes can be

separated by using agarose gel electrophoresis that will be loaded with the marker gene. At the

same time, another set of digest is performed by using the original plasmid that is also digested

by similar restriction enzymes. The results of fragmentations will be compared to each other

(Glick and Pasternak, 2003).

As the vector orientation is identified, the DNA sequencing can be performed to ensure

that the size is 400bp. The fragmented plasmid is isolated and the agarose gel is purified. Next,

methods such as chain termination, chemical degradation or automated sequencing can be

performed to identify the sequences of the fragments. As only single stranded DNA is required,

chain termination method is preferred. BglII and HindIII restriction enzymes will cut the 400bp

fragments from 5’ to 3’ direction. The sequences obtained are shown as below:

5’ A↓GATCT……………..AAGCTT 3’

3’TCTAGA……………...TTCGA↓A 5’

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 57

Page 58: Introduction

GEN 3040

References

Brown TA., (2006). Gene cloning and DNA analysis: An Introduction 5 th Ed. Blackwell

Publishing, UK.

Glick BR., and Pasternak JJ., (2003). Molecular Biology: Principles and Aplications of

Recombinant DNA 3rd Ed. ASM Press, US.

Sanger F., (1981). Determination of nucleotides sequences in DNA. Science, New Series

214:1205-1210

NCBI (National Center for Biotechnology Information) (2010) BLAST.

URL: http://www.ncbi.nlm.nih.gov/blast/Blast.cgi

Date accessed: 15th August 2010

Plasmid isolation, automated DNA sequencing and bioinformatics analysis Page | 58