mining bacterial genomes for antimicrobial targets

5
470 Reviews MOLECULAR MEDICINE TODAY, DECEMBER 2000 (VOL. 6) 1357-4310/00/$ - see front matter © 2000 Elsevier Science Ltd. All rights reserved. PII: S1357-4310(00)01815-3 as well as a broad range of gram-negative and gram-positive eubacterial species and it is estimated that .100 additional microbial sequencing projects are under way (see TIGR homepage; Box 1). Some of these are available for analysis as partial genome sequences [see unfinished genomes homepage at the National Center for Biotechnology Information (NCBI, Bethesda, MD, USA); Box 1]. Computational genome analysis From the point of view of an industrial scientist, the goal of computational analysis of microbial genomes is to extract as much information as possible in silico, in order to simplify target selection for antimicrobial drug discovery. Given the vast amount of data, this is a difficult task. Parameters that are critical for bioinformatics-based target selection include the degree of conservation of potential targets in various bacterial species and the success with which functional hypotheses can be generated. Developments in bioinformatics in recent years provide a set of tools that can be used in an integrated way to select genes of particular interest 4 (Fig. 1). Thus, specific lists of target candidates are generated that can then be validated rapidly using bacterial genetics 5 . Homology-based analyses The wealth of genomic information from a broad variety of evolutionarily distant bacterial species makes the automated comparison of bacterial genomes a powerful tool with which to categorize genes and the proteins they encode 5,6 . In addition to the primary methodology of using sequence comparison programs, such as BLAST or PSI- BLAST, the concept of orthologous families in establishing the clusters of orthologous groups (COGs) has set the basis for homology-based genome comparisons 7 . The gene categories that are generated by these approaches allow for the preselection of target candidates on a whole- genome scale, that is, scientists are able to define targets according to the tics required from a given antibacterial treatment. For example, genes that have orthologs in a wide variety of evo- lutionary distant organisms are target candidates for broad-spectrum applications. Conversely, genes that are present only in a small subset of bacterial species represent possible targets for narrow-spectrum antibacterial compounds. Using this approach, target candidates can THE use of antimicrobial drugs to control infectious diseases is one of the greatest achievements of medicine in this century and the world market for these drugs is now an estimated US$23 billion (Ref. 1). Most of the existing classes of antibiotics were discovered by systematic screening of natural-product libraries within two decades of the intro- duction of penicillin. The difficulty of introducing novel chemical classes for antibacterial treatment is demonstrated by the fact that linezolid (Zyvox) is the first new chemical class of antibiotic to receive FDA approval in over 30 years. Antibiotic therapy also faces severe challenges in other ways, namely resistance to the well-established antibiotics, the increased occurrence of opportunistic infections and the emergence of novel pathogens associated with chronic diseases (e.g. Helicobacter pylori). Short-term measures, such as chemical modification of existing antibiotics and development of inhibitors of resistance genes, will have a significant impact on antibacterial therapy in the immediate future. However, it is evident that the field of anti- biotic therapy requires additional targets, innovative assay-development strategies and new chemical entities 2 . Bacterial genome sequencing: current status The Institute for Genomic Research (TIGR, Rockville, MD, USA) uses the strategy of shotgun sequencing combined with computer- based assembly of DNA sequence fragments to a closed genome and has set the speed for sequencing microbial genomes with the pub- lication of the first, complete bacterial whole-genome sequence (Haemophilus influenzae) in 1995 (Ref. 3). To date, ,30 bacterial whole-genome sequences are available publicly, covering the Archaea Hannes Loferer PhD Senior Director Microbiology GPC Biotech AG, Fraunhoferstrasse 20, D-82152 Martinsried/Munich, Germany. Tel: 149 89 85652650 Fax: 149 89 85652610 e-mail: [email protected] Mining bacterial genomes for antimicrobial targets Hannes Loferer The elucidation of whole-genome sequences is expected to have a revolutionary impact on the discovery of novel medicines. With the availability of complete genome sequences of more than 30 different species, the field of antimicrobial drug discovery has the opportunity to access a remarkable diversity of genomic information. In this review, I summarize how microbial genomics has changed strategies of drug discovery by applying bioinformatics, novel genetic approaches and genomics-based technologies, including analysis of gene expression using DNA microarrays.

Upload: hannes

Post on 19-Sep-2016

216 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Mining bacterial genomes for antimicrobial targets

470

R ev i e w s MOLECULAR MEDICINE TODAY, DECEMBER 2000 (VOL. 6)

1357-4310/00/$ - see front matter © 2000 Elsevier Science Ltd. All rights reserved. PII: S1357-4310(00)01815-3

as well as a broad range of gram-negative and gram-positive eubacterialspecies and it is estimated that .100 additional microbial sequencingprojects are under way (see TIGR homepage; Box 1). Some of these areavailable for analysis as partial genome sequences [see unfinishedgenomes homepage at the National Center for BiotechnologyInformation (NCBI, Bethesda, MD, USA); Box 1].

Computational genome analysisFrom the point of view of an industrial scientist, the goal of computationalanalysis of microbial genomes is to extract as much information aspossible in silico, in order to simplify target selection for antimicrobialdrug discovery. Given the vast amount of data, this is a difficult task.Parameters that are critical for bioinformatics-based target selectioninclude the degree of conservation of potential targets in variousbacterial species and the success with which functional hypothesescan be generated. Developments in bioinformatics in recent yearsprovide a set of tools that can be used in an integrated way to selectgenes of particular interest4 (Fig. 1). Thus, specific lists of targetcandidates are generated that can then be validated rapidly usingbacterial genetics5.

Homology-based analysesThe wealth of genomic information from a broad variety of evolutionarilydistant bacterial species makes the automated comparison of bacterialgenomes a powerful tool with which to categorize genes and theproteins they encode5,6. In addition to the primary methodology ofusing sequence comparison programs, such as BLAST or PSI-BLAST, the concept of orthologous families in establishing the clustersof orthologous groups (COGs) has set the basis for homology-basedgenome comparisons7. The gene categories that are generated by theseapproaches allow for the preselection of target candidates on a whole-genome scale, that is, scientists are able to define targets according tothe tics required from a given antibacterial treatment.

For example, genes that have orthologs in a wide variety of evo-lutionary distant organisms are target candidates for broad-spectrumapplications. Conversely, genes that are present only in a small subsetof bacterial species represent possible targets for narrow-spectrumantibacterial compounds. Using this approach, target candidates can

THE use of antimicrobial drugs to control infectious diseases is one ofthe greatest achievements of medicine in this century and the worldmarket for these drugs is now an estimated US$23 billion (Ref. 1).Most of the existing classes of antibiotics were discovered by systematicscreening of natural-product libraries within two decades of the intro-duction of penicillin. The difficulty of introducing novel chemicalclasses for antibacterial treatment is demonstrated by the fact thatlinezolid (Zyvox) is the first new chemical class of antibiotic toreceive FDA approval in over 30 years. Antibiotic therapy also facessevere challenges in other ways, namely resistance to the well-establishedantibiotics, the increased occurrence of opportunistic infections andthe emergence of novel pathogens associated with chronic diseases(e.g. Helicobacter pylori). Short-term measures, such as chemicalmodification of existing antibiotics and development of inhibitors ofresistance genes, will have a significant impact on antibacterial therapyin the immediate future. However, it is evident that the field of anti-biotic therapy requires additional targets, innovative assay-developmentstrategies and new chemical entities2.

Bacterial genome sequencing: current statusThe Institute for Genomic Research (TIGR, Rockville, MD, USA)uses the strategy of shotgun sequencing combined with computer-based assembly of DNA sequence fragments to a closed genome andhas set the speed for sequencing microbial genomes with the pub-lication of the first, complete bacterial whole-genome sequence(Haemophilus influenzae) in 1995 (Ref. 3). To date, ,30 bacterialwhole-genome sequences are available publicly, covering the Archaea

Hannes Loferer PhDSenior Director Microbiology

GPC Biotech AG, Fraunhoferstrasse 20, D-82152Martinsried/Munich, Germany.

Tel: 1149 89 85652650Fax: 1149 89 85652610

e-mail: [email protected]

Mining bacterial genomes forantimicrobial targets

Hannes Loferer

The elucidation of whole-genome sequences is expected to have a revolutionary impact on the discoveryof novel medicines. With the availability of complete genome sequences of more than 30 different species,the field of antimicrobial drug discovery has the opportunity to access a remarkable diversity of genomicinformation. In this review, I summarize how microbial genomics has changed strategies of drug discoveryby applying bioinformatics, novel genetic approaches and genomics-based technologies, includinganalysis of gene expression using DNA microarrays.

Page 2: Mining bacterial genomes for antimicrobial targets

be selected that are specific to the spectrum of bacterial species foundin a given disease (e.g. H. pylori in the case of duodenal ulcers8). Thiscategory of targets is particularly important in chronic infections. Here,narrow-spectrum drugs would be highly advantageous with regard tothe spread of resistance and the side effects caused by destruction of thecommensal bacterial flora, both of which are major disadvantages oflong-term treatment with broad-spectrum antibiotics.

Comparative genomics of closely related organisms will provideinformation about a given species9,10 or even from strain variants8. Forexample, the comparative analysis of the genomes of Chlamydiatrachomatis and Chlamydia pneumoniae has generated testablehypotheses of genes that might be responsible for the differences intropism and pathologies between these two organisms9.

The concept of homology-based comparisons is enhanced throughthe analysis of phylogenetic profiles11,12. The phylogenetic profiles foreach family of orthologs are defined by the set of genomes in which thefamily is represented. Genes that function within the same pathwayfrequently have the same phylogenetic profile, given that a respectivepathway rarely is present in every organism. Searching the genome forproteins that have phylogenetic profiles that match the one from knowncomponents of that pathway, for example, can identify additionalmembers of a relevant pathway12.

Structural genomicsProteins with low amino acid sequence identity frequently have similarfolds. This is consistent with the observation that, of all the three-dimensional (3D) protein structures determined to date, only ,700 proteinfolds are observed13. Thus, a powerful approach for assigning previouslyunknown molecular function to a protein is to determine the 3D structureof the protein and compare this, rather than the amino acid sequence, withthose of the protein structure database. If there are significant structuralhomologs, the protein under investigation is predicted to have similarmolecular properties. These predictions can than be tested experimentally14.

The success rate for obtaining high-quality protein crystals isunpredictable, however, and this step represents a crucial time lag insuch projects. However, the power of the structural genomics approachwill increase when a larger number of protein structures become available.Furthermore, some computational modelingstrategies indicate that this approach providesimproved functional predictions, compared toalignments of amino acid sequences15.

Motif analysisThe rapid growth of publicly available DNAand protein sequence databases, combinedwith large amounts of experimental data, hasidentified signature sequence motifs indicativeof certain biochemical activities. Several data-bases exist that search for such motifs in a newsequence (e.g. the PROSITE database; Box 1).

Arigoni et al. identified a number ofpotential antibacterial targets among geneswith unknown functions5. Motif analysisproposed that three of these targets encodenucleotide-binding proteins, one encodes ametalloprotease and no motifs were detectedin the remaining proteins5. A decision oninvesting resources for assay developmentwould favor the putative metalloprotease, as

inhibitors of this class of proteins have already been developed.This shows that the identification of biochemical signature motifs isvaluable in order to prioritize target candidates.

Metabolic modelingIn addition to categorizing genes purely by sequence-based comparisons,the coding sequences elucidated in new sequencing projects can be

471

R ev i e w sMOLECULAR MEDICINE TODAY, DECEMBER 2000 (VOL. 6)

Figure 1. Mining genomic information for target and drug discovery. Data from computational and ex-perimental genome analyses can be used in an integrated manner to select targets and prioritize drug candidates (see text for details).

Molecular Medicine Today

Computational genome analysis• Comparative genomics• Phylogenetic patterns• Motif analysis• Molecular/metabolic• Gene fusion

Experimental genome analysis• Microarrays• Proteomics• Genetics• Interaction technologies

Genomic information base

Target identification/drug development priorities

Box 1. Some useful web pages

Basic Linear Alignment Search Tool (BLAST) and PSI-BLAST (iterative BLAST) at NCBI:http://www.ncbi.nlm.nih.gov/BLAST/

Clusters of Orthologous Groups (COGs) of proteins at NCBI:http://www.ncbi.nlm.nih.gov/COG/

TIGR web site. Contains a comprehensive list of links to genome sequencing projects and databases:http://www.tigr.org

Unfinished genomes web site at NCBI. Allows homology searchingin genome sequences that are not complete:http://www.ncbi.nlm.nih.gov/Microb_blast/unfinishedgenome.html

PROSITE database of protein families and domains:http://www.expasy.ch/prosite/

WIT web site. A resource from Argon Ntl Labs for functional assignment of genes:http://wit.mcs.anl.gov/WIT2

KEGG web site: resource for searching pathway and other genomedatabases:http://www.genome.ad.jp/kegg/kegg.html

Page 3: Mining bacterial genomes for antimicrobial targets

mapped to metabolic pathways. Although metabolic modeling basedon whole-genome sequence information is still a new field, somestrategies have emerged. The bioinformatics company, DoubleTwist(Oakland, CA, USA), uses the approach of comparing annotatedgenome sequences to reference databases of cellular pathwaysamassed mainly using Escherichia coli and Bacillus subtilis16. Theprocedure assigns a score to each pathway that indicates the strength ofthe evidence of the particular pathway in the organism in question.This strategy allows the assessment of the metabolic capabilities of anewly sequenced organism, identification of putatively essentialpathways and postulation of missing components of pathways. Karpet al.16 outline the results of such an analysis for H. pylori, postulating74 pathways for this organism, compared with 138 for E. coli.

Other approaches are represented by the Kyoto Encyclopedia ofGenes and Genomes (KEGG) and What is There? (WIT) web sites(see Box 1). These approaches do not take an annotated genome astheir starting point rather they use raw protein sequences. Furthermore, theKEGG database disregards organism-specific composition of pathwaysand neither integrate experimental information from the literature.

Putative physical interactionsRecently, a computational method to infer protein interactions fromgenome sequences has been proposed, based on the observation thatsome pairs of proteins that interact have homologs in other organismsthat are fused into a single protein chain17. Several interactions knownfrom the literature have been confirmed in this study. For example, inE. coli the GyrA and GyrB subunits of DNA gyrase form a complexand both components are fused within a single protein in yeasttopoisomerase II (Ref. 17). Thus, this strategy might provide functionalinformation for putative target proteins.

Experimental functional genomicsLarge-scale geneticsAs an alternative to the strategy of placing bioinformatic analyses at theforefront of target selection, efficient genome-wide genetic technologiesthat allow the experimental scanning of the bacterial genome forantibacterial targets in a reasonable timescale have been developed.

The basis for these approaches is the knowledge of the whole-genomesequence of a given organism. Bioinformatic tools, as outlined in theprevious section, will then be used to select the best targets for drug-discovery programs.

Traditionally, the search for novel genes required for bacterialsurvival or virulence has been based on a variety of methods to inducerandom mutagenesis of a bacterial genome followed by scoring for therelevant phenotype18. Until recently, however, state-of-the-art methods(e.g. transposon or chemically induced mutagenesis) were unlikely toinvolve all genes encoded by a bacterial genome.

A breakthrough in using transposon mutagenesis to map genesrequired for viability has been achieved by combining transpositiontechniques in vitro with natural transformation of some bacterialspecies19. In this method (called genomic analysis and mapping by in vitro transposition, or GAMBIT) the genome of the respectivebacterium is covered by an overlapping set of long-range PCR products,each of approximately 10 000 bp. Each PCR product is subjected to in vitro transposition, a procedure that leads to high frequency, almostcompletely random insertions into the target DNA (Ref. 20).Subsequently, the bacterial strain is transformed with the mutagenizedPCR product and the locations of transposon insertions are mappedby PCR. With this procedure, several insertions are obtained withineach open reading frame (ORF). Hence, genes essential for survivalunder standard laboratory conditions can be recognized by their lackof transposon insertions. Thus, within a relatively short period of time,the majority of essential genes of a small-genome bacterium, such asH. influenzae, can be mapped. Furthermore, the pool of transposonmutants obtained using GAMBIT can be used to identify genes thatare required under non-standard conditions, including virulence,similar to the signature-tagged mutagenesis approach21.

Microarray analysisSeveral technologies, such as gene expression profiling using DNA microarrays, large-scale protein interaction mapping22 and proteomics23,have gained paramount importance with the knowledge of whole-genome sequences24 and will be instrumental in functional analysis ofgenes and identification of target candidates.

In the microarray approach, DNA probes specific for the entireset of genes of an organism are immobilized on a solid support.Such an array can be hybridized with a labelled RNA sample andthe hybridization pattern reflects the expression status of thegenome under a given condition (see Ref. 24 for an overview of thistechnology).

Using microarray technology, functional information is gained byinvestigating the temporal patterns of gene expression of a biologicalprocess of interest. Because it can be assumed that genes with relatedfunctions tend to be expressed in similar patterns, possible roles forgenes of unknown function can be suggested, based on their temporalassociation with genes of known function25. In genetically accessibleorganisms, these hypothetical gene functions can then be testedby mutation of the genes in question and the analysis of theireffects.

A landmark study of this kind has analyzed the transcriptionalprogram of sporulation in yeast25. Of the 6200 protein-encodinggenes in the yeast genome, .1000 showed significant changes inmRNA levels during sporulation. The investigation of the levelof expression of every yeast gene in the course of sporulationprovides clues to potential functions of hundreds of previouslyuncharacterized genes.

472

R ev i e w s MOLECULAR MEDICINE TODAY, DECEMBER 2000 (VOL. 6)

GlossaryBLAST – Basic Linear Alignment Search Tool developed at NCBI(see Box 1).

Clusters of orthologous groups (COGs) – Orthologous gene families are grouped into clusters (see Box 1).

Orthologs – Conserved genes that perform the same function indifferent species. Orthologs evolve by speciation.

PSI-BLAST – A modification of BLAST that performs iterative homology searches (see Box 1).

Phylogenetic profile – The pattern of occurrence of orthologs indifferent species.

Surrogate ligands – Non-natural ligands (e.g. peptides).

Tropism – Some bacteria can only infect specific tissues in a hostbecause of specific mechanisms for interaction with the host (e.g.adhesion). This phenomenon is called tropism.

Page 4: Mining bacterial genomes for antimicrobial targets

This analysis of the already well-characterized process of sporulationbenefits from the fact that the biochemical functions of many proteinsrequired for this process are well established, which leads to highlyaccurate predictions about protein function.

However, this approach is also relevant to antibacterial researchas it can generate functional hypotheses even in less-studied systems.For example, 34 ORFs of unknown function were shown to be up- ordownregulated following heat-shock of E. coli (Ref. 26). Similar tothe experiments described above, bacteria can be challenged withconditions that are relevant for antibacterial therapy. For example,genes can be identified that are induced in animal models or in in vitro models of infection21. Zhang and Normark investigated thechanges in the transcriptional program of uropathogenic E. coli followingP-pili-mediated adhesion to its host cell receptor27. At the time of thisstudy, the whole-genome sequence of E. coli was not available andthe authors used differential RNA display to assess differences ingene expression. One set of genes induced following adhesion encoded a,hitherto uncharacterized, two-component regulatory system. A mutant,defective in responding to iron starvation, was detected. In contrast tothe wild-type bacteria, this mutant is unable to grow in urine, indicatingthat these genes are required under conditions prevailing in theurinary tract.

Genes of unknown functionA significant proportion of all bacterial genes sequenced to date areof unknown function28. Even in well-studied model organisms such asE. coli, approximately 38% of all genes fall into this category28,29.Two pilot studies show that novel antibacterial drug targets can bediscovered within this category of genes. First, 26 genes of unknownfunction that are broadly conserved among diverse bacterial species(including Mycoplasma genitalium, which is the smallest bacterialgenome sequenced so far) were investigated using bioinformaticsand genetics, and six novel genes essential for the growth of E. coliand B. subtilis identified5. The second study, a global transposonmutagenesis experiment using M. genitalium, revealed that ,100genes of unknown function are essential for growth of this organismunder laboratory conditions30.

Concluding remarksFrom the point of view of drug discovery, microbial genomics faceschallenging questions, such as, how quickly can whole-genomesequence information be generated, how efficiently can relevant targetsbe identified and, how can genomics-based technologies be integratedinto the drug discovery process? The question of whether genomicsfundamentally changes the discovery of antimicrobial drugs must alsobe asked. In this regard, microbial genomics can be viewed as a fore-runner for the field of human genomics, which will eventually face thesame questions.

Given the large number of complete bacterial genomes alreadysequenced, there is no doubt that microbial whole-genome informationcan be generated rapidly. Furthermore, the technologies describedin this review will identify most of the targets required for bacterialsurvival. One might argue that there are many potential antibacterialtargets described in the literature that have not been explored inantibiotic research. However, given the fact that resistance develop-ment will always be a part of the game, all possible targets ought tobe explored. This strategy is supported by the fact that technologiessuch as surrogate-ligand-based screening are now available, thatenable screening for inhibitors in a uniform assay format without

detailed functional knowledge31. This will allow the exploration ofa large number of targets for lead candidates in a manageabletimescale.

Furthermore, genomics-based technologies (e.g. DNA chips) willalso revolutionize diagnostics. Novel and rapid diagnostic techniqueswill allow us to identify targets for narrow-spectrum antibiotics foruse in acute infections, which up to now are only tackled by broad-spectrum drugs. The main limiting factor for putting this vision intopractice is technology development: at the moment genomicstechnologies are expensive, and issues such as the reproducible production of DNA arrays on an industrial scale or handling of largedata sets are still a challenge. Eventually, these issues will be resolved, and we will use genomics technologies as an integrated tool.Thereby, the results of computational analyses, genetics and otherfunctional experiments (including gene-expression profiling, protein-interaction studies, etc.) will be integrated into databases. Such databaseswill also include data obtained from the analysis (e.g. by gene-expression profiling) of the biological effects of drug candidates viathe drug discovery process24.

Acknowledgement. I thank Bernd Hutter for comments on the manuscript.

References1 Halls, G. (1999) The Complete Guide to Anti-Infectives, PJB Publications Ltd

2 Schmid, M.B. (1998) Novel approaches to the discovery of antimicrobial agents.

Curr. Opin. Chem. Biol. 2, 529–534

3 Fleischmann, R.D. et al. (1995) Whole-genome random sequencing and assembly of

Haemophilus influenzae Rd. Science 269, 496–512

4 Galperin, M.Y. and Koonin, E.V. (2000) Who’s your neighbor? New computational

approaches for functional genomics. Nat. Biotechnol. 18, 609–613

5 Arigoni, F. et al. (1998) A genome-based approach for the identification of essential

bacterial genes. Nat. Biotechnol. 16, 851–856

6 Mushegian, A.R. and Koonin, E.V. (1996) A minimal gene set for cellular life derived

by comparison of complete bacterial genomes. Proc. Natl. Acad. Sci. U. S. A. 93,

10268–10273

7 Tatusov, R.L. et al. (1997) A genomic perspective on protein families. Science 278,

631–637

8 Alm, R.A. et al. (1999) Genomic-sequence comparison of two unrelated isolates of

the human gastric pathogen Helicobacter pylori. Nature 397, 176–180

9 Kalman, S. et al. (1999) Comparative genomes of Chlamydia pneumoniae and

C. trachomatis. Nat. Genet. 21, 385–389

10 Cole, S.T. (1998) Comparative mycobacterial genomics. Curr. Opin. Microbiol. 1,

567–571

11 Gaasterland, T. and Ragan, M.A. (1998) Microbial genescapes: phyletic and

functional patterns of ORF distribution among prokaryotes. Microb. Comp.

Genomics 3, 199–217

473

R ev i e w sMOLECULAR MEDICINE TODAY, DECEMBER 2000 (VOL. 6)

The outstanding questions

• Will genome-sequencing technology improve to a levelwhere whole-genome sequences can be generated rapidly,even from strain variants of given species?

• Will genomics-based methods become more rapid andspecific than traditional molecular diagnostics?

• Will decision-making during the drug discovery process bemade more efficient and rational by integrating genomicstechnologies?

Page 5: Mining bacterial genomes for antimicrobial targets

474

R ev i e w s MOLECULAR MEDICINE TODAY, DECEMBER 2000 (VOL. 6)

12 Pellegrini, M. et al. (1999) Assigning protein functions by comparative genome

analysis: protein phylogenetic profiles. Proc. Natl. Acad. Sci. U. S. A. 96,

4285–4288

13 Thornton, J.M. et al. (1999) Protein folds, function and evolution. J. Mol. Biol. 293,

333–342

14 Zarembinski, T.I. et al. (1998) Structure-based assignment of the biochemical func-

tion of a hypothetical protein: A test case of structural genomics. Proc. Natl. Acad.

Sci. U. S. A. 95, 15189–15193

15 Grigoriev, I.V. and Kim, S-H. (1999) Detection of protein fold similarity based on

correlation of amino acid properties. Proc. Natl. Acad. Sci. U. S. A. 96,

14318–14323

16 Karp, P.D. et al. (1999) Integrated pathway-genome databases and their role in drug

discovery. Trends Biotechnol. 17, 275–281

17 Marcotte, E.M. et al. (1999) Detecting protein function and protein-protein interac-

tions from genome sequences. Science 285, 751–753

18 Schmid, M.B. et al. (1989) Genetic analysis of temperature-sensitive lethal mutants

of Salmonella typhimurium. Genetics 123, 625–633

19 Akerley, B.J. et al. (1998) Systematic identification of essential genes by in-vitro

mariner mutagenesis. Proc. Natl. Acad. Sci. U. S. A. 95, 8927–8932

20 Devine, S.E. and Boeke, J.D. (1994) Efficient integration of artificial transposons into

plasmid targets in vitro: a useful tool for DNA mapping, sequencing and genetic

analysis. Nucleic Acids Res. 22, 3765–3772

21 Hensel, M. et al. (1995) Simultaneous identification of bacterial virulence genes by

negative selection. Science 269, 400–403

22 Uetz, P. et al. (2000) A comprehensive analysis of protein-protein interactions in

Saccharomyces cerevisiae. Nature 403, 623–627

23 Wang, J.H. and Hewick, R.M. (1999) Proteomics in drug discovery. Drug Discovery

Today 4, 129–133

24 Ivanov, I. et al. (2000) DNA microarray technology and antimicrobial drug

discovery. Pharmacogenomics 1, 169–178

25 Chu, S. et al. (1998) The transcriptional program of sporulation in budding yeast.

Science 282, 699–705

26 Richmond, C.S. et al. (1999) Genome-wide expression profiling in Escherichia coli

K12. Nucleic Acids Res. 27, 3821–3835

27 Zhang, J.P. and Normark, S. (1996) Induction of gene expression in Escherichia coli

after pilus-mediated adherence. Science 273, 1234–1236

28 Hinton, J.C.D. (1997) The Escherichia coli genome sequence: the end of an era or

the start of the FUN? Mol. Microbiol. 26, 417–422

29 Blattner, F.R. et al. (1997) The complete genome sequence of Escherichia coli K-12.

Science 277, 1453–1462

30 Hutchison, C.A., III et al. (1999) Global transposon mutagenesis and a minimal

mycoplasma genome. Science 286, 2165–2169

31 Loferer, H. et al. (2000) Integrated bacterial genomics for the discovery of novel

antimicrobials. Drug Discovery Today 5, 107–114

Recent articles in Molecular Medicine Today

Mitochondrial DNA mutations in the pathogenesis of human diseasePatrick F. Chinnery and Douglas M.Turnbull

Adeno-associated virus vectors for gene therapy: More pros than cons?Paul E. Monahan and R. Jude Samulski

Molecular aberrations in human systemic lupus erythematosusGeorge C. Tsokos and Gary M. Kammer

X-linked severe combined immunodeficiency: from molecular cause to genetherapy within seven yearsWarren J. Leonard

Cellular and transgenic animal models of cerebrovascular amyloidosisWilliam E. Van Nostrand

p53 gene therapy: a key to modulating resistance to anticancer therapies? Esther H. Chang, K.F. Pirollo and K.B. Bouker