crystal structure of a bacterial ribonuclease p rnarna could not be traced because of disorder in...

6
Crystal structure of a bacterial ribonuclease P RNA Alexei V. Kazantsev , Angelika A. Krivenko , Daniel J. Harrington , Stephen R. Holbrook § , Paul D. Adams , and Norman R. Pace Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, CO 80309; Stanford Synchrotron Radiation Laboratory, Stanford University, Menlo Park, CA 94025; and § Physical Biosciences Division and Computational Crystallography Initiative, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 Contributed by Norman R. Pace, August 3, 2005 The x-ray crystal structure of a 417-nt ribonuclease P RNA from Bacillus stearothermophilus was solved to 3.3-Å resolution. This RNA enzyme is constructed from a number of coaxially stacked helical domains joined together by local and long-range interac- tions. These helical domains are arranged to form a remarkably flat surface, which is implicated by a wealth of biochemical data in the binding and cleavage of the precursors of transfer RNA substrate. Previous photoaffinity crosslinking data are used to position the substrate on the crystal structure and to identify the chemically active site of the ribozyme. This site is located in a highly conserved core structure formed by intricately interlaced long-range interac- tions between interhelical sequences. ribozyme RNA crystallography tRNA processing R Nase P catalyzes hydrolysis of a phosphodiester bond in pre- cursors of transfer RNA (tRNA) to form the 5-phosphory- lated mature tRNA with the release of a 5-precursor fragment (1, 2). RNase P homologs occur in all organisms, and the cellular RNase P always is a ribonucleoprotein that consists of one large RNA and one or more protein component. In bacteria, RNase P is typically comprised of a 350- to 400-nt RNA and one 120-aa basic protein. Although both RNA and protein components are neces- sary for cell viability, in vitro at high salt concentrations, bacterial RNase P RNA can act as a catalyst independently of protein (3). Bacterial RNase P is a ribozyme, an RNA-based enzyme. Knowledge of the structure of RNase P RNA is essential for understanding its function, and structure has been the focus of numerous studies of the RNA. Phylogenetic comparative analyses of RNase P RNA sequences have established the secondary and some tertiary structure of the RNA in a broad diversity of organ- isms (4–8). Photochemical crosslinking studies provided structural information to orient the helical elements and identified nucleo- tides associated with the active site of the RNA (9, 10). There are two major structural types of bacterial RNase P RNA, A (ancestral) and B (Bacillus), which differ in a number of structural elements attached to a homologous conserved structure. About two-thirds of any bacterial RNase P RNA is shown by sequence covariations to be involved in Watson–Crick base-pairing interactions, but the interactions that form the global structure have been speculative. To gain a better understanding of bacterial RNase P, we crys- tallized and solved the structure of a 417-nt B-type RNase P RNA from Bacillus stearothermophilus, a moderately thermophilic, low GC Gram-positive bacterium. Although the structure does not yet explain the chemical mechanism of catalysis, it is in agreement with a wealth of available biochemical and comparative data, and it provides a structural context for the chemically active site of this ribozyme. Materials and Methods RNA Purification, Crystallization, and Data Collection. As detailed in supporting information, which is published on the PNAS web site, RNA was transcribed in vitro with T7 phage RNA polymerase from linearized plasmid pBstHH 2 . Transcribed RNA was purified by preparative electrophoresis in denaturing PAGE, followed by non- denaturing ion exchange chromatography, rapid dialysis, and con- centration to 10 –20 mgml. Diffraction quality crystals of annealed RNA were grown by vapor diffusion from agarose gels within 9 months. To prepare heavy atom derivatives, crystals were soaked in the cryoprotectant solution, containing Os(III) hexamine trif late or Pb(OAc) 2 (see supporting information). Cryoprotected crystals were flash-cooled in liquid nitrogen before data collection. Dif- fraction data were collected at 100 K. Inverse beam strategy was used to collect anomalous data for Os and Pb derivatives. All data sets were processed with D*TREK (11). Structure Determination. Crystallographic phases were solved to 3.75-Å resolution by a three-wavelength multiwavelength anoma- lous diffraction experiment on the Os(III) hexamine derivative and extended to 3.3-Å resolution by density modification against native structure factors with CNS (12). The initial model was constructed from phylogenetically predicted helical pieces fit to the relevant regions of the experimental map. As the quality of the combined electron density maps improved upon refinement (supporting information), the remaining bits of the structure were modeled and further refined. Typical cycles of the refinement, interspersed between the rounds of manual rebuilding with program O (13), included restrained simulated annealing by torsion dynamics, crys- tallographic conjugate gradient minimization, and grouped B- factor refinement. Table 1 outlines the quality of the refined model. See supporting information for details on model building and refinement. Results Overall Structure. Crystals of B. stearothermophilus RNase P RNA were developed as outlined in Materials and Methods and used in diffraction analysis. The phylogenetic structure model provided a foundation for building the structure into the electron density maps. Abbreviations: tRNA, transfer RNA; A-type, ancestral type; B-type, Bacillus type; S-domain, specificity domain. Data deposition: The atomic coordinates have been deposited in the Protein Data Bank, www.pdb.org (PDB ID code 2A64). To whom correspondence should be addressed. E-mail: [email protected]. © 2005 by The National Academy of Sciences of the USA Table 1. Refinement statistics Resolution, Å 27–3.30 Number of reflections Overall, F(F) 2 (%) 35,599 (97.8) Test set (%) 1,798 (5.1) Rcryst,* % 32.5 Rfree, % 34.2 Number of atoms 6,383 Bond length rms deviation, Å 0.003 Bond angles rms deviation 0.80 Average B-factor, Å 2 154.4 Crossvalidated coordinate error (LuzzatiA), Å 0.720.80 *R cryst F o F c F o , where F o observed structure factor amplitude, F c calculated structure factor amplitude. R free , R-factor based on the test set (data excluded from the refinement). Calculated with CNS. 13392–13397 PNAS September 20, 2005 vol. 102 no. 38 www.pnas.orgcgidoi10.1073pnas.0506662102 Downloaded by guest on January 1, 2021

Upload: others

Post on 12-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Crystal structure of a bacterial ribonuclease P RNARNA could not be traced because of disorder in the crystal. The structure of the traceable part of the RNA, which contains all of

Crystal structure of a bacterial ribonuclease P RNAAlexei V. Kazantsev†, Angelika A. Krivenko†, Daniel J. Harrington‡, Stephen R. Holbrook§, Paul D. Adams¶,and Norman R. Pace†�

†Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, CO 80309; ‡Stanford Synchrotron RadiationLaboratory, Stanford University, Menlo Park, CA 94025; and §Physical Biosciences Division and ¶Computational Crystallography Initiative,Lawrence Berkeley National Laboratory, Berkeley, CA 94720

Contributed by Norman R. Pace, August 3, 2005

The x-ray crystal structure of a 417-nt ribonuclease P RNA fromBacillus stearothermophilus was solved to 3.3-Å resolution. ThisRNA enzyme is constructed from a number of coaxially stackedhelical domains joined together by local and long-range interac-tions. These helical domains are arranged to form a remarkably flatsurface, which is implicated by a wealth of biochemical data in thebinding and cleavage of the precursors of transfer RNA substrate.Previous photoaffinity crosslinking data are used to position thesubstrate on the crystal structure and to identify the chemicallyactive site of the ribozyme. This site is located in a highly conservedcore structure formed by intricately interlaced long-range interac-tions between interhelical sequences.

ribozyme � RNA crystallography � tRNA processing

RNase P catalyzes hydrolysis of a phosphodiester bond in pre-cursors of transfer RNA (tRNA) to form the 5�-phosphory-

lated mature tRNA with the release of a 5�-precursor fragment (1,2). RNase P homologs occur in all organisms, and the cellularRNase P always is a ribonucleoprotein that consists of one largeRNA and one or more protein component. In bacteria, RNase P istypically comprised of a 350- to 400-nt RNA and one �120-aa basicprotein. Although both RNA and protein components are neces-sary for cell viability, in vitro at high salt concentrations, bacterialRNase P RNA can act as a catalyst independently of protein (3).Bacterial RNase P is a ribozyme, an RNA-based enzyme.

Knowledge of the structure of RNase P RNA is essential forunderstanding its function, and structure has been the focus ofnumerous studies of the RNA. Phylogenetic comparative analysesof RNase P RNA sequences have established the secondary andsome tertiary structure of the RNA in a broad diversity of organ-isms (4–8). Photochemical crosslinking studies provided structuralinformation to orient the helical elements and identified nucleo-tides associated with the active site of the RNA (9, 10). There aretwo major structural types of bacterial RNase P RNA, A (ancestral)and B (Bacillus), which differ in a number of structural elementsattached to a homologous conserved structure. About two-thirds ofany bacterial RNase P RNA is shown by sequence covariations tobe involved in Watson–Crick base-pairing interactions, but theinteractions that form the global structure have been speculative.

To gain a better understanding of bacterial RNase P, we crys-tallized and solved the structure of a 417-nt B-type RNase P RNAfrom Bacillus stearothermophilus, a moderately thermophilic, lowG�C Gram-positive bacterium. Although the structure does notyet explain the chemical mechanism of catalysis, it is in agreementwith a wealth of available biochemical and comparative data, andit provides a structural context for the chemically active site of thisribozyme.

Materials and MethodsRNA Purification, Crystallization, and Data Collection. As detailed insupporting information, which is published on the PNAS web site,RNA was transcribed in vitro with T7 phage RNA polymerase fromlinearized plasmid pBstHH2. Transcribed RNA was purified bypreparative electrophoresis in denaturing PAGE, followed by non-denaturing ion exchange chromatography, rapid dialysis, and con-centration to 10–20 mg�ml. Diffraction quality crystals of annealed

RNA were grown by vapor diffusion from agarose gels within 9months. To prepare heavy atom derivatives, crystals were soaked inthe cryoprotectant solution, containing Os(III) hexamine triflate orPb(OAc)2 (see supporting information). Cryoprotected crystalswere flash-cooled in liquid nitrogen before data collection. Dif-fraction data were collected at 100 K. Inverse beam strategy wasused to collect anomalous data for Os and Pb derivatives. All datasets were processed with D*TREK (11).

Structure Determination. Crystallographic phases were solved to3.75-Å resolution by a three-wavelength multiwavelength anoma-lous diffraction experiment on the Os(III) hexamine derivative andextended to 3.3-Å resolution by density modification against nativestructure factors with CNS (12). The initial model was constructedfrom phylogenetically predicted helical pieces fit to the relevantregions of the experimental map. As the quality of the combinedelectron density maps improved upon refinement (supportinginformation), the remaining bits of the structure were modeled andfurther refined. Typical cycles of the refinement, interspersedbetween the rounds of manual rebuilding with program O (13),included restrained simulated annealing by torsion dynamics, crys-tallographic conjugate gradient minimization, and grouped B-factor refinement. Table 1 outlines the quality of the refined model.See supporting information for details on model building andrefinement.

ResultsOverall Structure. Crystals of B. stearothermophilus RNase P RNAwere developed as outlined in Materials and Methods and used indiffraction analysis. The phylogenetic structure model provided afoundation for building the structure into the electron density maps.

Abbreviations: tRNA, transfer RNA; A-type, ancestral type; B-type, Bacillus type; S-domain,specificity domain.

Data deposition: The atomic coordinates have been deposited in the Protein Data Bank,www.pdb.org (PDB ID code 2A64).

�To whom correspondence should be addressed. E-mail: [email protected].

© 2005 by The National Academy of Sciences of the USA

Table 1. Refinement statistics

Resolution, Å 27–3.30Number of reflections

Overall, F��(F) � 2 (%) 35,599 (97.8)Test set (%) 1,798 (5.1)

Rcryst,* % 32.5Rfree,† % 34.2Number of atoms 6,383Bond length rms deviation, Å 0.003Bond angles rms deviation 0.80Average B-factor, Å2 154.4Crossvalidated coordinate error (Luzzati��A),‡ Å 0.72�0.80

*Rcryst � ��Fo� � �Fc����Fo�, where �Fo� � observed structure factor amplitude,�Fc� � calculated structure factor amplitude.

†Rfree, R-factor based on the test set (data excluded from the refinement).‡Calculated with CNS.

13392–13397 � PNAS � September 20, 2005 � vol. 102 � no. 38 www.pnas.org�cgi�doi�10.1073�pnas.0506662102

Dow

nloa

ded

by g

uest

on

Janu

ary

1, 2

021

Page 2: Crystal structure of a bacterial ribonuclease P RNARNA could not be traced because of disorder in the crystal. The structure of the traceable part of the RNA, which contains all of

Helices and some other structural features are evident in experi-mental electron density maps and could easily be matched with thephylogenetic model. The availability of the detailed phylogeneticmodel (14) thus obviated the need to use substituted RNAs toconfirm the sequence register. Approximately one-fourth of theRNA could not be traced because of disorder in the crystal.

The structure of the traceable part of the RNA, which containsall of the elements required for catalysis, is shown in Fig. 1. TheRNA forms six coaxially stacked helical subdomains held together

by a variety of long-range interactions. An unusual feature of theRNase P RNA architecture is the remarkably flat face formed bythe relative spatial arrangement of the coaxially stacked helicalsubdomains (Fig. 1). The long-range interactions that seem partic-ularly important for structural integrity are discussed as follows andinclude: (i) contacts between loop L5.1†† and L15.1; (ii) the dockingof loop L8 into the minor groove of helix P4; and (iii) a complicatedstructure formed by a number of interhelical joining regions (J3�4,J5�15, J15�15.1, J15.2�2, J19�4, and J4�1) that are drawn togetherfrom distant parts of the RNA sequence and form the catalytic core(below).

Docking Between Loops L5.1 and L15.1. The long-range interactionbetween loops L5.1 and L15.1, a characteristic of B-type RNase PRNAs, was proposed first on the basis of phylogenetic comparativeanalysis (15, 16). The crystal structure confirms the interactionbetween these loops, but the details of the interaction are unex-pectedly more complex than simple base pairing. As shown in Fig.2A, L15.1 forms an irregular antiparallel duplex for most of itslength, with 12 of the 16 nucleotides engaged in cross-strand purinestacks (17) and canonical, sheared, and N7-amino A-A base pairs.The base of A291 is flipped out of the L15.1 stack and inserted intothe L5.1 stack of bases between G73 and A74 (Fig. 2A). Theconformation adopted by loop L15.1 poises the bases of nucleotides

††Paired sequences with Watson–Crick complementarity are named P1, P2, etc., in the orderof their occurrence in the RNA sequence, from the 5� to the 3� end. Loops capping suchelements are named according to the name of the elements that they cap, e.g., L3 denotesa loop capping helix P3. Interhelical joining regions are named according to the names ofthe helical elements that they join, e.g., J3�4 denotes a sequence element that joinshelices P3 and P4.

Fig. 1. Overview of the B. stearothermophilus RNase P RNA structure. Theelements of phylogenetically refined secondary structure are colored accord-ing to the coaxially stacked helical domains in the ribbon representations ofthe structure. The nucleotides colored in gray in the secondary structure (partsof P9, P10.1, P12, L15, and P19) could not be modeled because of disorder inthe crystal. See supporting information for an expanded view of the secondarystructure.

Fig. 2. Long-range interactions in the RNase P RNA. (A) Dock between P15.1(blue) and P5.1 (green). Nucleotides A290, A295, and A296 in P15.1 involvedin A-minor interactions with P5.1 are colored in yellow. Nucleotide A291,involved in a long-range cross-strand stacking interaction, is colored in red. (B)Dock between P8 (blue) and P4 (green). Nucleotides A99, A100, A106, andA107 in P8 are involved in A-minor interactions with P4 and are colored inyellow. See supporting information for an expanded view of the secondarystructure.

Kazantsev et al. PNAS � September 20, 2005 � vol. 102 � no. 38 � 13393

BIO

CHEM

ISTR

Y

Dow

nloa

ded

by g

uest

on

Janu

ary

1, 2

021

Page 3: Crystal structure of a bacterial ribonuclease P RNARNA could not be traced because of disorder in the crystal. The structure of the traceable part of the RNA, which contains all of

A290, A295, and A296 to dock into the distorted minor groove ofL5.1 by means of A-minor interactions (18). Additional contacts areprovided by the backbones of A295, A296, and C69 and U70, whichform a canonical ribose zipper (19). All nucleotides that appear tobe critical for this interaction (G73, A74, A290, A291, A295, andA296) are highly conserved in the B-type RNase P RNA (15).

Docking of L8 into the Minor Groove of P4. Interaction between loopL8 and the minor groove of highly conserved helix P4 also waspredicted on the basis of phylogenetic analysis and modeling (15).Again, however, this long-range interaction observed in the crystalstructure is more complicated than predicted. As shown in Fig. 2B,the conformation of L8 allows the bases of A99, A100, A106, andA107 to form a continuous stack and dock into the minor grooveof helix P4 by A-minor interactions (18). As is often the case withA-minor interactions, the backbones of all four docked adenines areinvolved in ribose zippers: A99 and A100 of L8 form a single ribosezipper with A392 and U393 of P4, whereas A106 and A107 of L8form a canonical ribose zipper with C54 and A55 of P4 (19 forribose zipper nomenclature).

Interhelical Joining Regions. Helix P1, which draws together the 5�and 3� ends of the RNA, has been predicted to stack on helix P2 (20)or alternatively on helix P4 (15). In contrast to those predictions,helix P1 in the crystal structure is coaxially stacked on and caps acomplicated structure that draws together interhelical joining re-gions and contains the most highly conserved nucleotides in theRNA. As illustrated in Fig. 3A, nucleotide G14, at the interior endof helix P1, sets up the complex by formation of a dinucleotideplatform-containing base triple (21) with U400 and A401 fromjoining region J4�1 and is stacked on a column of bases drawntogether from J19�4 (A385, C386, A387, and G388), J3�4 (A45),and J15.2�2 (U337, A336). U400 in the base-triple starts anotherstack with A399 (J4�1), A48 (J3�4) and continues with highlyconserved helix P4. These two long base stacks interact with eachother by a canonical ribose zipper (nucleotides A399, U400 andA385, C386) and through a variety of noncanonical interactions,with a third smaller stack of bases comprised of G47, G46, and A49of J3�4. On the opposing side of the latter stack, two short stretchesof nucleotides (A255, A257 of J5�15 and G275, C276 of J15�15.1)connect helix P15 to helix P4. Two more stacks of bases are formed

by nucleotides A389 (J19�4), A330, G331 (J15.2�2), A277 (J15�15.1), and A390 (J19�4), A332, and U333 (J15.2�2) and contributeadditional complexity. Individual conformations adopted by thebackbones of the joining regions vary from fairly stretched (J19�4and J4�1) to sharp turns (J3�4, J5�15, and J15�15.1); J15.2�2 formsa jagged loop (Fig. 3B). A full account of internucleotide interac-tions found in this complicated structure, many involving highlyconserved nucleotides, is listed in supporting information. We pointout that the available crystallographic data do not allow for explicitmodeling of the bound solvent, which is expected to furthercomplicate this structure.

Nucleotide Conservation and the Catalytic Core. Of 65 nucleotidesthat are highly conserved (at the level of at least 80%) in thebacterial RNase P RNA, 48 are found in our model, and analysisof their spatial distribution reveals interesting trends. First, most ofthe absolutely (100%) and many of the highly (�80%) conservednucleotides in this RNA are located in helix P4 and the adjacentstructure formed from joining regions, as described above. Becauseof this evolutionary conservation and the long history of biochem-ical data that implicate the nucleotides in this portion of themolecule in catalytic function (22–27), we believe this regionconstitutes the catalytic core of this ribozyme (Fig. 4A). Patches ofbase conservation found throughout the rest of the structure aredistributed mostly along the flat surface formed by the coaxiallystacked helical subdomains and lead from the catalytic core towardhelices P10 and P11, at the base of the highly conserved, so-called‘‘specificity domain’’ (‘‘S-domain’’) of the RNA (28). Such spatialdistribution of conserved residues probably reflects conservation ofRNase P properties that are important for recognition of thesubstrate (below), which is expected to bind to RNase P RNA alongthe flat surface with the T-loop docked to the base of the S-domainand the 5� end of the acceptor stem (site of RNase P processing)bound to the core (9, 10, 28).

Identification of the Chemically Active Site. The core as we define it,including helix P4 and adjacent interhelical joining regions (Fig.3A), contains �40 nucleotides. Closer perspective on the active siteof this ribozyme is provided by photoaffinity crosslinking experi-ments that identified a small set of nucleotides located within 9 Åfrom the product 5�-phosphate of tRNA in the RNase P–tRNA

Fig. 3. Catalytic core of bacterial RNase P RNA. (A) Slab diagram representing stacking interactions in the core. Only some base pairing interactions are shownfor clarity. Highly conserved helix P4 is colored in red, J3�4 in purple, J5�15 in green, J15�15.1 in yellow, J15.2�2 in cyan, J19�4 in orange, and J4�1 in blue. (B)Stereo view of an all-atom model of the core with the backbone trace shown as a round ribbon. Structural elements are colored according to A.

13394 � www.pnas.org�cgi�doi�10.1073�pnas.0506662102 Kazantsev et al.

Dow

nloa

ded

by g

uest

on

Janu

ary

1, 2

021

Page 4: Crystal structure of a bacterial ribonuclease P RNARNA could not be traced because of disorder in the crystal. The structure of the traceable part of the RNA, which contains all of

complex (ref. 9 and Fig. 4B). These nucleotides (homologs of A256,A332, U333, A334, and G335 in the B. stearothermophilus RNase PRNA) occur close to each other in our model and, while providinglittle information on the identity of the catalytically importantresidues, lead us to propose that the chemical site of catalysis byRNase P is located in the cleft formed by nucleotides A256,A332-G335, A390, C391 (backbone), and G275-A277, as depictedin Fig. 4C.

Metal-Binding Sites. RNase P is a metalloenzyme, dependent forcatalysis on divalent metal ions, preferably magnesium (29, 30).Three major functions performed by divalent metal ions in RNaseP include contribution to the correct folding of the ribozyme (31),participation in substrate binding (32), and direct involvement inhydrolysis of the substrate phosphodiester bond (31, 33, 34). Someinsight into the distribution of the metal-binding sites is provided bythe anomalous signal of the heavy atom derivatives, most informa-tively the derivative with Os(III) hexamine, which mimics a fullyhydrated octahedrally coordinated Mg2� ion, and is expected tobind RNA only by outer-sphere coordination (35, 36). A similarmode of Mg2� binding to RNA is rare (one or two inner-spherecontacts with RNA are made most frequently; ref. 37), but manymetal-binding sites in RNA can accommodate a variety of metalions that differ in hydration state and coordination preferences(38). Of the 27 strongest Os(III) hexamine sites in the RNase PRNA structure, 23 are found in major grooves of double-strandedhelices, often distorted from A-form geometry by helical junctionsand noncanonical base pairing. Twenty-one of the 27 Os(III)hexamine ions appear to be coordinated by Hoogsteen faces ofpurines, including at least one guanine. Both of the two Pb(II)-binding sites are found in the major grooves of helices distortedfrom the A-form geometry. Most of these metal-binding sitesoverlap with significant residual density in the �A-weighted Fo�Fcmaps calculated with native amplitudes. This suggests that orderedsolvent, presumably hydrated magnesium ions that would be de-tectable at this resolution, is indeed bound at these sites in the nativeRNA structure.

Of particular interest are metal-binding sites in the vicinity ofhighly conserved helix P4. Biochemical (39–42) and physical data(43, 44) indicate the importance of magnesium-binding sites in thisregion for the activity of RNase P RNA. Three Os(III) hexamine(sites A, B, and C, Fig. 4C) and one Pb(II) site are bound by themajor groove of P4 and the adjacent highly conserved nucleotides.The Os(III) hexamine site A, which is formed by consecutive basepairs U56-A392 and G57-C391, overlaps with one of the Pb(II)sites, which suggests low cation specificity for this site. Two moresites, D and E (Fig. 4C), deserve special attention: they are foundin the cleft we identify as the chemically active center. Although thelatter metal-binding sites can be the candidates for direct involve-ment in chemistry, sites A and B, in the major groove of P4, areremote from the proposed chemically active site and thereby mostprobably serve a different function. Docking of tRNA onto theRNase P RNA structure (below) positions the backbone of theacceptor stem of tRNA directly in the major groove of helix P4.Therefore, we propose that the primary role of the metal-bindingsites in helix P4 is to screen electronegative repulsion between thenegatively charged substrate and ribozyme RNAs.

Reconstruction of the Full-Size RNA and Its Complex with the Product.Homologs of 73 nucleotides in the present structure are found in therecently determined S-domain structure of the Bacillus subtilisRNase P RNA (PDB ID code 1NBS; ref. 45); 67 of the overlappingnucleotides are identical between the B. subtilis and B. stearother-mophilus RNase P RNAs. Initial comparison of the structuresrevealed differences in the orientation of helix P8 and in theconformations adopted by loop L8. This loop is docked intothe minor groove of helix P4 in our structure but is involved in theformation of crystal contacts in the free S-domain structure. P8 and

Fig. 4. Base conservation in the bacterial RNase P RNA and identification of thechemically active site. (A) Distribution of the conserved nucleotides in the bacte-rial RNase P RNA. Nucleotides absolutely conserved in bacteria are highlighted inred,whereas thosehighly conserved (�80%)are showninyellow. (B)Nucleotidesin bacterial RNase P RNA crosslinked by a photoagent at the 5� end of producttRNA (9). Nucleotides crosslinked in both A- and B-types of RNase P RNA arecolored in red, whereas nucleotides crosslinked only in the A-type RNase P RNAarecolored inyellow. (C) Proposedchemicallyactive site (orange).Balls andsticks,nucleotides crosslinked by a photoagent at the 5� end of product tRNA in A- andB-types of RNase P RNA. Helix P4 is colored in blue, P15 in green, and nucleotideand A256 in purple. RNA-phased anomalous difference maps from Os(III) hex-amine (red mesh) and Pb(II) (yellow mesh) contoured at 4.5 � indicate potentialmetal-binding sites. The metal-bindng sites discussed in the text are labeled Athrough E. The orientation of the model is approximately the same as in Fig. 3B.

Kazantsev et al. PNAS � September 20, 2005 � vol. 102 � no. 38 � 13395

BIO

CHEM

ISTR

Y

Dow

nloa

ded

by g

uest

on

Janu

ary

1, 2

021

Page 5: Crystal structure of a bacterial ribonuclease P RNARNA could not be traced because of disorder in the crystal. The structure of the traceable part of the RNA, which contains all of

L8, therefore, were excluded from the superimposition, and only 51nucleotides, comprising P7, P10, P11 and parts of P9 and P10.1,were used to superimpose the two structures, with rms deviationsof 1.98 Å for the phosphorous atoms in the selected nucleotides. Asshown in Fig. 5A, the resulting model of the entire RNase P RNAis still remarkably flat, but in agreement with the RNA packing inthe crystals (see supporting information).

Photoaffinity crosslinking data (9, 10) allow reconstruction, ingeneral detail, of the complex between RNase P RNA and producttRNA. Photoagents chemically attached at different tRNA nucle-otides (Fig. 5B) form crosslinks to a limited set of nucleotides inRNase P RNA that consequently are expected to be within 9 Åfrom the site of the photoagent in the tRNA. The docked complexreconstructed on the basis of the crosslinking data is shown in Fig.5C. This model of the complex satisfies available photoaffinitycrosslinking data and provides a working model for the RNase PRNA–tRNA complex.

DiscussionExtensive previous phylogenetic comparative studies had identifiedthe elements of secondary structure and defined the conservedelements of the RNA. The comparative structure was useful in theinterpretation of electron density maps, which in turn agreed withstructure predicted on phylogenetic grounds (see supporting infor-mation). Patterns of coaxial stacking of the helices and their overallspatial arrangement agree remarkably well between the crystal andphylogenetic-crosslinking structure models. Thus, as with the struc-ture of the ribosome (46–48), crystallography validates the phylo-genetic-comparative approach to RNA structure. Comparative andmodeling studies with RNase P also indicated some tertiary struc-tural elements, although these generally proved, in the light of thecrystal structure, to be more complex than predicted. Phylogeneticcomparisons perceive only base interactions and so cannot detectthe wealth of backbone and other contacts that are revealed by thecrystal structure.

Fig. 5. Reconstructed entire B-type RNase P RNA and its complex with tRNA. (A) Reconstructed entire RNase P RNA. Nucleotides in P7, P10, P11, and parts ofP9 and P10.1 of B. stearothermophilus RNA structure used in superimposition are colored in purple. The missing substructure, imposed from the B. subtilisS-domain structure, is colored in red. (B) Photoaffinity crosslinking data (9, 10) used in reconstruction of the complex with tRNA. S-domain of RNase P RNA (Left)is colored in gray. Colored spheres in tRNA (Right) represent different sites of photoagent attachment. Colored spheres in RNase P RNA represent sites ofcrosslinking by those photoagents within the RNase P–tRNA complex. (C) Reconstructed complex. Small gray spheres indicate nucleotide G230. Reconstructionwas performed interactively with the computer program O (13); high-resolution tRNAPhe structure (PDB ID code 1EH Z; ref. 53) was used in the modeling.

13396 � www.pnas.org�cgi�doi�10.1073�pnas.0506662102 Kazantsev et al.

Dow

nloa

ded

by g

uest

on

Janu

ary

1, 2

021

Page 6: Crystal structure of a bacterial ribonuclease P RNARNA could not be traced because of disorder in the crystal. The structure of the traceable part of the RNA, which contains all of

The crystal structure also provides a framework for understand-ing functional aspects of previous comparative, crosslinking, andother biochemical analyses. Thus, those data were instrumental ininference of the location of the active site of the ribozyme. Ourassignment of the chemically active site is further corroborated bythe spatial proximity of structural elements implicated in therecognition of the substrate precursors of tRNA. The base of thehighly conserved A256 is thought to pair with nucleotide N-1 ofthe 5�-precursor sequence (49, 50) and is positioned next to thecleft. Loop L15, shown to be involved in binding the substrate3�-CCA (51), although partially disordered is also within consistentdistance to the proposed active site (Fig. 4C).

The global structure of RNase P RNA as seen in the crystal isunusual in that it is remarkably flat and extended. At least a fewdifferences between the crystal and solution structures are ex-pected. One possible distortion in the crystal structure involvesP15.2, which possibly is pulled from the body of the RNA by

formation of crystal contacts. Biochemical data also suggest thatsome rearrangement of the RNase P RNA structure may occurupon tRNA binding. The homolog of nucleotide G230, at the baseof the S-domain (Fig. 5C), is protected from chemical modificationin B. subtilis RNase P RNA (52) but is separated by �11 Å from thenearest tRNA atom in the reconstructed complex. Other potentialdistortions of the crystal structure from the solution structure arepossible, indicating the need for further structural study of addi-tional examples of this universal RNA.

We thank Drs. R. T. Batey and J. Jankarik for fruitful discussions andhelp at different stages of this project and Howard Hughes MedicalInstitute for beam-line access during scanning for the heavy atomderivatives. Portions of this research were carried out at the StanfordSynchrotron Radiation Laboratory, a national user facility operated byStanford University on behalf of the U.S. Department of Energy, Officeof Basic Energy Sciences. This work was supported by National Institutesof Health Grant GM 34527 (to N.R.P.).

1. Frank, D. N. & Pace, N. R. (1998) Annu. Rev. Biochem. 67, 153–180.2. Altman, S. & Kirsebom, L. (1999) in The RNA World, eds. Gesteland, R. F.,

Cech, T. R. & Atkins, J. F. (Cold Spring Harbor Lab. Press, Plainview, NY),2nd Ed., p. 380.

3. Guerrier-Takada, C., Gardiner, K., Marsh, T., Pace, N. & Altman, S. (1983)Cell 35, 849–857.

4. Harris, J. K., Haas, E. S., Williams, D., Frank, D. N. & Brown, J. W. (2001) RNA7, 220–231.

5. Massire, C., Jaeger, L. & Westhof, E. (1997) RNA 3, 553–556.6. Brown, J. W., Nolan, J. M., Haas, E. S., Rubio, M. A. T., Major, F. & Pace, N. R.

(1996) Proc. Natl. Acad. Sci. USA 93, 3001–3006.7. Tanner, M. A. & Cech, T. R. (1995) RNA 1, 349–350.8. Tallsjo, A., Svard, S. G., Kufel, J. & Kirsebom, L. A. (1993) Nucleic Acids Res.

21, 3927–3933.9. Burgin, A. B. & Pace, N. R. (1990) EMBO J. 9, 4111–4118.

10. Chen, J.-L., Nolan, J. M., Harris, M. E. & Pace, N. R. (1998) EMBO J. 17,1515–1525.

11. Pflugrath, J. W. (1999) Acta Crystallogr. D 55, 1718–1725.12. Brunger, A. T. et al. (1998) Acta Crystallogr. D 54, 905–921.13. Jones, T. A., Zou, J. Y., Cowan, S. & Kjeldgaard, M. (1991) Acta Crystallogr.

A 46, 110–119.14. Brown, J. W. (1999) Nucleic Acids Res. 27, 314.15. Massire, C., Jaeger, L. & Westhof, E. (1998) J. Mol. Biol. 279, 773–793.16. Haas, E. S., Banta, A. B., Harris, J. K., Pace, N. R. & Brown, J. W. (1996)

Nucleic Acids Res. 24, 4775–4782.17. Moore, P. B. (1999) Annu. Rev. Biochem. 68, 287–300.18. Nissen, P., Ippolito, J. A., Ban, N., Moore, P. B. & Steitz, T. A. (2001) Proc.

Natl. Acad. Sci. USA 98, 4899–4903.19. Tamura, M. & Holbrook, S. R. (2002) J. Mol. Biol. 320, 455–474.20. Harris, M. E., Kazantsev, A. V., Chen, J.-L. & Pace, N. R. (1997) RNA 3,

561–576.21. Klosterman, P. S., Hendrix, D. K., Tamura, M., Holbrook, S. R. & Brenner,

S. E. (2004) Nucleic Acids Res. 32, 2342–2352.22. Siegel, R. W., Banta, A. B., Haas, E. S., Brown, J. W. & Pace, N. R. (1996) RNA

2, 452–462.23. Haas, E. S., Armbruster, D. W., Vucson, B. M., Daniels, C. J. & Brown, J. W.

(1996) Nucleic Acids Res. 24, 1252–1259.24. Kazantsev, A. V. & Pace, N. R. (1998) RNA 4, 937–947.25. Harris, M. E. & Pace, N. R. (1995) RNA 1, 210–218.

26. Loria, A. & Pan, T. (1996) RNA 2, 551–563.27. Pan, T. (1995) Biochemistry 34, 902–909.28. Pan, T., Loria, A. & Zhong, K. (1995) Proc. Natl. Acad. Sci. USA 92,

12510–12514.29. Beebe, J. A. & Fierke, C. A. (1994) Biochemistry 33, 10294–10304.30. Smith, D. & Pace, N. R. (1993) Biochemistry 32, 5273–5281.31. Fang, X. W., Pan, T. & Sosnick, T. R. (1999) Nat. Struct. Biol. 6, 1091–1095.32. Beebe, J. A., Kurz, J. C. & Fierke, C. A. (1996) Biochemistry 35, 10493–10505.33. Warnecke, J. M., Sontheimer, E. J., Piccirilli, J. A. & Hartmann, R. K. (2000)

Nucleic Acids Res. 28, 720–727.34. Warnecke, J. M., Held, R., Busch, S. & Hartmann, R. K. (1999) J. Mol. Biol.

290, 433–445.35. Misra, V. K. & Draper, D. E. (1999) Biopolymers 48, 113–135.36. Cate, J. H. & Doudna, J. A. (1996) Structure (Cambridge, U.K.) 4, 1221–1229.37. Klein, D. J., Moore, P. B. & Steitz, T. A. (2004) RNA 10, 1366–1379.38. Batey, R. T. & Doudna, J. A. (2002) Biochemistry 41, 11703–11710.39. Christian, E. L., Kaye N. M. & Harris, M. E. (2002) EMBO J. 21, 2253–2262.40. Rasmussen, T. A. & Nolan, J. M. (2002) Gene 294, 177–185.41. Christian, E. L., Kaye, N. M. & Harris, M. E. (2002) RNA 6, 511–519.42. Frank, D. N., Ellington, A. E. & Pace, N. R. (1996) RNA 2, 1179–1188.43. Schmitz, M. (2004) Nucleic Acids Res. 32, 6358–6366.44. Schmitz, M. & Tinoco I., Jr. (2000) RNA 6, 1212–1225.45. Krasilnikov, A. S., Yang, X., Pan, T. & Mondragon, A. (2003) Nature 421,

760–764.46. Ban, N., Nissen, P., Hansen, J., Moore, P. B. & Steitz, T. A. (2000) Science 289,

905–920.47. Wimberly, B. T., Brodersen, D. E., Clemons W. M., Jr., Morgan-Warren, R. J.,

Carter, A. P., Vonrhein, C., Hartsch, T. & Ramakrishnan, V. (2000) Nature 407,327–339.

48. Yusupov, M. M., Yusupova, G. Zh., Baucom, A., Lieberman, K., Earnest, T. N.,Cate, J. H. D. & Noller, H. F. (2001) Science 292, 883–896.

49. Zahler, N. H., Sun, L., Christian, E. L. & Harris, M. E. (2005) J. Mol. Biol. 345,969–985.

50. Zahler, N. H., Cristian, E. L. & Harris, M. E. (2003) RNA 9, 734–745.51. Oh, B. K. & Pace, N. R. (1994) Nucleic Acids Res. 22, 4087–4094.52. Odell, L., Huang, V., Jakacka, M. & Pan, T. (1998) Nucleic Acids Res. 26,

3717–3723.53. Shi, H. & Moore, P. B. (2000) RNA 6, 1091–1105.

Kazantsev et al. PNAS � September 20, 2005 � vol. 102 � no. 38 � 13397

BIO

CHEM

ISTR

Y

Dow

nloa

ded

by g

uest

on

Janu

ary

1, 2

021