j. theor. biol. (1987) 127, 301-314 - duke universitypeople.duke.edu/~darmaleo/structure and...

14
J. theor. Biol. (1987) 127, 301-314 Structure and Evolution of Prokaryotic and Eukaryotic RNA Polymerases: a Model DAN1ELE ARMALEO Department of Microbiology, Duke University Medical Center, Durham, North Carolina 27710, U.S.A. (Received 1 August 1986, and in revised form 2 February 1987) A comparative overview of the subunit taxonomy and sequences of eukaryotic and prokaryotic RNA polymerases indicates the presence of a core structure conserved between both sets of enzymes. The differentiation between prokaryotic and eukary- otic polymerases is ascribed to domains and subunits peripheral to the largely conserved central structure. Possible subunit and domain functions are outlined. The core's flexible shape is largely determined by the elongated architecture of the two largest subunits, which can be oriented along the DNA axis with their bulkier amino-terminal head regions looking towards the 3' end of the gene to be transcribed and their more slender carboxyl-terminal domains at the tail end of the enzyme. The two largest prokaryotic subunits appear originally derived from a single gene. 1. Introduction The eukaryotic nuclear RNA polymerases are a family of related proteins with three subgroups, each having distinct structural and functional properties. Class I (or A) enzymes synthesize 18S and 28S ribosomal RNA precursors, class II (or B) messenger RNA precursors, and class III (or C) various small RNAs, transfer RNA and 5S RNA (Roeder, 1976). The purified native proteins (500 to 700 kDa) are composed of 10 to 15 different subunits whose individual sizes range from 10 to 240 kDa. Due to the complexity of these enzymes and to the failure of in vitro reconstitution attempts, very little is known about the roles and topography of their component subunits (Lewis & Burgess, 1982; Paule, 1981). Prokaryotes, on the other hand, transcribe all their genes with a single enzyme, simpler and better understood than its eukaryotic counterparts (Chamberlin, 1982). In this paper a model is described of the structure and evolution of prokaryotic and eukaryotic RNA polymerases, based on an analysis of subunit molecular weights and sequences. Within the limitations of a simplifying scheme, the model provides a frame for a more targeted dissection of polymerase structure and function. 2. Subunit Size Conservation vs Change in Polymerase Evolution In Fig. 1(a), eukaryotic RNA polymerases are grouped into their three constituent classes. A prokaryotic and a viral RNA polymerase are also included. Each symbol represents an individual enzyme subunit whose position along the abscissa corre- sponds to its molecular weight and, along the vertical dimension, to its parent polymerase (listed on the left). Such a comparative representation, warranted by 301 0022-5193/87/150301 + 14 $03.00/0 ~) 1987 Academic Press Ltd

Upload: others

Post on 29-May-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

J. theor. Biol. (1987) 127, 301-314

Structure and Evolution of Prokaryotic and Eukaryotic RNA Polymerases: a Model

DAN1ELE ARMALEO

Department of Microbiology, Duke University Medical Center, Durham, North Carolina 27710, U.S.A.

(Received 1 August 1986, and in revised form 2 February 1987)

A comparative overview of the subunit taxonomy and sequences of eukaryotic and prokaryotic RNA polymerases indicates the presence of a core structure conserved between both sets of enzymes. The differentiation between prokaryotic and eukary- otic polymerases is ascribed to domains and subunits peripheral to the largely conserved central structure. Possible subunit and domain functions are outlined. The core's flexible shape is largely determined by the elongated architecture of the two largest subunits, which can be oriented along the DNA axis with their bulkier amino-terminal head regions looking towards the 3' end of the gene to be transcribed and their more slender carboxyl-terminal domains at the tail end of the enzyme. The two largest prokaryotic subunits appear originally derived from a single gene.

1. Introduction

The eukaryotic nuclear RNA polymerases are a family of related proteins with three subgroups, each having distinct structural and functional properties. Class I (or A) enzymes synthesize 18S and 28S ribosomal RNA precursors, class II (or B) messenger RNA precursors, and class I I I (or C) various small RNAs, transfer RNA and 5S RNA (Roeder, 1976). The purified native proteins (500 to 700 kDa) are composed of 10 to 15 different subunits whose individual sizes range from 10 to 240 kDa. Due to the complexity of these enzymes and to the failure of in vitro reconstitution attempts, very little is known about the roles and topography of their component subunits (Lewis & Burgess, 1982; Paule, 1981). Prokaryotes, on the other hand, transcribe all their genes with a single enzyme, simpler and better understood than its eukaryotic counterparts (Chamberlin, 1982).

In this paper a model is described of the structure and evolution of prokaryotic and eukaryotic RNA polymerases, based on an analysis of subunit molecular weights and sequences. Within the limitations of a simplifying scheme, the model provides a frame for a more targeted dissection of polymerase structure and function.

2. Subunit Size Conservation vs Change in Polymerase Evolution

In Fig. 1 (a), eukaryotic RNA polymerases are grouped into their three constituent classes. A prokaryotic and a viral RNA polymerase are also included. Each symbol represents an individual enzyme subunit whose position along the abscissa corre- sponds to its molecular weight and, along the vertical dimension, to its parent polymerase (listed on the left). Such a comparat ive representation, warranted by

301

0022-5193/87/150301 + 14 $03.00/0 ~) 1987 Academic Press Ltd

Page 2: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

PO

X

~,

A

&

ee

X~NO

~ ~

~:

.:

o o

o °

o C

AU

LI

~ 0

0 0

0 W

.E,~

..

..

:~

o

o~

o

AC

AN

T,

A

~ &

~

0 0

0 N

EU

RO

. ~

A

~ ~

~ ~

• O

O

Y

EA

ST

~

A

&

&

&&

••

0

0

, M

OU

SE

~

~ ~

e

DR

OS

O,

~ ~

&

T W

HE

AT

~

t¢=

~

&~

S

Y

B,

A~

AN

T

&A

~.

i l

AG

AR

II

~ &

~

• P

HY

SA

, &

A

A

~

NE

UR

O,

A

&~

o .;!..

.. o

• .

~ M

OU

SE

~

RA

T

CA

UL~

~

o

• W

HE

AT

~

0 t

AC

AN

T~

~

~ A

SP

ER

~

A

~ O

~

BB

N

EU

RO

A

A

A

A

~ A

0

YE

AS

T

~&

~

A

~ A

I

EC

OL

I •

O0

' '

o '

'o '

'o "

8' '

' '

't"

' '

0 2

4 6

0 10

0 12

0 14

0

i

I

0

O

0

• O

I I~

0

10

~3

0

--3

0

--2

0

--1

0

I 0

20

4

0

100

50

~1

00

~n

in

l i

l

60

8

0

SE

CO

ND

L

AR

GE

ST

120

140

160

0

0

(o)

0 0

(0)

0

0

0

O

(o)

O

0 0

0 0

LA

RG

ES

t S

UB

UN

IT

180

20

0

' 220

(a)

MW

K

DA

I

(b)

24

0

MW

. (K

DA

|

> ;:o

> F'

rn

0

Page 3: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

STRUCTURE AND EVOLUTION OF RNA POLYMERASE 303

the relatedness of all eukaryotic RNA polymerases (Armaleo & Gross, 1985b; Ingles et al. , 1984; Weeks et al. , 1982; Huet et al. , 1982b; Paule, 1981), shows that most subunits cluster in two size ranges, one between 10 and 50 kDa, and the other between 120 and 240 kDa, regardless of enzyme type or source. Subclusters can also be recognized within the two major ranges (Fig. l(b)). This overall similarity in the architecture of eukaryotic RNA polymerases is still discernible after thousands of millions of years of divergence (Woese & Fox, 1977) over a background of substantial molecular weight variability.

Assessing in some detail the relative significance of subunit size conservation vs change in polymerase evolution is at the basis of the model discussed below. The assumption is that, in such a multi-subunit system, the degree of molecular weight drift among comparable polypeptides across many species and polymerase classes reflects the degree of divergence in sequence and function of these subunits. It is on this broad basis that the polypeptides listed in Fig. l(a) will be grouped into "conserved" or "variable" clusters (Fig. 1 (b)). The boundaries between size clusters, however, are not sharp. Therefore, whereas the majority of subunits characterizing a cluster will be considered functionally related, some may in fact be "spill-overs'" from neighboring groups with different functions (overlaps in Fig. l(b)).

Proceeding from the smallest to the largest subunits, one finds a first major 10-30 kDa cluster (A), which includes most highly conserved small subunits common to at least two polymerase classes in any given organism (Lewis & Burgess, 1982). Within the 10-30 kDa group, the histogram in Fig. l(b) hints at several subdivisions. A second clustering of conserved subunits (O) is seen around 35-45 kDa, and is distinct from the first (Fig. l(b)). (For the size-conserved subunits in both clusters, sequence conservation has been proven immunologically, by SDS polyacrylamide gel electrophoresis, peptide mapping, and two-dimension electrophoresis; see Armaleo & Gross (1985b) and Lewis & Burgess (1982, and references therein). Proceeding through the region between 50 and 120 kDa, one finds a few widely scattered polymerase I and III subunits (O), but no polymerase II subunits. Finally, the second largest subunits (HI) show limited variability (120-150kDa), whereas the largest subunits (E3) cover a wide range of sizes (150-240 kDa).

FIG. 1. RNA polymerase subunit size distribution. (a) The class and the organism of origin of each enzyme are listed on the left. The abscissa measures molecular weight. Subunits belonging to the same enzyme fall on the same ideal line (parallel to the abscissa) and are represented by different symbols according to the classification described in the text. (A) 10-30 kDa; (0) 35-45 kDa; (O) 50-100 kDa; (11) second largest subunit; (E]) largest subunit (proteolyzed subunits are in parenthesis). The data were compiled from the following references. E. cofi (Chamberlain, 1982); yeast (Lewis & Burgess. 1982; Buhler et ak, 1976, 1980; Dezelee et al., 1976; Valenzuela et aL, 1976); Neurospora (Armaleo & Gross, 1985a). Aspergillus (Stunnenberg et al., 1979); Physarium (Smith & Braun, 1978); Agaricus (Vaisius & Horgen, 1979); Acanthamoeba (D'Alessio et al., 1979a, b; Spindler et al., 1978a, b); soybean (Guilfoyle & Jendrisak, 1978); wheat (Jendrisak, 1981; Teissere etaL, 1977); cauliflower (Guilfoyle, 1980); Drosophila (Weeks et al., 1982); Artemia (Huet et al., 1982b); Xenopus (Sklar et aL, 1975); rat (Matsui et al., 1976); mouse (Sklar, et al., 1975); calf (Dahmus, 1981a, b; Hodo & Blatti, 1977); human (Jaehning et al., 1977); pox (Baroudy & Moss, 1980). (b) Histogram representing the cumulative number of eukaryotic RNA polymerase subunits displayed in (a) as a function of molecular weight. The horizontal lines and the associated numbers mark the molecular weight ranges of the various subunit populations.

Page 4: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

304 D, A R M A L E O

The apparently high variability of the largest subunits can be broken down into three discontinuous ranges related to polymerase class, clustering around 155 kDa in type III polymerases, 195 kDa in type I polymerases, and 220 kDa in type II polymerases. A clue to interpret this pattern may be provided by the largest subunits of type II enzymes. These polypeptides form on the enzyme surface a structurally and functionally independent 40-50 kDa domain (Armaleo & Gross, 1985a), as suggested by the frequent proteolytic loss of a 40 kDa segment from that subunit during enzyme isolation (Guilfoyle et al., 1984) without major alteration in the in vitro catalytic properties (Dezelee et al., 1976). The size of the proteolyzed subunit is reduced to about 180kDa, a value closer to that of the largest subunits of polymerases I and III. This structural peculiarity of type II polymerases suggests that most of the size divergence among the largest subunits of all three enzyme classes may be due to different domains present at one end of an otherwise "constant" core section of about 160 kDa. In type I polymerases, the extra domains would be less sensitive to proteolysis (their largest subunits are usually recovered intact), whereas type III polymerases would have little or no extension (the size of their largest subunit is close to 160 kDa). Based on indirect evidence, it was suggested that the proteolysis-sensitive extension in polymerase II is carboxy-terminal (Armaleo & Gross, 1985a; Armaleo, 1984), an orientation established by recent sequencing data (see section 3). A similar organization might be anticipated also for the largest subunits of type I polymerases.

In summary, conservation is mostly seen among the 10-30 kDa, the 35-45 kDa, the second largest 140 kDa subunits, and the presumably invariant portions (about 160 kDa) of the largest subunits. Divergence, on the other hand, appears primarily confined to the 50-100 kDa polypeptides and to the variable carboxy-terminal extensions of the largest subunits.

3. Related Subunits and Cores in Prokaryotic and Eukaryotic Enzymes

A comparison with E. coil RNA polymerase, representative of most prokaryotic RNA polymerases (Chamberlin, 1982), shows that the bacterial enzyme has no subunits in the 10-30 kDa range, that the molecular weight of its a subunit falls in the center of the eukaryotic 35-45 kDa cluster, that the regulatory ~ and nusA proteins fall in the 50-100kDa span, and that /3 and /~' are located at the lower end (150-160 kDa) of the size range of the largest eukaryotic subunits (Fig. l(a)). Carrying the "'molecular weight conservation vs relatedness" argument across the prokaryotic/eukaryotic bridge, one could expect relatedness between a and some of the 35-45 kDa subunits, between ~ and the second largest eukaryotic subunits, and between/3' and the presumed invariant part of the largest eukaryotic subunits.

Direct and indirect experimental evidence for common ancestry does in fact exist. (1) By way of the cloned DNA sequences, homologies have been found between

the E. coli ~ ' subunit and the amino-terminal regions of the largest polymerase II subunits of Drosophi la (Biggs et al., 1985), yeast (Allison et al., 1985) and mouse (Corden et al., 1985), the largest polymerase III subunit of yeast (Allison et al., 1985), and the 147 kDa subunit of vaccinia virus polymerase (Broyles & Moss,

Page 5: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

S T R U C T U R E A N D E V O L U T I O N OF R N A P O L Y M E R A S E 305

Prokoryotic core

Eukaryotic core

I II 111 Eu,~oryotic polymeroses

FIG. 2. A scheme of structural relationships among RNA polymerases. The four major core subunits are symbolized by/3,/3' and a in both prokaryotic and eukaryotic enzymes. For eukaryotic polymerases, the letter e represents the 10-30 kDa, and p the various "regulatory" subunits. The carboxy-terminal extensions of the largest eukaryotic subunits are hatched. The 10-30 kDa subunits are considered part of the eukaryotic core as most are common to all three polymerase classes. The contacts, shapes and relative dimensions of the subunits are arbitrary. See section six for a description of the roles of the eukaryotic subunits.

1986); the same data indicate that the carboxy-terminal region of the largest poly- merase II subunit is in fact the proteolysis-sensitive domain and that it consists of a unique series of [TyrSerProThrSerProSer] repeats, about 25 in yeast and 50 in mouse, with no known counterpart in E. co/i; (the establishment of homology between /3 and the second largest eukaryotic subunits awaits the cloning of the eukaryotic sequences).

(2) The/3 and/3' E. coil subunits, like the two largest polymerase II polypeptides, are involved in nucleic acid binding and basic polymerization functions (Carroll & Stollar, 1983; Horikoshi et aL, 1983; Huet et aL, 1982a; Yura & tshihama, 1979).

(3) In several cases, a stoichiometry close to two 35-45 kDa subunits per enzyme molecule can be deduced from the SDS gel staining patterns of eukaryotic RNA polymerases (Armaleo & Gross, 1985a; D'Alessio et al., 1979b; Stunnenberg et al., 1979; Vaisius & Horgen, 1979; Spindler et aL, 1978a, b; Hager et al., 1977; Jaehning et al., 1977; Valenzuela et al., 1976) and of pox virus RNA polymerase (Baroudy & Moss, 1980), in striking correspondence with the known stoichiometry of the a subunit of 17. coli; (also in this case, the establishment of homology between a and some of the 35-45 kDa subunits awaits the cloning of the eukaryotic sequences). The 10- 30 kDa polypeptides are thus left as a distinctly eukaryotic subunit popula- tion, with no obvious prokaryotic counterpart (except for some archaebacteria, as discussed below).

The correlations described above suggest (Fig. 2) that today's prokaryotic and eukaryotic cores derive from a common/3/3'a2-1ike ancestor, and that a unique set

Page 6: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

306 D. ARMALEO

of novel 10-30 kDa polypeptides (all grouped here under the symbol e), as well as carboxy-terminal extensions to the largest subunit of type I and II polymerases, are characteristic of eukaryotic enzymes.

4. Orientation of the Largest Polymerase Subunits on the Transcribed Gene

The data available in the literature allow some detailed proposals to be made on the orientation of the two large polymerase subunits with respect to each other and to the transcribed gene. A primary observation is that the fl and fl' subunits of E. coli RNA polymerase have a very elongated shape, as indicated by physical studies of the enzyme in solution (Meisenberger et al., 1980; Stoeckel et al., 1980), by experiments measuring the distance traveled by the 5' end of the nascent RNA before leaving the surface of the enzyme (Hanna & Meares, 1983) and, more directly, by mapping the DNA contacts made by the enzyme as it binds to various promoters (Chenchick et al., 1981; Siebenlist et al., 1980; Schmitz and Galas, 1979). The latter studies indicate that /3 and /3' can each touch the DNA over a distance of more than 200/~ (65 base pairs). From neutron- and small angle X-ray scattering studies (Meisenberger et al., 1980; Stoeckel et al., 1980), fl and fl' are each 230 to 240 long. These data imply an average width to length ratio of approximately 1 : 6 for these 150-160 kDa proteins. The main assumption on which the conclusions reached below are based is that, in the elongated polymerase subunits, the overall polarity of the polypeptide chain will tend to follow the longer axis, with the N and C termini near opposite ends. This particular disposition of the termini is observed, for example, in elongated molecules like collagen, myosin, fibrinogen, antibodies, and transcription factor IIIA, and is due to the way many multi-functional proteins appear to be constructed, by sequential linking of individual domains (Rossmann & Argos, 1981).

With these premises, the amino-carboxy orientation of the E. coil fl ' and the largest eukaryotic subunit with respect to the direction of transcription can be deduced from the following data. As described, the largest subunits of yeast RNA polymerases II and III and the E. coli f l ' subunit are homologous over most of their length. They also share sequence homology, confined to a 150 amino-acid region, with E. coli DNA polymerase I and T7 DNA polymerase (Allison et al., 1985). Within the three-dimensional structure of E. coli DNA polymerase I (Ollis et al., 1985a), this 150 amino-acid region comprises three a-helices involved in binding DNA, and is in close proximity of the 3' end of the primer strand, i.e. the active site. The observed sequence conservation and the probably stricter conservation of structure (Ollis et al., 1985b; Keim et al., 1981) suggest that also in prokaryotic and eukaryotic RNA polymerases the three a-helices are located in the general vicinity of the active site. Their position on the linearized amino-acid sequences of the E-coil /3' and the largest potymerase II and III subunits (Allison et al., 1985) places the active site region about 30% away from the amino-terminus; when E. coil RNA polymerase binds to a specific promoter (Chenchick, et al., 198t), the transcription start site (which also indicates the position of the active site) lies about 30% away from the "front" end of the/3' subunit, defined as the end which protrudes into the gene to be transcribed. Thus, the elongated f l ' subunit and its eukaryotic homologues

Page 7: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

S T R U C T U R E A N D E V O L U T I O N O F R N A P O L Y M E R A S E 307

E. coil DNApolymerasel 352PHE ALA PHE ASP THR GLU THR35B

T7 DNApolymerase 232PHE PRO PHE ASP THR LYS ALA238

F186PHE GLU PHE ASP PRO LYS ASP192 E. coli~ I L251ALA SER PHE ASP ILE GLU ALA257

E. coli~" 172PHE GLY ASP GLU PHE ASP ALA LYS MET180

FIG. 3. Alignment of short sequences thought to be involved in nucleotide binding in DNA and RNA polymerases. Seven-residue sequences characteristic of the active site regions of E. coli DNA polymerase I and T7 DNA polymerase were used to seek possibly homologous sequences marking the active site regions in the /3 subunit of E. coil RNA polymerase. If present, such sequences are expected in the amino-terminal region of/3 if it is oriented in parallel to/3' (see text for further discussion). Heptapeptide 352-358 of E. coli DNA polymerase I, which is in close contact with the bound deoxynucteotide (Ollis et aL, 1985a), and heptapeptide 232-238 of the highly homologous T7 DNA polymerase (Ollis et aL, 1985b), were matched by computer with the entire sequence of E. coil/3. Scanning residues 1-1200 with either heptapeptide, the best match found was with sequence 186-192; scanning residues 193-1342 with either heptapeptide the best match was with sequence 251-257. It should be noted that the indicated matches were chosen among the approximately 1200 seven-residue stretches which make up the subunit sequence scanned. The ALIGN program (with an upper limit of 1200 residues per run) was used with the unitary matrix (Dayhoff et at., 1983). The/3' sequence shown was found by scanning residues 1-1200 of/3' with heptapeptide 232-238 of T7 DNA polymerase.

appear or iented with the amino - t e rminus at the head and the carboxy- te rminus at

the tail of the t ranscr ib ing enzyme. In eukaryot ic R N A polymerases, this or ien ta t ion places the carboxy- te rmina l ex tens ion domains at the trai l ing edge of the enzyme, an area of impor tan t regulatory in teract ions with D N A and with other proteins.

In 17. coli, the tail of the p r o m o t e r - b o u n d enzyme is thought to interact with a n u m b e r of regulatory proteins , like CAP or A repressor (Hochschi ld et al., 1983;

Siebenlis t et aL, 1980; Schmitz & Galas , 1979). The /3 subun i t and, by analogy, the second largest eukaryot ic subuni t , can be

or iented on the basis of the fol lowing considera t ions . In E. co l i , /3 and /3' share a l imited a m o u n t of sequence homology, which suggests that a single gene was at the origin of bo th subun i t s . t G e n e dup l ica t ion and divergence would have led from a dimeric complex in which the two c o m p o n e n t s were very s imilar or identical

(homodimer ) to today ' s he terodimer ic /3 a n d / 3 ' polypept ides . The or ienta t ion of/3

with respect t o /3 ' should reflect the original or ien ta t ion in the homodimer . For two similar or ident ical prote ins b ind ing to D N A and to each other, two major types of symmetry can be expected. The first relates the two subuni t s through a ro ta t ion a r o u n d an axis parallel to the D N A and does not change their amino-ca rboxy

polar i ty ( "para l l e l " or ienta t ion) . The second involves a rota t ion a round an axis pe rpend icu l a r to the D N A and would invert the amino-ca rboxy polari ty of one

tThe sequences of/3 and /3' were compared using the ALIGN program (Dayhoff et al., 1983). The overall alignment score obtained for the 1200 amino-terminal residues was 3-43. This score is the number of standard deviations by which the maximum match score for the two real sequences exceeds the average maximum match score for 100 random permutations of the two sequences. The value 3.43 corresponds to a probability of 3 x 10 -4 for the random generation of an homology as high as, or higher than, that observed.

Page 8: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

3 0 8 D. A R M A L E O

subunit with respect to the other ("anti-parallel" orientation). The former alternative is supported by two independent lines of evidence. First, small angle X-ray-scattering studies indicate that the E. coli polymerase core has an elongated trigonal shape mostly determined by the two attached/3 and/3' subunits, with the center of mass closer to one end of the molecule (Meisenberger et al., 1980). This shape (resembling two adjacent sections of a pie) corresponds best to a parallel orientation of two originally very similar large subunits and is consistent with the disposition of cysteines along the polypeptide chain, as described below. If the orientation were anti-parallel one would expect a more homogeneous overall mass distribution. Second, a short E. coli DNA polymerase I sequence which constitutes part of the binding site for nucleotide monophosphates (Ollis et al., 1985a) resembles two short sequences located 60 residues apart in the amino-terminal region of the/3 subunit (Fig. 3). A similar sequence is also found in T7 DNA polymerase (Fig. 3), an enzyme highly homologous to t?.. coti DNA polymerase I (Ollis et aL, 1985b). Considering that/3 binds nucleotide triphosphates (Yura & Ishihama, 1979) at two adjacent and catalytically equivalent sites (Panka & Dennis, 1985), it seems plausible that the two neighbouring/3 sequences may contribute each to one nucleotide triphosphate binding site, thereby localizing the catalytically important area of/3 in proximity of the amino-terminus and orienting the subunit in parallel to /3'. (Figure 3 also shows that a possible related sequence, discussed below, is found in single copy in the/3' subunit.)

5. Functional and Structural Map of the Two Large RNA Polymerase Subunits

The /3 subunit can contact the LacUV-5 promoter between bases -36 and +30 (the transcription start site being +1), and/3' between bases -47 and +18 (Chenchick et al., 1981). This staggered disposition of the DNA contacts suggests a staggered disposition of the two subunits, with/3 being about 12 bases "ahead" of/3', i.e. one and a quarter helix turns. In an ancestral homodimer, two parallel subunits could have made equivalent contacts with the phosphate backbone by being 12 base-pairs displaced and rotating one with respect to the other 80-90 ° around the DNA axis. The "twelve base-pair motif" appears also as another important aspect of RNA polymerase function, namely as the most commonly estimated length of the DNA "bubble" within which RNA synthesis occurs (von Hippel et aL, 1984; Hanna & Meares, 1983; Gamper & Hearst, 1982; Reisbig et al., 1979; Siebenlist, 1979). Is it a coincidence that the displacement of the two large subunits along the DNA corresponds to the approximate extension of the RNA-DNA hybrid in the active site of the enzyme? If not coincidental, the twelve base-pair displacement of/3 and /3' might in fact reflect the fundamental symmetry requirements of an ancestral dimer unwinding a twelve base-pair stretch of DNA to allow synthesis of RNA.

A scheme exemplifying in this light the hypothetical structure-function relation- ships of/3 and fl' is presented in Fig. 4. The rectangles represent /3 and /3' 20% displaced with respect to one another (the equivalent of about 12 base pairs, since each subunit can cover about 60 base pairs). Indicated by the triangles are the positions of the putative nucleotide binding sequences: the two on/3 (A) presumably

Page 9: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

S T R U C T U R E A N D E V O L U T I O N . OF R N A P O L Y M E R A S E 309

IN.;: . . cl

' A A ' ' ' '

FIG. 4. Scheme of the hypothesized relationships between /3, /3' and the substrate. The /3 and /3' subunits are represented by rectangles whose lengths were made proportional to the primary sequence lengths. N and C, amino- and carboxy-terminal regions. The location of the putative nucleotide binding sequences is indicated by triangles. (A) catalytic nucleotide binding; (A) effector nucleotide binding. The regions presumably involved in the binding of double-stranded nucleic acid are marked by the double-headed arrows. The region on /3' found homologous to the DNA binding domains of E, coli DNA polymerase I (Allison et aL, 1985) is marked by the bracket. The dots indicate the positions (within ten amino-acid residues) o f cysteines. Nucleic acids ( DNA; . . . . RNA) are drawn so as to emphasize the primary interactions with each subunit. The direction of transcription is from right to left.

contributing to the binding of the catalytic triphosphates; the one of fl' (A) corre- sponding to residues 172-189 in Fig. 3 and marking a possible effector binding site (Kingston et al., 1981, have shown, for instance, that E. coli RNA polymerase binds ppGpp as a modulator of actvity). Upstream of the putative nucleotide binding regions, the double stranded nucleic acid binding domains are also located in corresponding segments of the two subunits,/3' being involved in DNA binding as the available experimental evidence indicates (Allison et al., 1985; Yura & Ishihama, 1979), and /3 being the major contributor to the catalytic site (Yura & Ishihama, 1979) and probably to the RNA-DNA hybrid binding region. The 5' end of the nascent RNA ( . . . . ) has been shown to follow the boundary between/3 and/3' (Hanna & Meares, 1983). However, as the nascent end first leaves the RNA-DNA hybrid, it appears closer to/3 (Hanna & Meares, 1983), reinforcing the notion that /3 is in fact the major contributor to the hybrid binding domain. The DNA helix interacts with both large subunits outside the drawn regions (Chenchick et al., 1981). The cysteine residues of both /3 and /3', also mapped in Fig. 4 (O), cluster in the amino-terminal two thirds of the molecule, leaving a cysteine-free stretch of 400 to 500 amino-acids towards the carboxy terminus of both polypeptides. The sequences of the largest subunits of yeast polymerases II and III (Allison et al., 1985) also show a similar overall arrangement of cysteines (not drawn). No inter-chain disulfide bonds have been observed in E. coli or in eukaryotic RNA potymerases, indicating that most cysteines will be involved in intra-chain structural stabilization. The

Page 10: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

310 D. A R M A L E O

distribution of the known disulfide bridges among a wide variety of proteins (Thornton, 1981) indicates that most disulfide-linked cysteines are within the same domain and separated by less than 30 residues. They are rarely more than 150 residues apart, a fact probably related to the maximum size of individual domains in large proteins (Freedman & Hillson, 1980). Each of the large polymerase subunits might therefore fold as a succession of domains roughly colinear with the polypeptide chain, compacted by disulfide bonds towards the amino-terminus and more extended and flexible in the cysteine-free carboxy-terminal regions.

This pattern can be related to the overall structural and functional features of the polymerase discussed in the preceding section. It is likely that the bulkier area of the molecule (as defined by small-angle X-ray scattering, see Meisenberger et aL, 1980) comprises the amino-terminal halves of the two large subunits and is involved in the central catalytic functions of helix unwinding, rewinding, and of RNA synthesis, which require the largest transfers of free energy. The more slender and flexible carboxy-terminal region (with the additional extension in eukaryotes) would be a tail-like area amenable to regulatory interactions with specific DNA and protein sequences. It may also remain bound to the DNA even if the front end of the enzyme were displaced by the binding of repressor-like molecules, as observed by Schmitz & Galaz (1979). Alternatively, during elongation, the front end may be bound to the template while the tail domains are not, an interpretation of the footprinting data of Carpousis & Gralla (1985). RNA polymerase may thus display some overall flexibility around a central "hinge" region: the whole elongated enzyme can contact the DNA in an open promoter complex, or either the front or the rear domains are displaced from the DNA. A number of regulatory factors, both positive and negative, may mediate such conformational changes. No attempt was made here to define the topography of the or. 2 dimer or of tr in relation to/3,/3' and the DNA, given the limited data available (Chenchick et al., 1981; Meisenberger et al., 1980; Simpson, 1979; Hillel & Wu, 1977).

It is assumed that throughout evolution the basic structural and functional features described above have been largely conserved between prokaryotic and eukaryotic RNA polymerases.

6. Possible Roles of the Various Sets of Eukaryotic Subunits

For eukaryotic enzymes, the evidence discussed in the preceding sections can provide the following functional and structural information on the various sets of subunits.

(1) As described, the central nucleic acid binding and polymerization functions can be assigned largely to the/3/3'a2-type subunit complex. In a eukaryotic enzyme, this complex would include the invariant part of the largest subunit, the second largest subunit, and two copies of a 35-45 kDa subunit.

(2) The highly conserved 10-30 kDa subunits are a dominant characteristic of eukaryotic polymerases, while nucleosomes and other highly conserved chromatin components are a hallmark of the eukaryotic template. The parallel evolution and conservation of two sets of proteins involved in the same metabolic process (i.e.

Page 11: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

S T R U C T U R E A N D E V O L U T I O N O F R N A P O L Y M E R A S E 311

transcription) suggests specific interactions between them. Several of the small 10-30 kDa eukaryotic subunits may thus bind to chromatin components and induce the fundamental alterations necessary for the enzyme to interact directly with DNA. The coexistence of the small subunits with a structurally and functionally autonomous f l f l 'aE-l ike structure further suggests that most are probably located on the outer surface of the polymerase molecule. This suggestion is reinforced by the likely occurrence of uncontrolled dissociation of t0-30 kDa subunits from the enzyme during isolation, leading all the way from variable yields of some of the small subunits to possible cases of severe depletion (see type I polymerases in rat and calf, Fig. l(a)).

(3) The main features distinguishing one polymerase class from another would not be evenly distributed throughout the structure of each enzyme, but would rather depend on individual protein domains evolving either as carboxy-terminal extensions of the largest core subunit or as individual 50-100 kDa subunits (grouped under p in Fig. 2). Due to their presumed roles in defining the individuality of each poly- merase class, the functions of these domains and subunits can be broadly classified as "regulatory". Somewhat paradoxically, the absence of 50-100 kDa subunits in purified type I! polymerases might in fact reflect the very large number of regulatory proteins binding reversibly to polymerase II and modulating transcription of a wide variety of genes. Unlike some of their counterparts stably bound to the less versatile polymerases I and III, most or all polymerase II regulatory subunits would be lost upon enzyme purification due to their reversible binding and low concentration, varying with cell type and metabolic state. Some loss of regulatory components is of course expected also with type I and III enzymes, since no isolated eukaryotic RNA polymerase is able to recognize specific initiation sites in vitro without addition of cellular extracts or purified transcription factors. For instance, the amount of a polypeptide of approximately 65 kDa shows considerable variation among type I polymerases isolated from various sources. It was found in Aspergil lus (Stunnenberg et al., 1979) and not in yeast (Huet et al., 1975), it was present in sub-stoichiometric amounts in Neurospora (Armaleo & Gross, 1985b) and its presence or absence distinguished two populations of polymerase I in both rat and calf (Matsui et al., 1976; Gissinger & Chambon, 1975).

7. The Case of Sulfur-dependent Bacteria

Prokaryotes have been subdivided into Eubacteria, which include most of the classical prokaryotes, and Archaebacteria, which comprise the methanogens (Woese & Gupta, 1981; Woese & Fox, 1977). Recently and with some controversy, sulfur- dependent bacteria living in thermal springs at temperatures up to 90°C and pH as low as 2 have been grouped into a new kingdom, that of Eocyta, distinct from Archaebacteria (Pace et al., 1986; Lake et al., 1984, 1985). These subdivisions stem from the discovery of a number of profound biochemical differences among prokary- otes. Of concern here is a major discrepancy in subunit composition between eubacterial RNA polymerases and those isolated from representatives of several genera of sulfur-dependent bacteria and methanogens (Huet et at., 1983; Zillig et

Page 12: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

3 1 2 D. A R M A L E O

a/., 1979, 1980). Most of these non-eubacterial enzymes possess, besides two large (100-130 kDa) and one to three 35-45 kDa subunits which comprise the presumed flfl'a2-1ike core, also three to six subunits in the 10-30 kDa range (Zillig & Stetter, 1980). In view of the proposal made here that most of the 10-30 kDa subunits in eukaryotes functionally co-evolved with nuclear chromatin, the presence of small subunits in some archaebacterial polymerases may suggest that their templates are extensively complexed with protein; a stabilizer, perhaps, in the face of the extreme habitats colonized by these prokaryotes. In eukaryotes, evolution and differentiation of chromatin and RNA polymerases would have been influenced by more complex variables than the need to stabilize the chromosome against heat or low pH.

It should be noted that these speculations on the evolution of nucleo-protein complexes do not imply a choice, by themselves, between the various proposals on the relationships of the primordial eukaroytic and prokaryotic lineages (Pace et al., 1986; Lake et al., 1985).

8. Concluding Remarks

A model of RNA polymerase structure and evolution is described which suggests (1) the existence of a basic f l f l 'aE-like design common to eukaryotic and prokaryotic RNA polymerases, comprising a bulkier front region mostly involved in catalysis flexibly connected to a more slender tail largely involved in regulation, (2) that within this basic design, two originally similar large subunits (both looking with the amino-terminal region towards the 3' end of the transcribed gene) diverged one towards binding the RNA-DNA hybrid and catalytic nucleotides, and the other towards binding DNA and effector nucleotides, and (3) that the evolution differen- tiating bacterial and eukaryotic enzymes essentially concerned a set of domains and subunits peripheral to the underlying flfl'CtE-like structure. These are (a) a group of largely conserved 10-30 kDa "chromatin-unraveling" subunits on the surface of the molecule, found in all three eukaryotic classes, (b) class-specific carboxy-terminal extensions of the largest subunits (at the tail end of the enzyme), and (c) the variable, class-dependent set of 50-100 kDa subunits with "regulatory" functions. Although the flfl'a2-1ike structure is presumed to have undergone relatively limited change, the real extent of such change can only be revealed by gene-sequencing. In general, more conservation is expected in the front regions involved in catalysis than in the regulatory areas towards the back.

As described in the preceding section, the model suggests that the chromosome of sulfur-dependent bacteria and methanogens is extensively complexed with pro- teins. Another suggestion is that the unicellular algae Dinoflagellates, as their nucleo-protein is organized very differently from that of the other eukaryotes (Rizzo & Burghardt, 1982; Li Jing-Yan, 1984), should have RNA polymerases differing from other eukaryotic enzymes with regard to the 10-30 kDa subunits. No RNA polymerases have yet been purified from these organisms, although an o~-amanitin sensitive RNA polymerase activity has been observed in isolated Dinoflagellate nuclei (Rizzo, 1979).

In conclusion, I think that the message conveyed by the more than 300 subunits in Fig. I and by the observed sequence homologies, however blurred by the large

Page 13: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

S T R U C T U R E A N D E V O L U T I O N OF RNA P O L Y M E R A S E 313

evolutionary distances it bridges, allows to unify a growing number of scattered data into a picture of some coherence and amenable to experimental testing.

I would like to thank Arno Greenleaf, David Price, Stephen Johnston and Mike Ostrowski for useful criticisms and David Pickup for help with the ALIGN program.

Note Added In Proof

Through their recent cloning of the gene for the second largest subunit of yeast RNA polymerase II, Sweetser et al. (1987) arrive at some of the same conclusions presented here on the homologies between eukaryotic and prokaryotic RNA poly- merases. However, the authors' implication that a GTP/GDP-bind ing sequence and a DNA-binding "zinc finger" domain, both located near the C-terminus of the subunit, are involved in catalysis, is not consistent with the known roles of such sequences. They are conserved in a wide variety of proteins and in no case have been involved directly in nucleotide polymerization, but are rather associated with regulatory functions (la Cour et aL, 1985; Berg, 1986; Chowdhury et al., 1987). The latter notion is consistent with the regulatory roles ascribed here to the C-terminal regions of both large RNA polymerase subunits.

REFERENCES

ALLISON, L. A., MOYLE, M., SHALES, M. & INGLES, C. J. (1985). Cell 42, 599. ARMALEO, D. (1984). PhD Thesis, Duke University. ARMALEO, D. & GROSS, S. R. (1985a). J. bioL Chem. 260, 16169. ARMALEO, D. & GROSS, S. R. (1985b). J. biol. Chem. 260, 16174. BAROUDY, B. M. & MOSS, B. (1980). J. biol. Chem. 255, 4372. BERG, M. J. (1986). Science 232, 485. BIGGS, J., SEARLES, L. L. & GREENLEAF, m. L. (1985). Cell 42, 611. BROYLES, S. S. & MOSS, B. (1986). Proc. hath. Acad. Sci. U.S.A. 83, 3141. BUHLER, J-M., HUET, J., DAVIES, K. E., SENTENAC A. & FROMAGEOT, P. (1980). J. biol. Chem. 255,

9949. BUHLER, J-M., IBORRA, F., SENTENAC, A. & FROMAGEOT, P., (1976). 3'. biol. Chem. 251, 1712. CARPOUSlS, A. J. & GRALLA, J. D. (1985). J. tool. Biol. 183, 165. CARROLL, S. B. & STOLLAR, B. D. (1983). J. mol. Biol. 170, 777. CHAMBERLIN, M. J. (1982). In: The Enzymes. (Boyer, P. D. ed.). Vol. XV, pp. 61-86. New York:

Academic Press. CHENCHICK, A., BEABEALASHVlLL1, R. & MIRZABEKOV, A. (1981). FEBS Lett. 128, 46. CHOWDHURY, K., DEUTSCH, U. & GRUSS, P. (1987). Cell 48, 771. CORDEN, J. L., CADENA, D. L., AHEARN, J. M. JR & DAHMUS, M. E. (1985). Proc. hath. Acad. Sct

U.S.A. 82, 7934. DAHMUS, M. E. (1981a)..1. biol. Chem. 256, 3332. DAHMUS, M. E. (1981b). J. biol. Chem. 256, 11239. D'ALESSlO, J. M., PERNA, P. J. & PAULE, M. R. (1979a). J. biol. Chem. 254, 11282. D'ALESSIO, J. M., SPINDLER, S. R. & PAULE, M. R. (1979b). J. biol. Chem. 254, 4085. DAYHOFF, M. O., BARKER, W. C. & HUNT, L. T. (1983). Meth. Enzymol. 91,524. DEZELEE, S., WYERS, F., SENTENAC, A. & FROMAGEOT, P. (1976). Eur. J. Biochem. 65, 543. FREEDMAN, R. B. & HILLSON, D. A. (1980). In: The Enzymotogy o f Post-translational Modification o f

Proteins, Vol. I, (Freedman, R. B. & Hawkins, H. C. eds.) pp. 157-207. London: Academic Press. GAMPER, H. B. & HEARST, J. E. (1982). Cell. 29, 81. GISSINGER, F. & CHAMBON, P. (1975). FEBS Left. 58, 53. GUILFOYLE, T. J. (1980). Biochemistry 19, 5966. GU1LFOYLE, T. J., HAGEN, G. & MALCOM, S. (1984). J. biol. Chem. 259, 649. GUILFOYLE, T. J. & JENDRISAK, J. J. (1978). Biochemistry 17, 1860. HAGER, G. L., HOLLAND, M. J. & RUTTER, W. J. (1977). Biochemistry 16, 1. HANNA, M. M. & MEARES, C. F. (1983). Proc. natn. Acad. Sci. U.S.A. 80, 4238.

Page 14: J. theor. Biol. (1987) 127, 301-314 - Duke Universitypeople.duke.edu/~darmaleo/Structure and Evolution... · FIG. 2. A scheme of structural relationships among RNA polymerases. The

3 1 4 D. A R M A L E O

HILLEL, Z. • WU, C. (1977). Biochemistry 16, 3334. HOCHSCHILD, A., IRWIN, N. & PFASHNE, M. (1983). Cell 32, 319. HODO, H. G. & BLA'r'rl, S. P. (1977). Biochemistry 16, 2334. HORIKOSHI, M., TAMURA, H., SEK1MIZU, K., OBINATA, M. & NATORI, S. (1983). J. Biochem. 94, 1761. HUET, J., BUHLER, J-M., SENTENAC, A. FROMAGEOT, P. (1975). Proc. natn. Acad. Sci. U.S.A. 72, 3034. HUET, J., PHALENTE, L., BU'VI'IN, G., SENTENAC, A. & FROMAGEOT, P. (1982a). E M B O J. i, 1193. HUET, J., SCHNAaEL, R., SENTENAC, A. & ZILLIG, W. (1983). E M B O J . 2, 1291. HUET, J., SENTENAC, A. & FROMAGEOT, P. (1982b). J. bioL Chem. 257, 2613. INGLES, C. J., HtMMELFARB, H. J., SHALES, M., GREENLEAF, A. L. & FRIESEN, J. D. (1984). Proc.

hath. Acad. Sci. U.S.A. 81, 2157. JAEHNING, J. A., WOODS, P. S. & ROEDER, R. G. (1977). J. bioL Chem. 252, 8762. JENDRISAK, J. (1981). Plant PhysioL 67, 438. KEIM, P., HE1NRIKSON, R. L. & FITCH, W. M. (1981). J. tool BioL 151, 179. KINGSTON, R. E., NIERMAN, W. C. & CHAMBERLIN, M. J. (1981). J. biol. Chem. 256, 2787. LA COUR, T. F. M., NYBORG, J., THIRUP, S. & CLARK, B. F. C. (1985). E M B O J. 4, 2385. LAKE, J. A., CLARK, M. W., HENDERSON, E., FAY, S. P., OAKES, M., SCHEINMAN, A., THORNBER,

J. P. & MAH, R. A. (1985). Proc. natn. Acad. Sci. U.S.A. 82, 3716. LAKE, J. A., HENDERSON, E., OAKES, M. & CLARK, M. W. (1984). Proc. natn. Acad. Sci. U.S.A. 81, 3786. LEWIS, M. K. & BURGESS, R. R. (1982). In: The Enzymes, (Boyer, P. D. ed.) Vol. XV. pp. 109-153.

New York: Academic Press. L1 J1NG-YAN (1984). Biosystems 16, 217. MATSUI, T., ONISHI, T. & MARAMATSU, M. (1976). Eur. J. Biochem. 71, 351. MEISENBERGER, O., HEUMANN, H. & PILZ, I. (1980). FEBS Lett. 122, 117. OLLIS, D. L., BRICK, P., HAMLIN, R., XUONG, N. G. & STEITZ, T. A. (1985a). Nature 313, 762. OLLIS, D. L., KLINE, C. & STEITZ, T. A. (1985b) Nature 313, 818. PACE, N. R., OLSEN, G. J. & WOESE, C. R. (1986). Cell45, 325. PANKA, D. & DENNm, D. (1985). J. bioL Chem. 260, 1427. PAULE, M. R. (1981). TIBS 6, 128. REISB1G, R. R., WOODY, A. M. & WOODY, R. W. (1979). J. bioL Chem. 254, 11208. RIzzo, P. J. (1979). J. ProtozooL 26, 290. RIZZO, P. J. & BURGHARDT, R. C. (1982). BioSystems 15, 27. ROEDER, R. G. (1976). In: R N A Polymerase. Losick, R. Chamberlin, M. eds. pp. 285-329. Cold Spring

Harbor. ROSSMANN, M. G. & ARGOS, P. (1981). Ann. Rev. Biochem. 50, 497. SCHMITZ, A. & GALAS, D. J. (1979). Nucleic Acids Res. 6, 111. S1EaENLIST, U. (1979). Nature 279, 651. SIEBENLIST, U., SIMPSON, R. B., & GILBERT, W. (1980). Cell 20, 269. SIMPSON, R. B. (1979). Ceil 18, 277. SKLAR, V. E. F., SCHWARTZ, L. B. & ROEDER, R. G. (1975). Proc. ham. Acad. Sci. U.S.A. 72, 348. SMITH, S. S. & BRAUN, R. (1978). Eur. J. Biochem. 82, 309. SPINDLER, S. R., D'ALESSIO, J. M., DUESTER, G. L. & PAULE, M. R. (1978a). J. biol. Chem. 253, 6242. SPINDLER, S. R., DUESTER, G. L., D'ALESSIO, J. M. & PAULE, M. R. (1978b). J. biol. Chem. 253, 4669. STOECKEL, P., MAY, R., STRELL, I., CEJKA, Z., HOPPE, W., HEUMANN, H., ZILLIG, W. & CRESPI,

H. (1980). Eur. J. Biochem. 112, 411. STUNNENBERG, H. G., WENNEKES, L. M. J. & VAN DEN BROEK, H. W.J. (1979). Eur. J. Biochem.

98, 107. SWEETSER, D., NONET, M. & YOUNG, R. A. (1987). Proc. natn. Acad. Sci. U.S.A. 84, 1192. TEISSERE, M., PENON, P., AZOU, Y. & RICARD, J. (1977). FEBS Lett. 82, 77. THORNTON, J. M. (1981). 3". mot. Biol. 151, 261. VAlSlUS, A. C. & HORGEN, P. A. (1979). Biochemistry 18, 795. VALENZUELA, P., WEINBERG, F., BELL, G. t~ RU'I'I'ER, W. J. (1976). J. biol. Chem. 251, 1464. VON HIPPEL, P. H., BEAR, D. G., MORGAN, W. D. & McSWIGGEN, J. A. (1984). Ann. Rev. Biochem.

53, 389. WEEKS, J. R., COULTER, D. E. & GREENLEAF, A. L. (1982). J. biol. Chem. 257, 5884. WOESE, C. R. ~t. FOX, G. E. (1977). Proc. natn. Acad. Sci. U.S.A. 74, 5088. WOESE, C. R. & GUPTA, R. (1981). Nature 289, 95. YURA, T. & ISHIHAMA, A. (1979). Ann. Rev. Genet. 13, 59. ZILLIG, W. • STETTER, K. O. (1980). In: Genetics and Evolution o f R N A Polymerases t R N A and

Ribosomes. (Osawa, S., Ozeki, H., Uchida, H. & Yura, T., eds). pp. 525-538. Univ. of Tokio Press. ZILLlG, W,, STETTER, K. O. t~ JANEKOVIC, D. (1979). Eur. Z Biochem. 96, 597. ZILLIG, W., STEq-FER, K. O., WUNDERL, S., SCHULZ, W., PRIESS, H. & SCHOLZ, |. (1980). Arch.

MicrobioL 125, 259.