the peculiar evolution of apolipoprotein(a) in human and rhesus

6
Copyright 0 1994 by the Genetics Society of America The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus Macaque Graziano Pesole,* Angela Gerardi,* Fernando di Jesot and Cecilia Saccone* *Dipartimento di Biochimica e Biologia Molecolare, Universita di Bari e CSMME-CNR, Bari, Italy and tDipartimento di Biochimica, Z Cattedra di Chimica Biologica, Universita di Pavia, and Centro di Biochimica Cardiovascolare, Pavia, Italy Manuscript received June 8, 1993 Accepted for publication September 27, 1993 ABSTRACT Apo(a) is a low density lipoprotein homologous to plasminogen and has been shown to be involved in coronary atheroschlerosis. In the present paper we will try to analyze the interesting evolutionary pattern of Apo(a). The plasminogen gene contains 5 cysteine-rich sequences, called kringles, followed by a protease domain. Apo(a), probably arisen by duplication of an ancestral plasminogen gene, contains many tandemly repeated copiesof a sequence domain similar to the fourth kringle of plasminogen, 37 in human and at least 10 in the partially sequenced gene of rhesus, and the protease domain. We have found that the upstream kringles of apo(a) undergoMolecular Drive-like processes that produce high intraspecies similarity, whereas the downstream kringles evolve in a molecular clock-like manner and show an high interspecies sequence similarity. The latter regions are obviously suitable for dating the duplication event by which Apo(a) arose from plasminogen, but only if they evolve at the same rate in the two genes. Thus, we propose a “Molecular Clock Test” for assessing whether the comparison of two paralogous genes (or gene regions) can give reliable information on the dating of their origin by duplication. Applying this test to the kringle-4 domain of apo(a) and plasminogen gene, we demonstrate that the separation between the two genes by duplication dates back at about 90 Mya immediately before the radiation of mammals. LP(A) is an low density lipoprotein (LDL) that carries one copy of a protein called Apo(a)joined to apo B-100 by a disulphide linkage. High levels of Lp(a), ie., above 30 mg/dl, double the risk of coronary atherosclerosis (BERG, DALHENand FRICK 1974; UTERMANN 1989). When LDL and Lp(a) both have a high concentration, the relative risk rises about five- fold (cf. BROWN and GOLDSTEIN 1987). MCLEAN et al. (1987) showed that Apo(a) is a func- tionally altered relative of plasminogen, the precursor of plasmin, which dissolves fibrin clots. Plasminogen, a protein of 791 amino acids, contains 5 cysteine-rich sequences of 80- 114 amino acids each, called kringles, followed by a serine protease domain. Each kringle contains three internal disulphide bridges, producing a structure resembling the Danish cake from which it derives its name. Kringles are also found in other proteases of the coagulation system, including tissue plasminogen activator (TPA) and prothrombin (PAT- THY 1985). In plasminogen, kringles promote the binding to the substrate fibrin (cf. BROWN and GOLD- STEIN 1987). Apo(a) is one of the most polymorphic expressed sequences in the human genome as to date, at least 34 alleles, differing in the number of tandemly re- peated sequences homologous to kringle 4 of plas- monogen, have been characterized (LACKNER et al. 1991; LACKNER, COHEN and HOBBS 1993). Genetics 136: 255-260 (January, 1994) The Apo(a) gene has been completely sequenced in human and partially in rhesus monkey. The human gene contains a hydrophobic signal sequence followed by: 37 copies of asequencedomain similar to the fourth kringle of plasminogen, one copy of a sequence homologous to plasminogen kringle 5, and the pro- tease domain. The rhesus Apo(a) sequence, which is still undetermined at the N-terminus, contains at least 10 copies of kringle 4-like sequence domain and the protease domain (TOMLISON, MCLEAN and LAWN 1989). The protease domain contained in Apo(a) is likely to have lost the proteolytic activity because the argi- nine in the site cleaved by TPA in plasminogen is changed into serine both in human and rhesus ma- caque. In addition, the three amino acids of the cata- lytic thriad are intact in human but two of them have been mutated in rhesus macaque. The evolutionary origin of Apo(a) and its functional role is still obscure. T o shed light on this issue, we report an accurate evolutionary analysis on Apo(a) and plasminogen genes for Human and Rhesus Ma- caque. We have found that the various regions of the two genes have a different evolutionary behavior. This implies that each protein may evolve under dif- ferent evolutionary pressures. In particular, some do- mains undergo Molecular Drive-like processes that homogenize their nucleotide sequence composition,

Upload: lynhu

Post on 08-Feb-2017

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus

Copyright 0 1994 by the Genetics Society of America

The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus Macaque

Graziano Pesole,* Angela Gerardi,* Fernando di Jesot and Cecilia Saccone*

*Dipartimento di Biochimica e Biologia Molecolare, Universita di Bari e CSMME-CNR, Bari, Italy and tDipartimento di Biochimica, Z Cattedra di Chimica Biologica, Universita di Pavia, and Centro di Biochimica Cardiovascolare, Pavia, Italy

Manuscript received June 8, 1993 Accepted for publication September 27, 1993

ABSTRACT Apo(a) is a low density lipoprotein homologous to plasminogen and has been shown to be involved

in coronary atheroschlerosis. In the present paper we will try to analyze the interesting evolutionary pattern of Apo(a). The plasminogen gene contains 5 cysteine-rich sequences, called kringles, followed by a protease domain. Apo(a), probably arisen by duplication of an ancestral plasminogen gene, contains many tandemly repeated copies of a sequence domain similar to the fourth kringle of plasminogen, 37 in human and at least 10 in the partially sequenced gene of rhesus, and the protease domain. We have found that the upstream kringles of apo(a) undergo Molecular Drive-like processes that produce high intraspecies similarity, whereas the downstream kringles evolve in a molecular clock-like manner and show an high interspecies sequence similarity. The latter regions are obviously suitable for dating the duplication event by which Apo(a) arose from plasminogen, but only if they evolve at the same rate in the two genes. Thus, we propose a “Molecular Clock Test” for assessing whether the comparison of two paralogous genes (or gene regions) can give reliable information on the dating of their origin by duplication. Applying this test to the kringle-4 domain of apo(a) and plasminogen gene, we demonstrate that the separation between the two genes by duplication dates back at about 90 Mya immediately before the radiation of mammals.

LP(A) is an low density lipoprotein (LDL) that carries one copy of a protein called Apo(a) joined

to apo B-100 by a disulphide linkage. High levels of Lp(a), ie., above 30 mg/dl, double the risk of coronary atherosclerosis (BERG, DALHEN and FRICK 1974; UTERMANN 1989). When LDL and Lp(a) both have a high concentration, the relative risk rises about five- fold (cf. BROWN and GOLDSTEIN 1987).

MCLEAN et al. (1987) showed that Apo(a) is a func- tionally altered relative of plasminogen, the precursor of plasmin, which dissolves fibrin clots. Plasminogen, a protein of 791 amino acids, contains 5 cysteine-rich sequences of 80- 1 14 amino acids each, called kringles, followed by a serine protease domain. Each kringle contains three internal disulphide bridges, producing a structure resembling the Danish cake from which it derives its name. Kringles are also found in other proteases of the coagulation system, including tissue plasminogen activator (TPA) and prothrombin (PAT- THY 1985). In plasminogen, kringles promote the binding to the substrate fibrin (cf. BROWN and GOLD- STEIN 1987).

Apo(a) is one of the most polymorphic expressed sequences in the human genome as to date, at least 34 alleles, differing in the number of tandemly re- peated sequences homologous to kringle 4 of plas- monogen, have been characterized (LACKNER et al. 1991; LACKNER, COHEN and HOBBS 1993).

Genetics 136: 255-260 (January, 1994)

The Apo(a) gene has been completely sequenced in human and partially in rhesus monkey. The human gene contains a hydrophobic signal sequence followed by: 37 copies of a sequence domain similar to the fourth kringle of plasminogen, one copy of a sequence homologous to plasminogen kringle 5 , and the pro- tease domain. The rhesus Apo(a) sequence, which is still undetermined at the N-terminus, contains at least 10 copies of kringle 4-like sequence domain and the protease domain (TOMLISON, MCLEAN and LAWN 1989).

The protease domain contained in Apo(a) is likely to have lost the proteolytic activity because the argi- nine in the site cleaved by TPA in plasminogen is changed into serine both in human and rhesus ma- caque. In addition, the three amino acids of the cata- lytic thriad are intact in human but two of them have been mutated in rhesus macaque.

The evolutionary origin of Apo(a) and its functional role is still obscure. T o shed light on this issue, we report an accurate evolutionary analysis on Apo(a) and plasminogen genes for Human and Rhesus Ma- caque. We have found that the various regions of the two genes have a different evolutionary behavior. This implies that each protein may evolve under dif- ferent evolutionary pressures. In particular, some do- mains undergo Molecular Drive-like processes that homogenize their nucleotide sequence composition,

Page 2: The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus

256 G. Pesole et al.

and as a consequence these regions appear more sim- ilar within the species than between species. On the other hand, other domains evolve in a molecular clock-like manner. The latter regions are obviously suitable for dating the duplication event but only if they evolve at the same rate.

In this paper we we propose a “Molecular Clock Test” for assessing whether the comparison of two paralogous genes (or gene regions) can give reliable information on the dating of their origin by duplica- tion. Applying this test to the kringle-4 domains of Apo(a) and plasminogen genes, we demonstrate that the separation between the two genes by duplication is more ancient than 40 Mya, during primate evolu- tion, as claimed by MCLEAN et al. (1987); but it dates back at about 90 Mya immediately before the radia- tion of mammals. This implies Apo(a) should be pres- ent not only in primates but also in other mammals.

MATERIALS AND METHODS

Human and Rhesus Macaque Apo(a) and plasminogen genes were extracted from the EMBL database (accession numbers: X06290,J04635, X05199,J04697).

The multiple alignment of nucleotide sequences used in the evolutionary analysis has been constructed following that of protein sequences calculated by using the program PILEUP (GCG Package, 1993). The multiple alignments are available upon E-mail request from the authors at PE- [email protected].

The nucleotide substitution rates have been calculated by using the Stationary Markov Process (SMC) program devel- oped in our laboratory (SACCONE et al. 1990) or the Li method (LI 1993). NEIGHBOR and DRAWGRAM pro- grams of the PHYLIP package were used to draw phyloge- netic trees (FELSENSTEIN 1990) using relative times of diver- gence calculated by the SMC.

RESULTS

Evolutionary analysis of serine protease domain: It has been reported that bovine and human plasmin- ogen genes, in both the protease domain and in the 3“untranslated region, are more divergent than are the human plasminogen and human Apo(a) sequences, thus suggesting that Apo(a) comes from a recent du- plication event (MCLEAN et al. 1987). An alternative explanation, supported by the observation that Apo(a) and plasminogen are located very close to each other on chromosome 6 (FRANK et al. 1988), is that the high sequence similarity between Apo(a) and plasminogen derives from exchange of DNA sequence as a result of Molecular Drive phenomena.

The evolution of the protease domain has already revealed interesting features. OTHA and BASTEN (1 992) have observed gene conversion with selection in proteases, which is made evident by a higher sub- stitution rate of the reactive center with respect to the other regions.

The protease domains of Apo(a) and plasminogen

TABLE 1

Evolutionary rate (subs/site. 10-7 calculated according to the Stationary Markov Process (SMC) and Li methods between

human and rhesus monkey Protease domains of Apo(a) and Plasminogen

Nonsynonymous rate Synonymous rate

SMC Li SMC Li

Apo(a) 1.41 f 0.32 1.58 f 0.28 1 . 2 9 f 0.37 1.86k 0.50 Plasminogen 0.85 f 0.22 0.73 f 0.18 1.38 f 0.41 1.58 f 0.42

genes are paralogous, i .e. , they are the descendants of a duplicated ancestral gene (CREIGTON and DARBY 1989). After gene duplication, they are likely to have evolved under different functional constraints. If nu- cleotide substitutions accumulate at different rates in the two paralogous genes, their comparative analysis cannot provide quantitative distance estimates. An exact estimate of the date of gene duplication can only be made if the two genes show to have evolved at comparable rates.

If the same Clock applies to Apo(a) and plasmino- gen, the same nucleotide substitution rates on synon- ymous and nonsynonymous positions should be ob- served for both genes in human-rhesus macaque comparisons. Thus, we calculated the nucleotide substitution rate of both Apo(a) and plasminogen be- tween human and rhesus.

Table 1 shows the nucleotide substitution rates, calculated with both SMC and Li method on synony- mous and nonsynonymous positions of Apo(a) and plasminogen, between human and rhesus monkey. In Apo(a) the replacement rate is significantly higher than in plasminogen and is comparable to the synon- ymous rate. This can be explained if the protease domain of Apo(a), deprived of its original proteolytic activity, began to accumulate neutral mutations in all codon positions, like a pseudogene. Alternatively, if the protease domain of Apo(a), after duplication, ac- quired a new functional role, the higher substitution rate observed on nonsynonymous positions, could be the result of positive selection (CREIGTON and DARBY 1989). In both cases the Molecular Clock is not obeyed and quantitative estimates of gene distance are unre- liable.

Evolutionary analysis of the 3’-untranslated re- gion: 3”noncoding regions of Apo(a) and plasmino- gen mRNAs, each about 250 nucleotides long, show high sequence similarity, 87% in human and 88% in rhesus monkey.

MCLEAN et al. (1987), assuming that differences accumulate in noncoding regions at a rate of about 0.3% per million years, as in noncoding sequences of the beta-globins of human and chimpanzee, dated the duplication about 40 million years ago, during pri-

Page 3: The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus

Peculiar Evolution of Apo(a) 257

TABLE 2

Evolutionary rate (subs/site. lo-*) calculated according to the Stationary Markov Process (SMC) between human and rhesus monkey 3’-untranslated region of Apo(a) and Plasminogen

Nucleotide substitution rate

Plasminogen 1.46 zk 0.48 APO(4 3.28 f 0.74

The rates are statistically different as proved by the t-test (t = 20.5, P = 0.00).

mate evolution. If this is the case Apo(a) gene should be restricted to primates only.

We applied the same test shown before, i .e . , the estimate of the nucleotide substitution rate, to the orthologous genes of human and rhesus monkey.

Table 2 shows the nucleotide substitution rate cal- culated between human and rhesus monkey 3’-non- coding regions in both Apo(a) and plasminogen. It is striking to note that the nucleotide substitution rate is more than double in Apo(a) with respect to the plasminogen. This demonstrates that the comparison of the untranslated regions of Apo(a) and plasminogen cannot provide reliable information about sequence divergence, i .e. , they cannot be used as Molecular Clock. In the light of these data, the proposal of MCLEAN et al. (1987) that the gene duplication oc- curred 40 Mya is unreliable.

Evolutionary analysis of the kringle domain: Hu- man Apo(a) gene contains 37 copies of a sequence resembling the plasminogen kringle 4. For the rhesus macaque Apo(a) gene, only partially sequenced, 10 copies of the kringle 4 sequences are available. Indeed, it has been shown that the human alleles can contain a variable number of kringle 4 sequences, in the range 12-5 1 (LACKNER et al. 1991; LACKNER, COHEN and HOBBS 1993). A similar size variability has been de- tected in baboon (HIXSON et al. 1989).

Apo(a) kringles will be in the following denoted k41-k437, according to MCLEAN et al. (1987). All cysteine residues, involved in the three disulphide bonds which generate the typical kringle structure are maintained in both human and rhesus monkey Apo(a). In K436 Apo(a) presents an extra unpaired cysteine, responsible for the disulphide linkage with apo B 100. Human and Rhesus macaque K4ss share a 8 aminoacid deletion.

The 37 kringles of humans can be classified into 11 different types: K42-23 and K425-26 are identical; K424 and K427-29, which differ from the previous kringle repeats by three nucleotides only; the remaining krin- gles, K41 and K430-37, are unique sequences. In rhesus monkey, all 10 kringles, K428-37, are unique se- quences.

The presence of tandemly repeated kringle domains showing even 100% similarity suggest that the phe-

nomenon that produces locus expansion or contrac- tion is currently at work, in Apo(a) gene, and thus individuals of the same species may present size poly- morphic alleles with a variable number of kringle repeats. Indeed, considerable size heterogeneity is shown by human Apo(a), which exists as discrete gly- coprotein isoform variants that range a molecular mass from approximately 400-800 kDa (GAUBATZ et al. 1990; KOSCHINSKY et al. 1990).

Northern blot analysis has demonstrated that tran- script sizes are variable (8-1 2 kb) and in all cases closely correlated with protein masses as determined from immunoblots. Thus, it is very likely that Apo(a) isoform size variation is due to allelic differences in the number of tandemly repeated kringle sequences (KOSCHINSKY et al. 1990). LACKNER, COHEN and HOBBS (1993) estimated that at least 34 different alleles can be detected, and isoforms with as few as 12 and as many as 51 kringle repeats can be found in human plasma. The above authors have proposed that the high degree of length polymorphism is partly due to recombination between sister chromatids.

The SMC method has been applied to all kringle repeats of human and rhesus Apo(a) in order to try the reconstruction of their evolutionary origin. Figure 1 shows the calculated phylogenetic tree obtained by averaging the results on synonymous and nonsynon- ymous codon positions.

It is interesting to note that, starting from K431, each kringle in human is more closely related to the correspondent kringle in rhesus monkey than to the other kringles of the same gene. This means that the multiple duplication of kringle-4-like repeats, at least for the eight downstream kringles, occurred before the human-rhesus split monkey and not as recently as 2-3 Mya as claimed by IKEO, TAKAHASHI and GOJO- BORI (1 99 1). Indeed, if this was the case, considering that the split between human and rhesus monkey is dated at about 25 Mya (SACCONE et al. 1990) or 45 Mya as reported by IKEO, TAKAHASHI and GOJOBORI (1 991), we should observe that all human kringle-4- like repeats clustered together as well as the rhesus monkey counterparts. This happens only for upstream kringles that appear instead more closely related be- tween themselves than to the corresponding regions of the other species.

This supports the hypothesis that K 4 ~ ~ - ~ 7 , the seven downstream kringles of Apo(a), unlike upstream krin- gles were not subject to unequal crossing-over or gene conversion events, two phenomena involved in the Molecular Drive, which generates sequence homoge- neity between DNA tracts in the same species (DOVER 1982).

Looking at the phylogenetic tree shown in Figure 1 , we can also argue that the ancestral Apo(a) gene, before the human-rhesus monkey split, contained at

Page 4: The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus

258 G. Pede

rl K4-

rl 1 - K4~52

- K ~ H ,

- K4m,

H m r n - R h W U S rn(25MYa)

FIGURE 1.-Phylogenetic tree of the kringle repeats of rhesus monkey and human Apo(a) obtained with the SMC method (SAG CONE et 01. 1990) averaging results of synonymous and nonsynony- mous codon positions. The grey band covers Human-Rhesus Ma- caque split nodes for K431-37. Ancestral kringles-4-like repeats ex- isting before the human-rhesus monkey split are denoted in roman numbers.

least eight kringle-4-like repeats. The higher intra- species similarity of Apo(a) upstream kringles suggests that they were affected by Molecular Drive mecha- nisms, which homogenized sequence divergence. In this context, only last seven kringles, unique se- quences, are potentially useful for drawing quantita- tive evolutionary estimates.

To verify this hypothesis, we have carried out the Molecular Clock test shown before for protease do- main and noncoding regions. We have calculated the synonymous and nonsynonymous rates for the krin- gles of Apo(a) and the K4 kringle of plasminogen between human and rhesus monkey. The data re- ported in Table 3 show that Apo(a) downstream krin-

et al.

TABLE 3

Evolutionary rate (subs/site. 1 0 3 calculated aceording to the Stationary Markov Process (SMC) and Li methods between human and rhesus monkey kringle domain (downstream

kringles 31-37 only) of Apo(a) and Plasminogen

Nonsynonymous rate Synonymous rate

SMC Li SMC Li

Apo(a) 0.80 f 0.27 0.82 f 0.10 1.10 f 0.38 1.40 f 0.22 Plasminogen 0.70 & 0.20 0.68 f 0.08 1.15 f 0.35 1.60 f 0.24

I apo(a)K%,.,, RHESUS

Plasminogen K4 HUMAN

I Plasminogen K4 RHESUS

I I I I

100 50 Time (Mya)

FIGURE 2.-Phylogenetic tree calculated by comparing Apo(a) K431-31 and a seven-fold repeat of kringle 4 of plasminogen between human and rhesus monkey. The tree is calibrated by fixing the split between human and rhesus monkey at 25 Mya (SACCONE ef al. 1990).

1 MOLECULAR CLOCK I

11 12 33 14 15 16 17 Pa- domm

FIGURE 3.-Schematic representation of human and rhesus mon- key Apo(a) genes.

gles have an evolutionary rate comparable to that of kringle 4 of plasminogen, both at synonymous and nonsynonymous positions, in the human-rhesus mon- key comparison. Therefore, this sequence domain in Apo(a) is under the same evolutionary pressure as the homologous domain in plasminogen and thus it can be used as a reliable Molecular clock.

Figure 2 reports the phylogenetic tree calculated by comparing Apo(a) K491-37 and a seven-fold repeat of kringle 4 of plasminogen between human and rhesus monkey. It is striking to note that the diver- gence time between human and rhesus, calculated on K431-37, is the same as that calculated on plasminogen K4. In addition, both nonsynonymous and synony- mous positions give the same tree, thus supporting the

Page 5: The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus

Peculiar Evolution of Apo(a) 259

reliability of this sequence region for quantitative estimates. Fix the time of divergence between human and rhesus monkey at 25 Mya (SACCONE et al. 1990), then the gene duplication which originated Apo(a) can be dated about 93 Mya, much earlier the 40 Mya estimated by MCLEAN et al. (1987) and in line with the 80 Mya estimate of IKEO, TAKAHASHI and GOJO- BORI (1 99 1). This means that Apo(a) is not restricted to primates but should be also present in all mamma- lian species that are thought to have originated 80- 100 Mya. However, it is likely that not all mammals have a functional Lp(a) particle. Indeed, at least three distinct events must happen after duplication of plas- minogen to create functional Lp(a) particles: (a) de- velopment of the crucial cysteine, which allows the disulphide bridge interaction with the apo-B100 in LDL; (b) large expansion of the kringle 4 copy num- ber; (c) loss of the protease activity (probably by alter- ation of the arginine at the site cleaved in plasminogen when the zymogen is converted to the active plasmin).

DISCUSSION

Two genes are said to be paralogous if they are derived from a duplication event, orthologous if they are derived from a speciation. Apo(a) and Plasmino- gen are thus paralogous genes and the time of gene duplication can be reliably estimated by measuring the nucleotide substitutions between the two genes if both gene evolved at the same rate. This can be verified through a Molecular Clock test that counts the nucleotide substitution rate for both Apo(a) and plasminogen genes or portions of them separately in orthologous comparisons ( i .e . , human vs. rhesus mon- key). This test has been applied to the 3”untranslated region, to the protease domain and to the K4 region of the two genes in the couple human-rhesus monkey. This test has shown that only the K4 region, in partic- ular K431-37, may be suitably used for dating gene duplication. This DNA segment indeed, both in Apo(a) and in plasminogen evolves at the same rate between human and rhesus monkey. Fixing the diver- gence time human/rhesus monkey at 25 Mya (SAC- CONE et al. 1990), the duplication event dates back about 90 Mya, immediately before the radiation of mammals that occurred about 75-80 Mya. The same divergence time was obtained when the K4 of mouse plasminogen was included in the analysis and the very reliable human/rodent divergence time of 75 Mya was used as calibration (DAYHOFF 1972). If the dupli- cation event has been dated correctly, Apo(a) may not be restricted to primates and might be found in other mammals. These conclusion is in accord with the earlier finding of LAPLAUD et al. (1988) and RATH AND PAULING (1990), who detected Apo(a) in the hedgehog and guinea pig, nonprimate mammals.

One of the outstanding result of this work-the

intriguing evolutionary pattern of Apo(a) kringle re- peats-is schematically shown in Figure 3. These re- peats can be clearly classified into two groups, the upstream kringles showing a higher intraspecies simi- larity that denotes the presence of Molecular Drive mechanisms; and the downstream kringles that, on the contrary, show higher interspecies similarity and a clocklike behavior. Obviously, due to the stochastical nature of concerted evolution processes, more se- quence data might confirm more strictly the clear-cut division between the two kringle regions.

The expansion/contraction of Apo(a) gene locus is also cosupported by the data of HELMOLD et al. ( 1 99 l), LACKNER et al. (1991) and LACKNER, COHEN and HOBBS (1 993), who investigated Apo(a) polymorphism in various ethnic groups. They found that the greatest ethnic variation is observed in plasma Lp(a) concen- trations associated with the high molecular weight Apo(a) polymorphs. Our data suggest that contrac- tion/expansion of gene locus occur at level of the K4 region, but only upstream kringles are involved.

The biological significance of polymorphic forms of Apo(a) is still unclear but a highly significant inverse correlation was found between the molecular weight of Apo(a) and the plasma Lp(a) concentration (BERG, DALHEN and FRICK 1974; UTERMANN 1989). This suggests that locus expansion should preserve from the risk of coronary atherosclerosis.

Another interesting outcome of our evolutionary analysis is that the serine protease domain of Apo(a) has should have not conserved its protease activity.

This work was financed by Progetto Finalizzato Biotecnologie e Biostrumentazioni (CNR, Italy), Progetto Finalizzato Ingegneria Genetica (CNR, Italy) and Murst, Italy.

LITERATURE CITED

BERG, K., G. DAHLEN and M. H. FRICK, 1974 Lp(a) lipoprotein and pre-betal-lipoprotein in patients with coronary heart dis- ease. Clin. Genet. 6: 230-235.

BROWN, M. S., and J. L. GOLDSTEIN, 1987 Plasma lipoproteins: teaching old dogmas new tricks. Nature 3 3 0 1 13-1 14.

CREIGTON, T. E., and N. J. DARBY, 1989 Functional evolutionary divergence of proteolytic enzymes and their inhibitors. Trends Bio. Sci. 14 319-324.

DAYHOFF, M. O., 1972 Atlas of Protein Sequence and Structure, Vol. 5 , National Biomedical Research Foundation, Silver Spring, Md.

DOVER, G., 1982 Molecular Drive: a cohesive mode of species evolution. Nature 9 9 9 11 1-1 17.

FELSENSTEIN, J., 1990 PHYLIP manual version 3.3. University Herbarium, University of California, Berkeley.

FRANK, S. L., I. KLISAK, R. S. SPARKES, T. MOHANDAS, J. E. TOMLISON, et al., 1988 The apolipoprotein(a) gene resides on human chromosome 6q26-27 in close proximity to the homol- ogous gene for plasminogen. Hum. Genet. 7 9 352-356.

GAUBATZ, J. W., K. I. GHANEM, J. GUEVARA, JR., M. L. NAVA, W. PATSCH, et al., 1990 Polymorphic forms of human apolipo- protrein(a): inheritance and relationship of their molecular weights to plasma levels of lipoprotein(a). J. Lipid Res. 31: 603- 61 3.

Page 6: The Peculiar Evolution of Apolipoprotein(a) in Human and Rhesus

260 G. Pesole et al.

GENETICS COMPUTER GROUP, 1993 Program Manual for the GCG Package, Version 7.3, April 1991,575 Science Drive, Madison, Wisconsin, 5371 1.

HELMHOLD, M., J. BIGGE, R. MUCHE, J. MAINOO, J. THIERY, et al., 199 1 Contribution of the Apo(a) phenotype to plasma Lp(a) concentrations shows considerable ethnic variation. J. Lipid Res. 32: 1919-1928.

HIXSON, J. E., M. L. BRITTEN, G . SCOTT MANIS and D. L. RAIN- WATER, 1989 Apolipoprotein(a) (Apo(a)) glycoprotein iso- forms result from size differences in Apo(a) mRNA in baboons. J. Biol. Chem. 264: 6013-6016.

IKEO, K., K. TAKAHASHI and T. GOJOBORI, 1991 Evolutionary origin of numerous kringles in human and simian apolipopro- tein(a). FEBS Lett. 287: 146-148

KOSCHINSKY, M. L., U. BEISIEGEL, D. HENNE-BRUNS, D. L. EATON and R. L. LAWN. 1990 Apolipoprotein(a) size heterogeneity is related to variable number of repeat sequences in its mRNA. Biochemistry 2 9 640-644.

LACKNER, C., J. C. COHEN and H. H. HOBBS, 1993 Molecular definition of the extreme size polymorphism in Apolipopro- tein(a). Hum. Mol. Genet. 2: 933-940.

LACKNER, C., E. BOERWINKLE, C. C. LEFFERT, T. RAHMIG and H. H. HOBBS, 1991 Molecular basis of apolipoprotein(a) isoform size heterogeneity as revealed by pulsed-field electrophoresis. J. Clin. Invest. 87: 2153-2161.

LAPLAUD, P. M., L. BEAUBATIE, S. C. RALL, JR., G . Luc and M. SABOUREAU, 1988 Lipoprotein(a) is the major apoB-contain- ing lipoprotein in the plasma of a hibernator, the hedgehog (Erinaceus europaeus). J. Lipid Res. 2 9 1 157-1 170.

LI, W.-H., 1993 Unbiased estimation of the rates of synonymous and non-synonymous substitutions. J. Mol. Evol. 36: 96-99.

MCLEAN, J. W., J. E. TOMLISON, W.-I. KUANG, D. L. EATON, E. Y. CHEN, et al., 1987 cDNA sequence of human apolipopro- tein(a) is homologous to plasminogen. Nature 3 3 0 132-1 37.

OTHA, T., and C. J. BASTEN, 1992 Gene conversion generates hypervariability at the variable regions of kallikreins and their inhibitors. Mol. Phylogenet. Evol. 1: 87-90.

PATTHY, L., 1985 Evolution of the proteases of blood coagulation and fibrinolysis by assembly from modules. Cell 41: 657-663.

RATH, M. and L. PAULING, 1990 Immunological evidence for the accumulation of lipoprotein(a) in the atherosclerotic lesion of the hypoascorbemic guinea pig. Proc. Natl. Acad. Sci. USA 87: 9388-9390.

SACCONE, C., C. LANAVE, G. PESOLE and G. PREPARATA, 1990 The Influence of base composition on quantitative es- timates of gene evolution. Methods Enzymol. 183: 570-583.

TOMLISON, J. E., J. W. MCLEAN and R. M. LAWN, 1989 Rhesus monkey apolipoprotein(a). J. Biol. Chem. 264 5957-5965.

UTERMANN, G . 1989 The mysteries of lipoprotein(a). Science 246: 904-9 10.

Communicating editor: W.-H. LI