genetic linkage of lung cancer-associated mspi

J. Biochem. 110, 407-411 (1991)

Genetic Linkage of Lung Cancer-Associated MspI Polymorphisms with

Amino Acid Replacement in the Heme Binding Region of the Human

Cytochrome P450IA1 Gene1

Shin-ichi Hayashi,* Junko Watanabe,* Kei Nakachi,** and Kaname Kawajiri*"Department of Biochemistry and "Department of Epidemiology, Saitama Cancer Center Research Institute, Ina-machi, Kitaadachi-gun, Saitama 362

Received for publication, April 25, 1991

Individuals with high genetic risk of lung cancer had previously been identified by Mspl

polymorphisms of the cytochrome P450IA1 gene. In the present study we analyzed the

structures of individual P450IA1 genes by PCR direct sequencing of genomic DNA of each

genotype raised by the MspI polymorphisms, which were ascribed to a single point

mutation in the 3•L-flanking region. We then found a novel point mutation in the coding

region of the gene which results in the substitution of Ile for Val at residue 462 in the heme

binding region. We further analyzed the genetic association between this amino acid

replacement and MspI polymorphisms in the general population, using a new method to

detect polymorphisms not recognized by restriction enzymes. The results showed that there

are at least two forms of human P450IA1 protein with different primary structures and that

one of the forms is closely linked with the lung cancer-susceptible genotype of MspI

polymorphisms. Thus MspI polymorphisms, which are associated with increased risk of

lung cancer, are linked to at least one amino acid substitution, which gives an important

clue, at the molecular level, toward elucidation of increased susceptibility to lung cancer.

A large proportion of human cancers are known to be caused

by synthetic or natural chemical compounds in the environ

ment (1, 2). Many chemical carcinogens are metabolically

activated to forms that have deleterious effects on organ-

isms (3), and this metabolic activation is an obligatory

initiation step in human chemical carcinogenesis. The

microsomal electron transport system, including cyto

chrome P450s, plays the most important role in oxidation

of chemical carcinogens (4, 5), and involves a variety of

isozymes of P450. This oxidative activation shows genetic

variation (4, 6), which may be responsible for individual

differences in susceptibility to chemical carcinogenesis.

Cytochrome P450IA1 is important in the initiation of lung

cancer because it is responsible for the activation to

mutagens of benzo [a] pyrene and other aromatic hydrocar

bons in cigarette smoke (7, 8).

We recently found a close correlation between develop

ment of lung cancer and MspI polymorphisms in the

3•L-flanking region of the P450IA1 gene, where three

genotypes were determined the predominant homozygote

(genotype A), the heterozygote (genotype B), and a homo

zygous rare allele (genotype C) (9). Namely, the incidence

of genotype C of the P450IA1 gene was 21.2% in lung cancer

patients, but only 10.6% in healthy controls. This genetic

change was specifically correlated with increased risk of the

Kreyberg I type of lung cancer, which is closely associated

with cigarette smoking, but not with the Kreyberg II type.

Furthermore, our case-control study showed that the

relative risk of genotype C was much higher (7.31-fold)

than those of genotypes A and B at low levels of cigarette

consumption (10). Therefore, it is indispensable to investi

gate whether this MspI polymorphism is genetically as

sociated with differences in protein structure or gene

expression of P4501A1. On the basis of this investigation,

the interindividual difference in the basal or induction

response of P450IA1 can be elucidated.

In the present study we analyzed individuals with

genotypes A and C by PCR direct sequencing. We found that a novel point mutation in genotype C resulted in an

amino acid replacement in the heme binding region of

P4501A1 and also showed that at least two forms of

P4501A1 protein with different primary structures exist

among humans. We further determined the genetic associa

tion of this novel mutation with MspI polymorphisms in the

general population, developing a new detection method to

identify the polymorphisms not recognized by restriction

enzymes. It was found that these two loci, causing the

observed DNA polymorphisms, were very closely linked.

EXPERIMENTAL PROCEDURES

Materials-Restriction endonucleases were purchased

from Takara Shuzo (Kyoto). [ƒ¿-35S]dATP (1,000Ci/

mmol) and [ƒÁ-32P]ATP (5,000Ci/mmol) were obtained

from Amersham (Arlington Heights, IL). All other re-

agents were of the highest quality commercially available.

Isolation and Sequence Analysis of the 3•L-Region of the

P4501A1 Gene-A human gene library of Charon 4A

1 This work was supported in part by Grants-in-Aid for Scientific Research from the Ministry of Education, Science and Culture of Japan, and a research grant from the Ministry of Health and Welfare

of Japan.Abbreviations: P450IA1, previously called P450c; PCR, polymerase chain reaction; RFLPs, restriction fragment length polymorphisms; XRE, xenobiotic responsive element; BTE, basic transcription element.

Vol. 110, No. 3, 1991 407

408 S. Hayashi et al.

bacteriophage was re-screened using an EcoRI-EcoRI

1-kbp fragment of the Tend of the clone AhP450mc-1 as a

probe (11). The clone ƒÉhP450mc-2 which contained the

3•L-flanking region of the gene was isolated, and its EcoRI-

EcoRI 4-kbp fragment was subcloned in pUC18 (named as

phP450mc-2). Unidirectionally deleted subclones gener

ated by the Erase-a-Base system (Promega, Madison, WI)

were used to obtain sequence information on the 3•L-flanking

region. Sequencing was carried out for both strands by the

chain-termination method (12).

Detection of MspI Polymorphisms-Individual DNAs

were isolated from peripheral lymphocytes in blood sam

ples obtained from a cohort of 2,500 Japanese persons all

over 40 years old. The genotypes of the P450IA1 gene

ascribed to the Mspl site were identified as RFLPs by the

polymerase chain reaction (PCR) (13) and were incomplete

agreement with our previous Southern blotting (9). The

PCR-amplified DNA fragments including the polymorphic

site were digested with Mspl and subjected to electropho

resis in 1.8% agarose gel.

PCR Direct Sequencing of Individual Genomic DNAs-

DNA fragments of 0.5 to 1.5kbp were prepared by

PCR-amplification and used as templates for direct se

quencing. Synthesized 21-nucleotide oligomers were end-labeled with [ƒÁ-32P]ATP by kination, and 4pmol of the

labeled primers were used for sequencing reaction together

with 200ng of purified PCR template using a Sequenase

sequencing kit (USB, Cleveland, OH). Oligo-nucleotides

for sequencing primers and PCR primers (named as C1 to

C51) were synthesized on an Applied Biosystems Model

381 DNA sythesizer.

Detection of Polymorphisms Not Recognized by RFLP-

Two oligonucleotides of 20 mer (primer 2a, primer 2b),

both of which contained the polymorphic site at the 3•L end,

were synthesized and each of them was used as a primer for

PCR-amplification together with another strand of 21 mer

primer (primer 1) which is located about 200 by upstream of a polymorphic site detected by sequencing. PCR was

performed by 25 cycles under the following conditions: 1

min at 95•Ž for denaturation and 1 min at 70•Ž for primer

annealing and primer extension. The other conditions were

as described by Saiki et al. (13). The PCR products were

then subjected to electrophoresis in 1.8% agarose gel.

RESULTS AND DISCUSSION

Structural Analysis of the 3•L-Flanking Region of

P450IA1•\Previously we cloned the human P450IA1 gene

and determined the nucleotide sequence (11), although the

cloned gene did not include the 3•L-flanking region where the

polymorphic Mspl site is located. To identify the MspI site

and analyze the 3•L-flanking region of the gene in this study,

we re-screened a human genomic library and obtained clone

ƒÉthP450mc-2 for analysis. A 4-kbp fragment including the

3•L-flanking region was subcloned in a plasmid vector

(phP450mc-2) and then sequenced. Based on the sequence

information, primers were synthesized, and DNAs of types

A and C were subjected to PCR direct sequencing. Figure 1

Fig. 1. Structural analysis of the 3•L-region

of the P450IA1 gene. (A) Nucleotide sequence of

the 3•L-flanking region of the gene. Clone

A hP450-mc2 was isolated, and subclone

phP450mc-2 was sequenced as described under "EXPERIMENTAL PROCEDURES

." PCR direct

sequencing was carried out using the primers

indicated by broken lines, and the polymorphic

site was identified. The nucleotides with numbers

up to 5985 in the margins were transcribed in

mRNA, and untranscribed nucleotides are num

bered from 1. (B), RFLPs of PCR-amplified

fragments by MspI or SmaI. The primers indicat

ed by solid lines in (A) were used for PCR

amplification, and the PCR products from type A,

B, and C DNAs were digested with MspI or SmaI,

and subjected to agarose gel electrophoresis.

Lanes 1, 2, 3, and M are type A (ml/ml), B (ml/

m2), C (m2/m2), and size markers (SV40-

HindIII), respectively. Allele ml and m2 are

defined by the absence and presence, respective-

ly, of the one MspI site detected by RFLPs in our

previous paper (9).

J. Biochem.

Genetic Polymorphisms of Cytochrome P450IA1 409

Fig. 2. Structural analysis of individual P450IA1 genes by PCR-direct sequencing. The DNA fragments amplified by PCR are shown

by horizontal bars. Closed circles and squares show the primers used for the two opposite directions in PCR. These PCR products were sequenced directly by the chain-termination method using synthesized sequencing primers. Coding sequences are indicated by boxes, and leader and trailer sequences of mRNA are indicated by boxes of lower height. The solid arrow shows the site of MspI polymorphisms, and the open arrow shows that of the novel mutation in the coding region. Solid triangles, the open circle, and the open triangle indicate the locations of the XRE, BTE, and TATA box, respectively.

Fig. 3. Novel polymorphic mutation in the coding region. DNAs from individuals were examined by PCR-direct sequencing. A, B, and C show the results for an Ile/Ile homo-zygote, Ile/Val heterozygote,

and Val/Val homozygote, respectively. The point mutation is indicated by an arrow.

Fig. 4. Comparison of the amino acid sequences of the heme binding region in the P450IA1 subfamily. The heme-binding cysteine residue is designated as 0 (22). The amino acid residues at the polymorphic position are boxed. The sequences of all members of the IA1 subfamily identified so far are shown (23).

summarizes the results. Replacement of thymine in genotype A by cytosine in genotype C was observed at the 264th base downstream from the poly A additional signal, form-ing an MspI or Smal site in genotype C. For confirmation of this observation, PCR-amplification was carried out with DNAs of types A, B, and C as templates and the oligonucleotide primers indicated in Fig. IA. The products were then digested with MspI or Smal. The fragment amplified from type A DNA gave only a single undigested band at the

position of 340 bp, the fragment from C type gave bands of digested DNA at 200 and 140 bp, while the fragment from

B type gave three bands at 340, 200, and 140 by (Fig. 1B).

This procedure was used for determining the genotypes of

P450IA1 in a large number of individuals, and allowed

identification of persons at genetically high risk of lung

cancer.

Amino Acid Replacement in the Heme Binding Region of

P450IA1-For determination of whether there was any

other mutation site in the P450IA1 gene linked with MspI

polymorphism, the regulatory regions in the 5•L-flanking and

coding regions of the P450IA1 gene were sequenced direct-

ly using PCR-amplified fragments from the DNAs of types

A and C (Fig. 2). In the 5•L-flanking region including the XRE

(xenobiotics responsible element) (14), BTE (basic tran

scription element) (15), and TATA box, the DNA sequences

of type A and C were identical.

However, we found a difference of one base at position

4889 in the 7th exon. As shown in Fig. 3, adenine in type A

was replaced by guanine in many individuals of type C. This

novel point mutation resulted in replacement of Ile by Val

at residue 462 in the HR2 region (16), which was well

conserved among P450 families. The cysteine in this region

has been shown to be the heme-binding thiolate ligand (17,

22), and thus this region is essential for the catalytic

activity of this enzyme. As this polymorphic position is

conserved as Ile among members of the IM subfamily (Fig.

4), the substitution of Val may affect the catalytic activity

of this enzyme.

Vol. 110, No. 3, 1991

410 S. Hayashi et al.

Fig. 5. Detection of the point mutation of Ile-Val polymorphism. (A) Scheme of PCR for the detection. Two primers (primer 1 and primer 2a, or primer 1 and primer 2b) were used for each DNA sample, and PCR was performed according to the method described in "EXPERIMENTAL PROCEDURES." The point mutation site is indicated by asterisks. (B) Representative results of PCR to identify Ile-

Val polymorphisms. A, Ile/Ile homozygote; B, Ile/Val heterozygote; C, Val/Val

homozygote. Primers 1 and 2a were used for lane 1 and primers 1 and 2b for lane 2; lane

M, size markers (ƒÉ-EcoRI•EHindIII).

TABLE I. Estimated frequencies of combined genotypes. Numbers of subjects examined are shown in parentheses.

'These frequencies were observed in our previous study (10) .

The RFLP method cannot be used to detect this polymor

phism because there is no suitable restriction site. There-fore, for use in screening, we developed the new detection method described in "EXPERIMENTAL PROCEDURES." The primers used in this method are shown in Fig. 5A. Figure 5B shows the clear profiles that identified the two homo-zygotes (Ile/Ile and Val/Val) and the heterozygote (Ile/ Val). The combination of primers 1 and 2a gave a PCR product of 210 by (lane Al) when (Ile/Ile) was used as a template. The combination of primers 1 and 2b gave no PCR product (lane A2): the product appeared only with the combination of primers 1 and 2b with (Val/Val) as tem-plate (lanes C1, C2), and with both combinations with (Ile/ Val) as template (lanes B1, B2). The results by this new method were completely consistent with those obtained by direct sequencing.

Linkage Analysis in the General Population-We found the DNA polymorphisms at two loci of the P450IA1 gene to be due to point mutations. Therefore, we next investigated the genetic association between these two loci in the general population. In our previous study on MspI polymorphisms, we observed frequencies of 0.443, 0.451, and 0.106 for genotypes A (ml/ml), B (ml/m2), and C (m2/m2), respectively, in 375 healthy controls (10). Our next problem was to determine the frequencies of the genotypes (Ile/Ile), (Ile/Val), and (Val/Val) in the populations with genotypes A, B, and C. To do this, we randomly selected 43

and 47 individuals from among those with genotypes A and B, respectively, and all the 45 individuals with genotype C observed so far. The observed frequencies of (Ile/Ile), (Ile/ Val), and (Val/Val) in the groups of genotypes A, B, and C were then weighted by the frequencies of A, B, and C. The estimated frequencies are shown in Table I as a 3 x 3 genotype table.

From the table we could calculate the respective fre

quencies f(ml; Ile), f(m2; Ile), f(ml; Val), and f(m2; Val) of the genes ml with Ile at residue 462, m2 with Ile, ml with Val, and m2 with Val, assuming the Hardy-Weinberg equilibrium for f (ml; Ile) and f (ml; Val) and also for f (m2; Ile) and f (m2; Val) weighted by the observed gene frequencies f(ml)=0.668 and f(m2)=0.332. The gene frequencies were thus estimated as f(ml; Ile)=0.661,

f(m2; Ile)=0.092, f(ml; Val) =0.007, f(m2; Val)=0.240, f (Ile) =0.753, and f (Val) =0.247. The frequencies expect-ed from Table I agreed well with observed values within a range of 0.03.

Linkage disequilibrium D (24) was then calculated to be 0.158, and the linkage coefficient R described by Hill and Robertson (25) to be 0.778, showing that these two loci are closely linked. To understand the genetic association more clearly, we calculated the occurrence of linkage, i.e., p(ml; Ile)=f(ml; Ile)/f (ml)=0.990, p(ml; Val)=0.010, p(m2; Ile)=0.272, p(m2; Val)=0.723, p(Ile; ml)=f(ml; Ile)/ f(Ile)=0.878, p(Ile; m2)=0.122, p(Val; ml)=0.028, and p(Val; m2)=0.972. These results clearly showed 99% linkage of ml with Ile and 97% linkage of Val with m2.

We analyzed the gene structures of the individuals with different susceptibility to lung cancer identified by Mspl polymorphisms of P450IA1. As a result, a novel polymorphic site was then found in the DNA sequences of these individuals which results in an amino acid mutation in the heme binding region. The existence among human of at least two forms of P450IA1 protein in association with MspI polymorphisms was thus revealed. To clarify this

genetic association, we further assessed the frequencies of the genotypes with a combination of these two polymorphic loci among the general population using the new detection method and showed that these two loci were very closely linked.

Recently the reasons for phenotypes of extensive and

poor metabolism of debrisoquine have been explained by DNA polymorphisms of P450IID at plural polymorphic sites (26, 27). However, there is no previous report of

J. Biochem.

Genetic Polymorphisms of Cytochrome P4501A1 411

analysis of the linkage between multiple polymorphic loci of human P450s in the general population. Such analyses are essential for estimating susceptibilities to drugs and chemical carcinogens, because the susceptibility or risk must be evaluated as the ratio of the genotype frequency in the susceptible subpopulation (or patients) to that in the general population (28). It is, of course, important to characterize the effect of this novel mutation and MspI

polymorphisms on phenotypic expression of P450IA1 at the molecular level. We are now investigating these successive questions, which involve biological assays of the activity of the enzyme with the mutation, influence of MspI polymorphisms on expression, association with arylhydrocarbon hydroxylase activity together with epidemiological analysis of cancer risk.

We are grateful to Dr. Yusaku Tagashira for his support during this study, and Nahomi Shinoda for technical assistance.

REFERENCES

1. Nagao, M. & Sugimura, T. (1978) Annu. Rev. Genet. 12,117-1592. Doll, R. & Peto, R. (1981) The Causes of Cancer, pp. 1256-1260,

Oxford University Press, Oxford, New York3. Heidelberger, C. (1975) Annu. Rev. Biochem. 44, 79-1214. Guengerich, F.P. (1988) Cancer Res. 48, 2946-29545. Conney, A.H. (1982) Cancer Res. 42, 4875-49176. Gonzalez, F.J. (1988) Pharmacol. Rev. 40, 243-2887. Aoyama, T., Gonzalez, F., & Gelboin, H.V. (1989) Mol. Carcinog.

1, 253-2598. McManus, M.E., Burgess, W.M., Veronese, M.E., Huggett, A.,

Quattrochi, L.C., & Tukey, R.H. (1990) Cancer Res. 50, 3367-3376

9. Kawajiri, K., Nakachi, K., Imai, K., Yoshii, A., Shinoda, N., & Watanabe, J. (1990) FEBS Lett. 263, 131-133

10. Kawajiri, K., Nakachi, K., Imai, K., Hayashi, S., &Watanabe, J. (1991) in Proceedings of the 21th International Symposium of the Princes Takamatsu Cancer Fund (Ernster, L., ed.) Japan Sci. Soc. Press, Tokyo/Taylor & Francis, London, in press

11. Kawajiri, K., Watanabe, J., Gotoh, 0., Tagashira, Y., Sogawa,

K., & Fujii-Kuriyama, Y. (1986) Eur, J. Biochem. 159, 219-22512. Sanger, F., Nicklen, S., & Coulson, A.R. (1977) Proc. Natl. Acad.

Sci. U.S.A. 74,5463-546713. Saiki, R.K., Gelfand, D.H., Stoffel, S., Scharf, S.J., Higuchi, R.,

Horn, G.T., Mullis, K.B., & Ehrlich, H.A. (1988) Science 239, 487-491

14. Fujisawa-Sehara, A., Sogawa, K., Yamane, M., & Fujii-Kuriyama, Y. (1987) Nucleic Acids Res. 15, 4179-4191

15. Yanagida, A., Sogawa, K., Yasumoto, K., & Fujii-Kuriyama, Y. (1990) Mol. Cell. Biol. 10, 1470-1475

16. Gotoh, 0., Tagashira, Y., Iizuka, T., &Fujii-Kuriyama, Y. (1983) J. Biochem. 93, 807-817

17. Poulos, T.L., Finzel, B.C., Gunsalus, I.C., Wagner, G.C., & Kraut, J. (1986) J. Biol. Chem. 260, 16122-16130

18. Heilmann, L.J., Sheen, Y.Y., Bigelow, S.W., & Nebert, D.W. (1988) DNA 7, 379-387

19. Okino, S.T., Quattrochi, L.C., Barnes, H.J., Osanto, S., Griffin, K.J., Johnson, E.F., & Tukey, R.H. (1985) Proc. Natl. Acad. Sci. U.S.A. 82, 5310-5314

20. Kimura, S., Gonzalez, F., & Nebert, D.W. (1984) J. Biol. Chem. 259,10705-10713

21. Sogawa, K., Gotoh, 0., Kawajiri, K., & Fujii-Kuriyama, Y. (1984) Proc. Natl. Acad. Sci. U.S.A. 81, 5066-5070

22. Kawajiri, K., Gotoh, O., Sogawa, K., Tagashira, Y., Muramatsu, M., & Fujii.Kuriyama,Y. (1984) Proc. Natl. Acad. Sci. U.S.A. 81,1649-1653

23. Nebert, D.W., Nelson, D.R., Adesnik, M., Coon, M.J., Estabrook, R.W., Gonzalez, F.J., Guengerich, F.P., Gunsalus, I.C., Johnson, E.F., Kemper, B., Levin, W., Phillips, I.R., Sato, R., & Waterman, M.R. (1989) DNA 8, 1-13

24. Vogel, F. & Motulsky, A.G. (1986) in Human Genetics, pp. 162-164, Springer-Verlag, Berlin, Heidelberg, New York, London, Paris, Tokyo

25. Hill, W.G. & Robertson, A. (1968) Theor. Appl. Genet. 38,226-231

26. Kagimoto, M., Heim, M., Kagimoto, K., Zeugin, T., & Meyer, U.A. (1990) J. Biol. Chem. 265, 17209-17214

27. Gough, A.C., Miles, J.S., Spurr, N.K., Moss, J.E., Gaedigk, A., Eichelbaum, M., & Wolf, C.R. (1990) Nature 347, 773-776

28. Khoury, M.J., Adams, M.J., Jr., & Flanders, W.D. (1988) Am. J. Hum. Genet. 42, 89-95

Vol. 110, No. 3, 1991

genetic linkage of lung cancer-associated mspi

Documents