the wistar institute anatomy pa - proceedings of the ... · pdf fileproc. natl. acad. sci. usa...

5
Proc. Natl. Acad. Sci. USA Vol. 84, pp. 2824-2828, May 1987 Genetics MYC oncogene involved in a t(8;22) chromosome translocation is not altered in its putative regulatory regions (Burkitt lymphoma/oncogene deregulation/immunoglobulin gene rearrangement/somatic cell hybrids) LOUISE C. SHOWE*, ROBERT C. A. MOORE, JAN ERIKSON, AND CARLO M. CROCE The Wistar Institute of Anatomy and Biology, Philadelphia, PA 19104 Communicated by Peter C. Nowell, December 18, 1986 ABSTRACT We have cloned the translocation-associated MYC gene from the Burkitt lymphoma cell line (BL2) with a t(8;22) chromosomal translocation and have determined the nucleotide sequence of the first exon and of the 3' and 5' flanking regions, where sequences with putative regulatory functions have been identified. The nucleotide sequence of the 5' flanking region, which contains regions of DNase hypersen- sitivity and binding sites for putative regulatory proteins, is the same as that of the normal MYC. Accordingly, mutations in these regulatory regions are not required for the transcrip- tional deregulation of MYC in the BL2 cell line. The nucleotide sequence of the first exon is similar to that of the normal MYC [Gazin, C., Dupont de Direchin, S., Hampe, A., Masson, J. M., Martin, P., Stehelin, D. & Galibert, F. (1984)EMBO J. 3, 383-387] and has the coding capacity for a 188-residue polypeptide. However, six nucleotide changes that occur in the middle of this reading frame could result in amino acid substitutions. We also have cloned and sequenced the t(8;22) chromosomal breakpoint that is located 10 kilobases 3' of the MYC exon 3 and near the CA3 gene on chromosome 22. Sequences with homology to immunoglobulin joining signals occur close to the breakpoint both on chromosome 8 and 22, providing further evidence that the immunoglobulin joining enzymes may be involved in the recombinations associated with a variety of chromosomal translocations in B and T cells. In most Burkitt lymphomas, the MYC gene on the normal chromosome 8 is transcriptionally silent, while the translo- cation-associated gene is transcriptionally active (1, 2). It has been suggested that this phenomenon of allelic exclusion results from the disruption or mutation of negative regulatory elements within the first exon and/or the 5' flanking region of the translocated gene (3). Some of us have suggested (1, 2) that the deregulation of the MYC gene in Burkitt lymphoma results from the juxtaposition of the MYC locus and an activated immunoglobulin locus. BL2 is a Burkitt lymphoma cell line that carries a t(8;22) chromosomal translocation that involves the MYC gene on chromosome 8 and the immunoglobulin A light chain locus on chromosome 22. No gross alterations in the gene are detected by Southern analysis with probes for introns 1, 2, or 3 (4). The translocation occurs 10 kilobases (kb) 3' of MYC exon 3 (5) and outside of any region implicated in transcriptional regu- lation. The normal MYC allele is transcriptionally silent (1). Transcription from the translocation-associated allele results in steady-state levels of MYC transcripts that are 3-4 times higher than levels in lymphoblastoid controls (6). Nuclease S1 cleavage-protection studies of the MYC transcripts in BL2 cells with probes for exon 1 yield protected fragments of the expected size (2), suggesting that the first exon of BL2 is not highly mutated. Because of the apparent normal structure of this gene, we were interested in determining whether minor changes might have occurred that could be involved in this transcriptional deregulation. MATERIALS AND METHODS Nucleotide Sequences. Nucleotide sequences were deter- mined by the dideoxy chain-termination method of Sanger et al. (7) using phage M13 and the Clontech M13 sequencing primer or synthetic primers derived from the nucleotide sequence. Genomic DNA Libraries. The genomic DNA library used to isolate the 3' breakpoint region was prepared from a complete EcoRI digest of high molecular weight BL2 DNA, and appropriate fractions from sucrose gradients were cloned into the bacteriophage A vector EMBL 4 (8). The genomic library for the isolation of the translocation-associated MYC gene was prepared from a somatic cell hybrid between BL2 and the murine fusion partner NP3, which contained the derivative chromosome 8 but not the normal human allele (4). This library was constructed in the bacteriophage A vector EMBL 3a. Genomic DNA libraries were screened as described (8) with the exception that screenings with 32P-end-labeled with synthetic oligonucleotides had final washes with 0.9 M NaCl/0.09 M sodium citrate, pH 7, at 37°C for 1-2 hr. Probes. The 3' human MYC probes CA 1.7S and pPA1.3SB are derived from regions 2 kb and 12 kb 3' of MYC, respectively (6). The CA probe is an 8-kb genomic probe that includes the immunoglobulin A light chain constant (C) region genes CA and CA3 (9). Ryc7.4 is a 1.1-kb cDNA clone that codes for the human MYC exon 3 and part of exon 2 (10). RESULTS Isolation of the Translocation-Associated MYC Gene. While no rearrangements could be detected within the MYC gene with the Ryc7.4 probe on Southern blots, a rearrangement was detected with the 3' pCA1.7S probe (6) and was found to occur - 10 kb 3' of MYC exon 3 and 19 kb 3' of the P1 and P2 promoter regions (5). A rearrangement was also detected with the pPA1.3SB probe derived from a region 3' of the breakpoint. This rearranged fragment segregated with the 22q- chromosome and indicated that no gross deletion occurred as a result of the translocation (data not shown). Since the distance between the 3' MYC rearrangement and putative regulatory sequences of interest 5' of the gene made it unlikely that both the identifying breakpoint junction and the 5' region of interest would be found in a single bacteri- ophage A genomic clone, it was necessary to ensure that the MYC gene being analyzed was the translocation-associated gene and not the normal gene. Accordingly, a genomic DNA library was constructed from a somatic cell hybrid (17-6) Abbreviations: C, constant; J, joining; V, variable. *To whom reprint requests should be addressed. 2824 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Upload: hacong

Post on 31-Jan-2018

223 views

Category:

Documents


8 download

TRANSCRIPT

Proc. Natl. Acad. Sci. USAVol. 84, pp. 2824-2828, May 1987Genetics

MYC oncogene involved in a t(8;22) chromosome translocation is notaltered in its putative regulatory regions

(Burkitt lymphoma/oncogene deregulation/immunoglobulin gene rearrangement/somatic cell hybrids)

LOUISE C. SHOWE*, ROBERT C. A. MOORE, JAN ERIKSON, AND CARLO M. CROCEThe Wistar Institute of Anatomy and Biology, Philadelphia, PA 19104

Communicated by Peter C. Nowell, December 18, 1986

ABSTRACT We have cloned the translocation-associatedMYC gene from the Burkitt lymphoma cell line (BL2) with at(8;22) chromosomal translocation and have determined thenucleotide sequence of the first exon and of the 3' and 5'flanking regions, where sequences with putative regulatoryfunctions have been identified. The nucleotide sequence of the5' flanking region, which contains regions of DNase hypersen-sitivity and binding sites for putative regulatory proteins, is thesame as that of the normal MYC. Accordingly, mutations inthese regulatory regions are not required for the transcrip-tional deregulation ofMYC in the BL2 cell line. The nucleotidesequence of the first exon is similar to that of the normal MYC[Gazin, C., Dupont de Direchin, S., Hampe, A., Masson,J. M., Martin, P., Stehelin, D. & Galibert, F. (1984)EMBO J.3, 383-387] and has the coding capacity for a 188-residuepolypeptide. However, six nucleotide changes that occur in themiddle of this reading frame could result in amino acidsubstitutions. We also have cloned and sequenced the t(8;22)chromosomal breakpoint that is located 10 kilobases 3' of theMYC exon 3 and near the CA3 gene on chromosome 22.Sequences with homology to immunoglobulin joining signalsoccur close to the breakpoint both on chromosome 8 and 22,providing further evidence that the immunoglobulin joiningenzymes may be involved in the recombinations associated witha variety of chromosomal translocations in B and T cells.

In most Burkitt lymphomas, the MYC gene on the normalchromosome 8 is transcriptionally silent, while the translo-cation-associated gene is transcriptionally active (1, 2). It hasbeen suggested that this phenomenon of allelic exclusionresults from the disruption or mutation of negative regulatoryelements within the first exon and/or the 5' flanking region ofthe translocated gene (3). Some of us have suggested (1, 2)that the deregulation of the MYC gene in Burkitt lymphomaresults from the juxtaposition of the MYC locus and anactivated immunoglobulin locus.BL2 is a Burkitt lymphoma cell line that carries a t(8;22)

chromosomal translocation that involves the MYC gene onchromosome 8 and the immunoglobulin A light chain locus onchromosome 22. No gross alterations in the gene are detectedby Southern analysis with probes for introns 1, 2, or 3 (4). Thetranslocation occurs 10 kilobases (kb) 3' of MYC exon 3 (5)and outside of any region implicated in transcriptional regu-lation. The normal MYC allele is transcriptionally silent (1).Transcription from the translocation-associated allele resultsin steady-state levels ofMYC transcripts that are 3-4 timeshigher than levels in lymphoblastoid controls (6). NucleaseS1 cleavage-protection studies of the MYC transcripts in BL2cells with probes for exon 1 yield protected fragments of theexpected size (2), suggesting that the first exon of BL2 is nothighly mutated. Because of the apparent normal structure of

this gene, we were interested in determining whether minorchanges might have occurred that could be involved in thistranscriptional deregulation.

MATERIALS AND METHODSNucleotide Sequences. Nucleotide sequences were deter-

mined by the dideoxy chain-termination method of Sanger etal. (7) using phage M13 and the Clontech M13 sequencingprimer or synthetic primers derived from the nucleotidesequence.Genomic DNA Libraries. The genomic DNA library used to

isolate the 3' breakpoint region was prepared from a completeEcoRI digest of high molecular weight BL2 DNA, andappropriate fractions from sucrose gradients were cloned intothe bacteriophage A vector EMBL 4 (8). The genomic libraryfor the isolation of the translocation-associated MYC genewas prepared from a somatic cell hybrid between BL2 and themurine fusion partner NP3, which contained the derivativechromosome 8 but not the normal human allele (4). Thislibrary was constructed in the bacteriophage A vector EMBL3a.Genomic DNA libraries were screened as described (8)

with the exception that screenings with 32P-end-labeled withsynthetic oligonucleotides had final washes with 0.9 MNaCl/0.09 M sodium citrate, pH 7, at 37°C for 1-2 hr.

Probes. The 3' humanMYC probes CA 1.7S and pPA1.3SBare derived from regions 2 kb and 12 kb 3' of MYC,respectively (6). The CA probe is an 8-kb genomic probe thatincludes the immunoglobulin A light chain constant (C) regiongenes CA and CA3 (9). Ryc7.4 is a 1.1-kb cDNA clone thatcodes for the human MYC exon 3 and part of exon 2 (10).

RESULTSIsolation of the Translocation-Associated MYC Gene. While

no rearrangements could be detected within the MYC genewith the Ryc7.4 probe on Southern blots, a rearrangementwas detected with the 3' pCA1.7S probe (6) and was found tooccur - 10 kb 3' ofMYC exon 3 and 19 kb 3' of the P1 and P2promoter regions (5). A rearrangement was also detectedwith the pPA1.3SB probe derived from a region 3' of thebreakpoint. This rearranged fragment segregated with the22q- chromosome and indicated that no gross deletionoccurred as a result of the translocation (data not shown).Since the distance between the 3' MYC rearrangement andputative regulatory sequences of interest 5' of the gene madeit unlikely that both the identifying breakpoint junction andthe 5' region of interest would be found in a single bacteri-ophage A genomic clone, it was necessary to ensure that theMYC gene being analyzed was the translocation-associatedgene and not the normal gene. Accordingly, a genomic DNAlibrary was constructed from a somatic cell hybrid (17-6)

Abbreviations: C, constant; J, joining; V, variable.*To whom reprint requests should be addressed.

2824

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "advertisement"in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Proc. Natl. Acad. Sci. USA 84 (1987) 2825

derived from the fusion of the BL2 cell line and the murineplasmacytoma cell line NP3 (4). The 17-6 hybrid has beencharacterized cytogenetically and by Southern blotting andcontains the derivative but not the normal human chromo-some 8 (4).High molecular weight DNA was first digested to comple-

tion with the restriction enzyme BamHI. This digests themurine c-myc gene to a size that is too small (5.6 kb) to beefficiently cloned into the EMBL 3a phage vector, whereasthe BamHI restriction fragment derived from human MYCgene is too large (28 kb) to be cloned in this vector. This largeBamHI restriction fragment was subsequently subjected to alimited digestion with the restriction enzyme Sau3A to trimthe fragment to a clonable size. Six MYC-positive cloneswere isolated from 100,000 recombinants, and one of these,BL2R, which extended sufficiently far 5' to encompassputative regulatory regions (3, 11, 12), was subcloned fornucleotide sequencing (Fig. 1). The nucleotide sequence (Fig.2) was determined for the entire first exon and part of intron1 as well as 1.65 kb 5' of exon 1.

Nucleotide Sequence of the 5' Flanking Region. The nucle-otide sequence for the 1.65 kb of DNA 5' of exon 1 iscompared in Fig. 2 to that of a normal MYC gene (13, 14).Three nucleotide differences occur between nucleotides870-874, and additional changes occur at 1965 and 2134.However, these changes are in agreement with the nucleotidesequence for 870-874 as determined by Battey et al. (15) andfor 1965 and 2134 as determined by Bentley and Groudine(14). Therefore, these differences most likely representsequencing differences or polymorphic changes rather thanmutations. This 1.65-kb region ofDNA contains three of thefive major DNase 1-hypersensitive sites that have beenidentified in the MYC locus and two sequences that arehomologous to nuclear factor 1 protein binding sites,-all ofwhich have been suggested to regulate MYC gene expression(3). The sequence through the MYC major promoters P1 andP2 (14) as well as the recently described minor promoterregion P0 (14) is also normal. Thus, the exclusive expressionof the translocation-associated MYC gene in BL2 cells is notdue to the alteration (as a result of somatic mutation) oftranscriptional regulatory sequences that are located withinthis region and that might regulate transcription from eitherP0, P1, or P2.

Nucleotide Sequence of MYC Exon 1. The nucleotide se-quence of the first exon (Fig. 2) extends from nucleotides2325 to 2880 as numbered by Gazin et al. (13). The regionwithin exon 1 to which DNase hypersensitive site II12 hasbeen mapped is identical to the reference sequence. The extranucleotidfes in the first exon sequence described by Watt etal. (10, 16) that introduce stop codons in all three readingframes are not present in the BL2 MYC sequence, and thecoding capacity for a 188-amino acid peptide, which has beensuggested by Gazin et al. (13), is intact in this MYC gene. Sixnucleotide changes, which cluster in a 166-base-pair (bp)region within the middle of exon 1, would each result in anamino acid substitution. A 32-kDa polypeptide has now beenidentified in cell extracts by using antisera prepared againstseveral synthetic peptides deduced by the translation of this

5-

A

B

0

H P

1000 2000cwvvu Xh Pv u'K P PV pi I p2 S

648

*4 * 4 * 4

500 bp

open reading frame (17), although it is not yet known whetherthe 188-amino acid polypeptide has any biological function.The phenomenon of MYC truncation as a result of trans-

location and the observed accumulation of somatic mutationsthat frequently surround the first exon of translocated geneshas focused on possible implications of these alterations fortranscriptional regulation. The 3' end of exon 1 also has beenimplicated in the regulation of MYC gene expression; inparticular, the region around the Pvu II site (Fig. 2) seems tobe altered in many samples derived from the endemic AfricanBurkitt lymphomas (18, 19). There are no alterations in the200 nucleotides to either side of this Pvu II site in the BL2MYC gene. The region 200 bp from the Pvu II site into the 5'end of intron 1 (in particular the thymidine runs T7, T4, T7)has been suggested to regulate MYC RNA at the level oftranscription (11, 18). There are five single nucleotidechanges within the subsequent 370 bp of intron 1, some ofwhich may represent polymorphic changes or sequencingdifferences. This is also the region where the breakpoints forseveral sporadic Burkitt lymphomas occur. No regulatoryfunction has been demonstrated for this region specifically,although there are three putative binding sites (CCCGCC) forthe Spl transcription factor that are unaffected by the fivenucleotide changes. Thus, mutation of the Pvu II region,which has been implicated by others in the deregulation ofMYC in the endemic Burkitt lymphomas, does not appear tobe required for the deregulation seen in BL2.The BL2 Breakpoint. The 18-kb normal and the 12-kb

rearranged EcoRI restriction fragments from the 3' flankingregion of MYC were isolated from the BL2 genomic libraryin EMBL 4. The restriction maps are shown in Fig. 3 A andC. The maps diverge in the region of a Kpn I site located 3 kbfrom the 3' end of the rearranged EcoRI fragment. Asexpected, this region cross-hybridized with the 8.0-kb C,genomic probe (Fig. 3D). The restriction map for the 3' endof the 8.0-kb CA EcoRI fragment (Fig. 3D) is identical to thatfor the 3' end of the BL2 breakpoint clone (Fig. 3C). The twomaps diverge within the region of a previously mapped X lightchain joining (J) region gene JA (6, 9).

Recognition Signals Associated with the Breakpoint. Thefrequent association of MYC translocations with regionsinvolved in immunoglobulin gene rearrangements [J or vari-able (V) regions] or immunoglobulin class-switching [heavychain switch regions] suggests that mechanisms involved inimmunoglobulin gene rearrangements may also take part inthe recombinatorial events associated with chromosomaltranslocations in B- and T-cell neoplasias. Evidence tosupport this hypothesis has come from the identification ofnucleotide sequences with homology to the heptamer-nonamer signal sequences, which are utilized in immuno-globulin gene rearrangements, close to chromosomal break-points within the nonimmunoglobulin loci involved in chro-mosomal translocations in B- and T-cell neoplasias (20-22).The nucleotide sequence through the breakpoint regions on

chromosomes 8 and 22 and the sequence through the sameregion on the normal chromosome 8 are shown in Fig. 4. Thebreakpoint sequence diverges from the normal chromosome

FIG. 1. Sequencing strategy forMYC exon3510 31l and flanking regions. (A) The HindIII-Xba IXb restriction fragment (H-Xb) from the translo-

I cation-associated gene was cloned into M133507 mpl8 and -19. The black bar denotes exon 1.

H,HindIII;P,PstI;Pv,PvuII; K, KpnI; S,Sau3A; Xb, Xba I; pl and p2, promoters. (B)The nucleotide sequence was determined ei-ther by primer extension from synthetic oli-gonucleotides on the cloned H-Xb fragmentor from subcloned restriction fragments ofH-Xb.

I

Genetics: Showe et al.

_ 3000

2826 Genetics: Showe et al. Proc. Natl. Acad. Sci. USA 84 (1987)

myc 648 CCCCTTTCCCCCGAATTGTTTTCTCTTTTGGAGGTGGTGGAGGGAGAGMMAGTTTACTTMMTGCCTTTGGGTGAGGGACCAAGGATGAGMGMTGTTTTTTGTTTTb12 1

myc 758 TCATGCCGTGGMTAACACMAATAMMATCCCGAGGGAATATACATTATATATTMMTATAGATCATTTCAGGGAGCMACAAATCATGTGTGGGGCTGGGCAACTAGb12 111

myc 868 CTGATG-CGAAGCGTMATAMATGTGAATACACGTTTGCGGGTTACATACAGTGCACTTTCACTAGTATTCAGA TTGTGAGTCAGTGMCTAGGMMTTMTGCCb12 221 X A T

myc 978 TGGAAGGCAGCCAAATTTTAATTAGCTCMGACTCCCCCCCCCCCCCAMAAAAGGCACGGMGTMTACTCCTCTCCTCTTCTTTGATCAGMTCGATGCATTTTTTGTb12 331 X

myc 1088 GCATGACCGCATTTCCAATMTAAAAGGGGAMGAGGACCTGGMAGGMTTMACGTCCGGTTTGTCCGGGGAGGAMGAGTTAACGGTTTTTTTCACMGGGTCTCTGbl2 440

myc 1198 CTGACTCCCCCGGCTCGGTCCACAAGCTCTCCACTTGCCCCTTTTAGGMGTCCGGTCCCGCGGTTCGGGTACCCCCTGCCCCTCCCATATTCTCCCGTCTAGCACCTTTb12 550

myc 1308 GATTTCTCCCAMCCCGGCAGCCCGAGACTGTTGCMMCCGGCGCCACAGGGCGCMMGGGGATTTGTCTCTTCTGMACCTGGCTGAGAMTTGGGMCTCCGTGTGGGb12 660

myc 1418 AGGCGTGGGGGTGGGACGGTGGGGTACAGACTGGCAGAGAGCAGGCMCCTCCCTCTCGCCCTAGCCCAGCTCTGGMCAGGCAGACACATCTCAGGGCTMMCAGACGCb12 770

myc 1528 CTCCCGCACGGGGCCCCACGGMGCCTGAGCAGGCGGGGCAGGAGGGGCGGTATCTGCTGCTTTGGCAGCMATTGGGGGACTCAGTCTGGGTGGMGGTATCCMTCCAb12 880

myc 1638 GATAGCTGTGCATACATAATGCATAATACATGACTCCCCCCAACMMTGCMTGGGAGTTTATTCATAACGCGCTCTCCAAGTATACGTGGCMTGCGTTGCTGGGTTATb12 990

myc 1748 TTTMTCATTCTAGGCATCGTTTTCCTCCTTATGCCTCTATCATTCCTCCCTATCTACACTMCATCCCACGCTCTGMCGCGCGCCCATTAATACCCTTCTTTCCTCCAb12 1100

myc 1858 CTCTCCCTGGGACTCTTGATCMAGCGCGGCCCTTTCCCCAGCCTTAGCGAGGCGCCCTGCAGCCTGGTACGCGCGTGGCGTGGCGGTGGGCGCGCAGTGCGTTCTCTGTb12 1210 G

myc 1968 GTGGAGGGCAGCTGTTCCGCCTGCGATGATTTATACTCACAGGACAAGGATGCGGTTTGTCAAACAGTACTGCTACGGAGGAGCAGCAGAGAAAGGGAGAGGGTTTGAGAbi2 1320

* ***** . 1111Imyc 2078 GGGAGCAAMGMMATGGTAGGCGCGCGTAGTTAATTCATGCGGCTCTCTTACTCTCTTTACATCCTAGAGCTAGAGTGCTCGGCTGCCCGGCTGAGTCTCCTCCCCACCb12 1430 G

myc 2188 TTCCCCACCCTCCCCACCCTCCCCATAAGCGCCCCTCCCGGGTTCCCAAAGCAGAGGGCGTGGGGGAMAGAAAAAAGATCCTCTCTCGCTMTCTCCGCCCACCGGCCCb12 1540

P1*-- >.

m 2298 TTTATA ATG CGA GGG TCT GGA CGG CTG AGG ACC CCC GAG CTG TGC TGC TCG CGG CCG CCA CCG CCG GGC CCC GGC CGT CCC TGG CTC CCCMet Arg Gly Ser Gly Arg Leu Arg Thr Pro Glu Leu Cys Cys Ser Arg Pro Pro Pro Pro Gly Pro Gly Arg Pro Trp Leu Pro

b 1650

m 2388 TCC TGC CTC GAG AAG GGC AGG GCT TCT CAG AGG C T GGC GGG AAA MG MC GGA GGG AGG GAT CGC GCT GAG TAT AM AGC CGG TTT TCGSer Cys Leu Glu Lys Gly Arg Ala Ser Gln Arg Leu Gly Gly Lys Lys Asn Gly Gly Arg Asp Arg Ala Glu Tyr Lys Ser Arg Phe Ser

b 1740P2-.-

m 2478 GGG CTT TAT CTA ACT CGC TGT AGT AAT TCC AGC GAG AGG CAG AGG GAG CGA GCG GGC GGC CGG CTA GGG TGG AAG AGC CGG GCG AGC AGAGly Leu Tyr Leu Thr Arg Cys Ser Asn Ser Ser Glu Arg Gln Arg Glu Arg Ala Gly Gly Arg Leu Gly Trp Lys Ser Arg Ala Ser Arg

b 1830

m 2568 GCT GCG CTG CGG GCG TCC TGG GAA GGG AGA TCC GGA GCG MT AGG GGG CTT CGC CTC TGG CCC AGC CCT CCC GCT GAT CCC CCA GCC AGCAla Ala Leu Arg Ala Ser Trp Glu Gly Arg Ser Gly Ala Asn Arg Gly Leu Arg Leu Trp Pro Ser Pro Pro Ala Asp Pro Pro Ala Ser

b 1920 T ATrp Thr

m 2658 GGT CCG CM CCC TTG CCG CAT CCA CGA AAC TTT GCC CAT AGC AGC GGG CGG GCA CTT TGC ACT GGA ACT TAC MC ACC CGA GCA AGG ACGGly Pro Gln Pro Leu Pro His Pro Arg Asn Phe Ala His Ser Ser Gly Arg Ala Leu Cys Thr Gly Thr Tyr Asn Thr Arg Ala Arg Thrb 2010 T T A A

Ser Val Thr Lys

m 2748 CGA CTC TCC CGA CGC GGG GAG GCT ATT CTG CCC ATT TGG GGA CAC TTC CCC GCC GCT GCC AGG ACC CGC TTC TCT GM AGG CTC TCC TTGArg Leu Ser Arg Arg Gly Glu Ala Ile Leu Pro Ile Trp Gly His Phe Pro Ala Ala Ala Arg Thr Arg Phe Ser Glu Arg Leu Ser Leub 2100PvuII

m 2838 CAG CTG CTT AGA CGC TGG ATT TTT TTC GGG TAG TGGAMACCAGGTAAGCACCGAAGTCCACTTGCCTTTTMTTTATTTTTTTATCACTTTAATGCTGAGln Leu Leu Arg Arg Trp Ile Phe Phe Gly ---

b 2190

myc 2938 GATGAGTCGMTGCCTAMTAGGGTGTCTTTTCTCCCATTCCTGCGCTATTGACACTTTTCTCAGAGTAGTTATGGTMCTGGGGCTGGGGTGGGGGGTMTCCAGMCTb12 2290 T

myc 3048 GGATCGGGGTMMGTGACTTGTCMGATGGGAGAGGAGMGGCAGAGGGMMCGGGMTGGTTTTTMGACTACCCTTTCGAGATTTCTGCCTTATGMTATATTCACGb12 2400 C

myc 3158 CTGACTCCCGGCCGGTCGGACATTCCTGCTTTATTGTGTTAATTGCTCTCTGGGTTTTGGGGGGCTGGGGGTTGCTTTGCGGTGGGCAGAMGCCCCTTGCATCCTGAGCbl2 2510 C

myc 3268 TCCTTGGAGTAGGGACCGCATATCGCCTGTGTGAGCCAGATCGCTCCGCAGCCGCTGACTTGTCCCCGTCTCCGGGAGGGCATTTMMTTTCGGCTCACCGCATTTCTGAb12 2620

myc 3378 CAGCCGGAGACGGACACTGCGGCGCGTCCCGCCCGCCTGTCCCCGCGGCGATTCCMCCCGCCCTGATCCTTTTMGMGTTGGCATTTGGCTTTTTMMMGCMTAATb12 2730 G A

myc 3488 ACAATTTAAAACCTGGGTCTCTAGAGGTGTTAGGACGTGGTGTTGGGTAGGCGb12 2840

FIG. 2. Nucleotide sequence of MYC exon 1 and flanking regions. The top line is the sequence of Gazin et al. (14) and begins 648 bp fromthe 5' end of that sequence. The lower line is the BL2 sequence and only regions of sequence differences are indicated. DNase I-hypersensitivesites are indicated by Roman numerals, and putative nuclear factor I binding sites are boxed. The exon 1 reading frame is that proposed by Gazinet al. (14), and changes in the proposed BL2 sequence are indicated. The Pvu II site is indicated at the end of exon 1.

8 sequence around position 448. The boxed sequence 30 bp of seven match with the consensus sequence for a heptamer3' of the breakpoint on the normal chromosome 8 has a five recombination signal. A heptamer with a six of seven match

Proc. Natl. Acad. Sci. USA 84 (1987) 2827

5' SBg BH E 3

Bg q~g iEI SI I K|Bg B lSI BCh. 8

1 2 3 probe . probe

pCA1.7S J . pPA1.3SB

BCh.8K " * - nBB Ch.~

C K H B B Bg B ECI I I Ch.22

-40.O. *-4-

Ch. 22

D ,g B BI *

B Bg B EI

3.3 kbLWI

FIG. 3. Restriction map of the BL2 breakpoint.(A) Restriction map of the normal MYC gene and 3'flanking region. The position of 3' MYC probes is

0.3 kb indicated; the breakpoint occurs within the insertshown in B. (B) Expanded restriction map of theBL2 breakpoint region on the normal chromosome8. (C) Restriction map of the BL2 breakpoint clone.(D) Restriction map of the normal chromosome 22region involved in the translocation. B, BamHI; Bg,Bgi II; E, EcoRI; K, Kpn I; H, HindIII; S, Sst I.Arrows indicate regions sequenced; asterisks indi-cate breakpoint positions; and the solid boxes areexons 1, 2, and 3.

occurs 12 bp 3' of the breakpoint within the sequence derived twice is present 100 bp from the breakpoint on chromosomefrom chromosome 22. In addition, we find the GAGG or 8. The significance of this repeat is not known.CCTC tetranucleotide occurring several times 5' of the The matched sequence 161 bp 3' of the breakpoint isbreakpoint on chromosome 8 and 3' of the breakpoint on identical to the published sequences for a JA3 gene (25, 26).chromosome 22. This tetranucleotide is frequently found in However, the sequence 5' of the J gene is not consistent withclose association with the chromosomal breakpoints that the JA 5' flanking region (data not shown) and lacks both thehave been characterized in mouse plasmacytomas (23) and heptamer and nonamer signal sequences, which should beoccurs with a significant frequency in both the MYC locus present 5' of an unrearranged JA gene, suggesting a V-Jand the murine and human immunoglobulin loci (24). A rearrangement has occurred. An open reading frame (Fig. 4)nine-nucleotide sequence, GACCTCAGG, that is repeated that extends from the breakpoint and continues in frame

A

ch .8ch .8 GGTACCTAGTCAGACCCAATTACACGGTTGAATAT TGCAGTCGGTGTGCTCAGGTGCTGAGATCCATGGCCAGAAAGAGTCTGGTGATAGATTTATGCCCGGGTAGT

jct. GGTACCTAGTCAGACCC CTTCACGTCGTTTTGAGCGTGTGCTCAGGTGCTGAGATCCATGGCCAGAAAGAGTCTGGTGATAGATTTATGCCCC GGGTAGT

cht.8 l TTGTACCTGCCTGGTTGCTTGAGGGTTAAAATGGGTTAGMTATAGGATGATGACTCAGAGCTGGAGACGCCTTAGCATTCGCTTTCCAGATG TGGCCCTGGAGMGA

ct . 5l5l TTGTACCTGCCTGGTTGCTTGACMGTAGTGGGTGTAAAATATAGGATGATGACTCAGAGCTGGAAGACCTATAGCATTCGCATTTCCAGATGT GCCCCTGGAGGA

....

ch.8 221 CGCATCTTCCTGAGATCCCTCTACTGGCTTTTGGCTGAATTAGCTTATGACCTCAGGGCCCTGGTTGCATCCTACTGATTATC CGCAAGTAAGACCTCAGGTCTTCT

jct. 221 CGCAATCTTCCTGAGATCCCTCTACTGGCTTTTGGCTGMTTAGCTTATGACCTCAGGGCCCTGGTTGCATCCTMCTGATTATCMGCMGTMGACCTCAGGTCTTCT

ch .8 331 GAGATGCAGGATGCTTACTTGCCTTACTACTMAGATGMAGCTTTCAMACTCACCCCGTTTCCMAGTCCATGCCAGGTCTTATGCCCACACCCAGACTGGCCCAGTCMACA...22..CC T.

jct. 331 GAGATGCAGGATGCTTACTTGCCTTACTACTMAGATGMAGCTTTCMAACTCACCCCGTTTCCMTCGCATTAGCAACAATGCATMA

ch.8 441 AATATTTATCGGGCACCTCCGATGACAAAGGAGGCACCCGTGTGGMGGMCAGATTCC CAGTCTAGTAGAGAAGAGTGGCATGATACMGTATTTACATACATMTT

jct. 441 GATGATCCTGMGATGCTTAACAGTGATGGGCTTCCCAGCTTCTGGGGGAGCTGAGCGCTACCTCACCATCTCCA

ch.8 551 ACCTMMACAGATGCMMAGGCACMMAGTAGTGGGGTGAGGMAGTACAGAGTATGTGGGGGCATCTGACAGCCTAGGAGTTCCAGGMAGTTTCT

jct. 551 GCCTCCAGTCTGAGGATGAGGCTGACTATTACTGTCAGACCTGGGGCACTGGCATTCMAGGGATGTTCGGCGGAGGGACCMAGCTGACCGTCCTAGGTGAGTCTCTTCTC

ckh.22 ........TGTTCGGCGGAGGGACCMAGCTGACCGTCCTAGGTGAGTCTCTTCTC

jct. 661 CCCTCTCCTTCCCCGCTCTCGGGACATTCTGCTGTTT

ch .22 CCCTCTCCTTCCCCGCTCTTGGGACAATTTCTGTTGT

Bjct. 448 TTG ATG MAG CTT MAC AGT GAT GGC AGC CAC AGC MAG GGG GAC

Leu Met Lys Leu Asn Ser Asp Gly Ser His Ser Lys Gly Asp

Ala Pro Leu Leu Val Ile Tyr Gly Arg Asn Asn Arg Pro Ser

V>jct. 490 GGG ATC CCT GAT CGC TTC TCA GGC TCC AGC TCT GGG GCT GAG

Gly Ile Pro Asp Arg Phe Ser Gly Ser Ser Ser Gly Ala Glu* * * * * * * * * * * *

Gly LIle Pro Asp Arg Phe Ser Giy Ser Ser Ser Giy His Thr

jct. 532 CGC TAC CTC ACC ATC TCC AGC CTC CAG TCT GAG GAT GAG GCTArg Tyr Leu Thr Ile Ser Ser Leu Gin Ser Glu Asp Glu Ala

Ala Ser Leu Thr Ile Thr Gly Ala Gin Ala Glu Asp Glu Ala

* . ~~~J>jct. 574 GAC TAT TAC TGT CAG ACC TGG GGC ACT GGC ATT CM GGG ATG

Asp Tyr Tyr Cys Gin Thr Trp Gly Thr Gly Ile Gin Gly Met

Asp Tyr Tyr Cys Asn Ser Arg Asp Ser Ser Gly Val Leu ---

jct. 616 TTC GGC GGA GGG ACC MG CTG ACC GTC CTA GGT GAG TCT CTTPhe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly* * * * * * * * * * *

Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly

jct. 658 CTC CCC TCT CCT TCC CCG CTC TCG GGA CAT TCT GCT GTT TT

FIG. 4. Nucleotide sequence through theBL2 breakpoint. (A) The sequence for the nor-mal MYC (lines ch. 8) is compared to that of thebreakpoint clone (lines Jct) and the normalchromosome 22 (lines ch. 22) for the 3' end ofthe sequence. (B) The deduced amino acidsequence from the region 3' of the chromosome8/chromosome 22 junction. V>, beginning ofthe sequence that is homologous to the V14 genefamily; J>, beginning of the J13 sequence. As-terisks denote homology; dots above the nucle-otide sequence indicate 10 bp; and the dashedline indicates a deletion.

Genetics: Showe et al.

Proc. Natl. Acad. Sci. USA 84 (1987)

through the J region is highly homologous to the nucleotidesequence of the VX4 gene for 120 nucleotides 5' of J,suggesting this is actually the case. Although the readingframe continues up to the breakpoint, the deduced amino acidsequence for the 5' end of this region is not homologous tothat for any known VA gene family. The evidence that thissequence does not derive from chromosome 8 is based on thelack of hybridization of a synthetic oligonucleotide derivedfrom this region to any of our phage X clones, which extend20 kb 3' and 5' of the breakpoint on chromosome 8. We haveisolated genomic clones that contain internal restrictionfragments that hybridize with a synthetic oligonucleotidederived from the divergent sequence but not with an oligo-nucleotide derived from the sequence 20 bp further 3' that ishomologous to V4 (data not shown). Thus, the 43 bp betweenchromosome 8 and VA are not contiguous on chromosome 22.

DISCUSSIONMYC Deregulation. Since all of the regions that have been

suggested to play a role in the regulation of expression of thehuman MYC gene are normal in the BL2 gene, we cannotaccount for the exclusive expression of the translocation-associated gene as a consequence of the mutation of any ofthese regions. The only changes that do occur are clusteredin a 166-bp region within exon 1, which has not beenimplicated in transcriptional regulation. We cannot eliminatethe possibility that these changes contribute to the highsteady-state levels ofMYC transcripts by increasing messagestability. However, increased stability of MYC transcriptshas only been demonstrated for messages derived from eithertruncated genes (18) or genes in which there are extensivealterations within the first exon (27). Furthermore, whilealtered message stability could explain the high steady-statelevels of MYC RNA, it does not explain the exclusivetranscription of the translocation-associated MYC gene inmost Burkitt lymphomas. Since the nucleotide sequencethrough the putative regulatory regions within exon 1 as wellas within the 5' and 3' flanking regions are normal, this showsthat deregulation of MYC transcription does not requiresequence alterations within these regions. Although thepossibility remains that additional transcriptional regulatoryregions may exist outside of the region we sequenced, wewould suggest that the dominant effect on MYC transcriptionresults from the de novo association with an activatedimmunoglobulin locus. This would require that the influenceof the associated immunoglobulin locus be dominant over thenormal mechanisms for MYC regulation. In the case of BL2cells, the influence of the immunoglobulin locus must beexerted over a distance of =20 kb, which separates theimmunoglobulin locus and the putative 5' regulatory ele-ments.Mechanisms Involved in Translocation. Although we find

regions with homology to J signals on chromosome 8, therecombination within a V gene is not consistent with aconventional immunoglobulin gene rearrangement, whichrequires the presence of both heptamer and nonamer signalswith the appropriate 1 turn-2 turn spacing. However, singleheptamer sequences have now been shown to occur close toregions of illegitimate recombinations involving the immu-noglobulin loci in a variety of examples, including the murine(28) and human (29) K light chain-deleting elements. Recom-binations have also been demonstrated within the heavychain locus, where a V-to-V rearrangement has been pro-posed to occur via a single heptamer signal within the V genecoding sequence, resulting in a chimeric VH gene (30, 31).This suggests that a certain flexibility exists in the joiningreactions that had not been considered previously. Anexamination of published VA, nucleotide sequences revealsthe presence of multiple sequences with homologies to the

heptamer consensus sequence within the 5' ends of thesegenes (25, 26, 32). The presence of multiple heptamers withinthese sequences suggests that similar recombinations mightalso occur within VA genes. The rearrangement within the VAgene illustrated in Fig. 4 could possibly result from this typeof recombination.We thank Marina Hoffman for editing, The Wistar Institute

editorial stafffor preparing the manuscript, and Andrew Caton for hishelp in the analysis of breakpoint sequence and careful reading of themanuscript. This work was supported by National Institutes ofHealth Grant CA39860 (to C.M.C.) and American Cancer SocietyGrant CD-274 (to L.C.S.).

1. ar-Rushdi, A., Nishikura, K., Erickson, I., Watt, R., Rovera, G. &Croce, C. M. (1983) Science 222, 390-393.

2. Nishikura, K., ar-Rushdi, A., Erikson, J., Watt, R., Rovera, G. &Croce, C. M. (1983) Proc. Natl. Acad. Sci. USA 40, 4822-4826.

3. Siebenlist, U., Hennighausen, L., Battey, J. & Leder, P. (1984)Cell 37, 381-391.

4. Croce, C. M., Thierfelder, W., Erikson, J., Nishikura, K., Finan,J., Lenoir, G. M. & Nowell, P. C. (1983) Proc. Natl. Acad. Sci.USA 80, 6922-6926.

5. Erikson, J., Finger, L., Sun, L., ar-Rushdi, A., Nishikura, K.,Minowada, J., Finan, J., Emanuel, B. S., Nowell, P. C. & Croce,C. M. (1986) Science 232, 884-886.

6. Sun, L. K., Showe, L. C. & Croce, C. M. (1986) Nucleic AcidsRes. 14, 4037-4050.

7. Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. Natl. Acad.Sci. USA 74, 5463-5467.

8. Showe, L. C., Ballantine, M., Nishikura, K., Erikson, J., Kaji, H.& Croce, C. M. (1985) Mol. Cell. Biol. 5, 501-509.

9. McBride, D. W., Heiter, P. A., Hollis, G. F., Swan, D., Otey,M. C. & Leder, P. (1982) J. Exp. Med. 155, 1480-1490.

10. Watt, R., Stanton, L. W., Marcu, K. B., Gallo, R. C., Croce,C. M. & Rovera, G. (1983) Nature (London) 303, 725-728.

11. Chung, J., Sinn, E., Reed, R. & Leder, P. (1986) Proc. NatI. Acad.Sci. USA 83, 7918-7922.

12. Taub, R., Kirsch, I., Morton, C., Lenoir, G., Swan, D., Fronich,S., Aaronson, S. & Leder, P. (1982) Proc. Natl. Acad. Sci. USA 79,7837-7841.

13. Gazin, C., Dupont de Direchin, S., Hampe, A., Masson, J. M.,Martin, P., Stehelin, D. & Galibert, F. (1984) EMBO J. 3, 383-387.

14. Bentley, D. L. & Groudine, M. (1986) Mol. Cell. Biol. 6,3481-3489.

15. Battey, J., Moulding, C., Taub, R., Murphy, W., Stewart, T.,Potter, H., Lenoir, G. & Leder, P. (1983) Cell 34, 779-787.

16. Watt, R., Nishikura, K., Sorrentino, J., ar-Rushdi, A., Croce,C. M. & Rovera, G. (1983) Proc. Natl. Acad. Sci. USA 80,6307-6311.

17. Gazin, C., Rigolet, M., Briand, J. P., Van Regenmortel, M. H. V.& Galibert, F. (1986) EMBO J. 5, 2241-2250.

18. Bentley, D. L. & Groudine, M. (1986) Nature (London) 321,702-706.

19. Pelicci, P. G., Knowles, D. M., II, Magrath, I. & Dalla Favera, R.(1986) Proc. Natl. Acad. Sci. USA 83, 2984-2988.

20. Tsujimoto, Y., Jaffe, E., Cossman, J., Gorham, J., Nowell, P. C. &Croce, C. M. (1985) Nature (London) 315, 340-343.

21. Haluska, F. G., Finver, S., Tsujimoto, Y. & Croce, C. M. (1986)Nature (London) 324, 158-161.

22. Finger, L. R., Harvey, R., Moore, R. C. A., Showe, L. C. &Croce, C. M. (1986) Science 234, 982-985.

23. Picolli, S. P., Carini, P. G. & Cole, M. D. (1984) Nature (L>ndon)310, 327-330.

24. Showe, L. C. & Croce, C. M. (1986) Cancer Rev. 2, 18-23.25. Denny, C. T., Hollis, G., Magrath, I. T. & Kirsch, I. R. (1985)

Mol. Cell. Biol. 5, 3199-3202.26. Sun, L. H. K., Croce, C. M. & Showe, L. C. (1985) Nucleic Acids

Res. 13, 4921-4934.27. Piechaczyk, M., Yang, J. Q., Blanchard, J. M., Jeanteur, P. &

Marcu, K. B. (1985) Cell 42, 489-597.28. Durdik, J., Moore, M. W. & Selsing, E. (1984) Nature (London)

307, 749-752.29. Siminovitch, K. A., Bakshi, A., Goldman, P. & Korsmeyer, S.

(1985) Nature (London) 316, 260-262.30. Klienfield, R., Hardy, R. R., Tarlinton, D., Dang, J., Herzenberg,

L. A. & Weigert, M. (1986) Nature (London) 322, 843-846.31. Reth, M., Gehrmann, P., Petrac, E. & Wiese, P. (1986) Nature

(London) 322, 840-842.32. Anderson, M. L. M., Szajnert, M. F., Kaplan, J. C., McColl, L. &

Young, B. D. (1984) Nucleic Acids Res. 12, 6647-6661.

2828 Genetics: Showe et al.