high-resolution physicalmapping alu-hybridization ... · t317, which contains 3pter-p21 as its sole...

6
Proc. Natl. Acad. Sci. USA Vol. 93, pp. 4474-4479, April 1996 Genetics High-resolution physical mapping by combined Alu-hybridization/ PCR screening: Construction of a yeast artificial chromosome map covering 31 centimorgans in 3p21-pl4 (Alu-PCR/chromosome 3) HIROYUKI ABURATANI*, VINCENT P. STANTON, JR., AND DAVID E. HOUSMAN Center for Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139 Contributed by David E. Housman, November 29, 1995 ABSTRACT We describe an integrated approach to large- scale physical mapping using an Alu-PCR hybridization screening strategy in conjunction with direct PCR-based screening to construct a continuous yeast artificial chromo- some map covering >20 mb in human chromosome 3, bands pl4-p21, composed of 205 loci, connected by 480 yeast arti- ficial chromosomes, with average interlocus distance of 100 kb. We observe an inverse distribution of Alu-PCR and (CA)n markers. These results suggest that the two screening methods may be complementary and demonstrate the utility of Alu- PCR hybridization screening in the closure of high-resolution human physical maps. The predominant method for physical mapping of human chromosomes is based on the use of loci termed sequence tagged sites (STSs) to identify specific large-insert clones in libraries prepared from high-capacity DNA vectors such as yeast artificial chromosomes (YACs) through PCR-based screening of DNAs prepared from hierarchically pooled DNA samples from the library (1-3). Identification of overlapping sets of YACs by unique STSs allows the STSs to be ordered along chromosomes. STSs that detect polymorphic loci can be ordered on a genetic map, providing a basis for ordering contiguous groups of overlapping YAC clones (contigs). This approach has been used to assemble continuous overlapping YAC sets (YAC contigs) spanning most of chromosome arms 21q (3) and 22q (4) and the euchromatic portion of the Y chromosome (2). Higher density genetic and radiation hybrid maps promise to simplify the assembly of physical maps by providing a more extensive scaffolding for ordering YAC groups. Several non-STS-based methods have been developed for large-scale mapping, in an attempt to reduce the cost and effort involved. Principal among these are YAC fingerprinting by hybridization of medium copy repeat elements to digested YAC DNA (5) and hybridization of YAC Alu-PCR products to Alu-PCR products from pools of YACs (6). The high incidence of chimeric YACs in available libraries prevents sole use of either method in map construction. Rather, data produced by these methods is mainly useful in the context of more certain data produced by STS content mapping. We describe here the application of an Alu-PCR hybridiza- tion approach to physical mapping, in which individual Alu- PCR products are used as loci. This approach, combined with STS content mapping, was used to produce a continuous YAC map in 3p21-pl4 that links 205 loci and spans >20 mb with an average interlocus distance of -100 kb. The application of bothAlu-PCR product hybridization and STS content mapping to the same genomic region allowed comparison of the two methods. MATERIALS AND METHODS Cell Lines. A9(Neo 3/t) is a human-mouse hybrid cell line monochromosomal for human 3 (provided by M. Oshimura, Tottori University). HA54, DM-1, and DG(2)3 are irradiation- reduced human-hamster hybrids carrying portions of human chromosome 3 as their sole human content. HA54 contains 3pl4 and 3p25 (H.A. and D.E.H., unpublished results). DM1 and DG(2)3 (provided by S. Naylor, University of Texas, San Antonio) carry several discontinuous segments of 3p. T317, a human-hamster hybrid provided by W. E. C. Bradley (Hopital Notre-Dame, Montreal), contains 3pter-p21 as its sole human chromosome 3 content (7). YAC Library Screening. Centre d'Etude Polymorphisme Humain (CEPH) plates 1-550 (8) were screened by PCR analysis of hierarchically pooled YACs or by hybridization of individual Alu-PCR probes to Alu-PCR products of the YAC pools (9). CEPH plates 709-964, containing "megaYACs" (average insert -1 mb) (6, 10) were screened mainly by hybridization ofAlu-PCR probes toAlu-PCR products of YAC pools. YAC pool DNAs were provided by T. Hudson and S. Foote (Whitehead-Massachusetts Institute of Technology Center for Genome Research). Alu-PCR. Alu-PCR products (11) were amplified from cos- mid and YAC DNA templates (annealing temperature, 58°C) using four separate Alu-PCR reactions with primersAle 1,Alu S/J, Ale 3, andAlu END (9, 12). The latter three primers were also used to selectively amplify humanAlu-PCR products from human-rodent somatic cell hybrid DNAs (annealing temper- ature, 64°C).Alu-vectorette PCR products (12) were produced with Alu S/AluJ. L1-PCR was performed as described by Ledbetter et al. (13). Verification of YAC Probe Content. For STS verification miniprep DNA (14) was prepared from individual YACs for PCR analysis. Alu-PCR probes were verified by hybridization to dot-blotted Alu PCR products amplified from miniprep DNA of individual YACs. YAC End Isolation and STS Development. STSs were de- veloped using DNA sequences from the ends of YAC or cosmid inserts amplified by vectorette PCR (15). Ends map- ping to 3pl4 (see below) were cycle sequenced (Promega femtomole kit) with vector primers 1207 or 1208 (15). STSs were also made from exons andAlu or LI PCR products, which were cloned before sequencing. All sequences were screened against GenBank using BLAST (16) to eliminate repetitive sequences before designing oligonucleotide primers. RESULTS AND DISCUSSION Production of 3p14 Probes. To isolate YAC clones spanning 3pl4 we collected entry probes from a variety of sources (Table Abbreviations: YAC, yeast artificial chromosome; STS, sequence tagged site; contig, contiguous group of overlapping clones; CEPH, Centre d'Etude Polymorphisme Humain; cM, centimorgan. *Present address: 3rd Department of Internal Medicine, University of Tokyo, Hongo, Tokyo 113, Japan. 4474 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Upload: buikhanh

Post on 06-Jul-2019

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: High-resolution physicalmapping Alu-hybridization ... · T317, which contains 3pter-p21 as its sole human chromosome3 content (7), wasusedto assign HA54-positive probes to either

Proc. Natl. Acad. Sci. USAVol. 93, pp. 4474-4479, April 1996Genetics

High-resolution physical mapping by combined Alu-hybridization/PCR screening: Construction of a yeast artificial chromosomemap covering 31 centimorgans in 3p21-pl4

(Alu-PCR/chromosome 3)

HIROYUKI ABURATANI*, VINCENT P. STANTON, JR., AND DAVID E. HOUSMANCenter for Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139

Contributed by David E. Housman, November 29, 1995

ABSTRACT We describe an integrated approach to large-scale physical mapping using an Alu-PCR hybridizationscreening strategy in conjunction with direct PCR-basedscreening to construct a continuous yeast artificial chromo-some map covering >20 mb in human chromosome 3, bandspl4-p21, composed of 205 loci, connected by 480 yeast arti-ficial chromosomes, with average interlocus distance of 100kb. We observe an inverse distribution ofAlu-PCR and (CA)nmarkers. These results suggest that the two screening methodsmay be complementary and demonstrate the utility of Alu-PCR hybridization screening in the closure of high-resolutionhuman physical maps.

The predominant method for physical mapping of humanchromosomes is based on the use of loci termed sequencetagged sites (STSs) to identify specific large-insert clones inlibraries prepared from high-capacity DNA vectors such asyeast artificial chromosomes (YACs) through PCR-basedscreening of DNAs prepared from hierarchically pooled DNAsamples from the library (1-3). Identification of overlappingsets of YACs by unique STSs allows the STSs to be orderedalong chromosomes. STSs that detect polymorphic loci can beordered on a genetic map, providing a basis for orderingcontiguous groups of overlapping YAC clones (contigs). Thisapproach has been used to assemble continuous overlappingYAC sets (YAC contigs) spanning most of chromosome arms21q (3) and 22q (4) and the euchromatic portion of the Ychromosome (2). Higher density genetic and radiation hybridmaps promise to simplify the assembly of physical maps byproviding a more extensive scaffolding for ordering YACgroups.

Several non-STS-based methods have been developed forlarge-scale mapping, in an attempt to reduce the cost andeffort involved. Principal among these are YAC fingerprintingby hybridization of medium copy repeat elements to digestedYAC DNA (5) and hybridization of YAC Alu-PCR productsto Alu-PCR products from pools of YACs (6). The highincidence of chimeric YACs in available libraries prevents soleuse of either method in map construction. Rather, dataproduced by these methods is mainly useful in the context ofmore certain data produced by STS content mapping.We describe here the application of an Alu-PCR hybridiza-

tion approach to physical mapping, in which individual Alu-PCR products are used as loci. This approach, combined withSTS content mapping, was used to produce a continuous YACmap in 3p21-pl4 that links 205 loci and spans >20 mb with anaverage interlocus distance of -100 kb. The application ofbothAlu-PCR product hybridization and STS content mappingto the same genomic region allowed comparison of the twomethods.

MATERIALS AND METHODSCell Lines. A9(Neo 3/t) is a human-mouse hybrid cell line

monochromosomal for human 3 (provided by M. Oshimura,Tottori University). HA54, DM-1, and DG(2)3 are irradiation-reduced human-hamster hybrids carrying portions of humanchromosome 3 as their sole human content. HA54 contains3pl4 and 3p25 (H.A. and D.E.H., unpublished results). DM1and DG(2)3 (provided by S. Naylor, University of Texas, SanAntonio) carry several discontinuous segments of 3p. T317, ahuman-hamster hybrid provided by W. E. C. Bradley (HopitalNotre-Dame, Montreal), contains 3pter-p21 as its sole humanchromosome 3 content (7).YAC Library Screening. Centre d'Etude Polymorphisme

Humain (CEPH) plates 1-550 (8) were screened by PCRanalysis of hierarchically pooled YACs or by hybridization ofindividual Alu-PCR probes to Alu-PCR products of the YACpools (9). CEPH plates 709-964, containing "megaYACs"(average insert -1 mb) (6, 10) were screened mainly byhybridization ofAlu-PCR probes toAlu-PCR products ofYACpools. YAC pool DNAs were provided by T. Hudson and S.Foote (Whitehead-Massachusetts Institute of TechnologyCenter for Genome Research).Alu-PCR. Alu-PCR products (11) were amplified from cos-

mid and YAC DNA templates (annealing temperature, 58°C)using four separateAlu-PCR reactions with primersAle 1,AluS/J, Ale 3, andAlu END (9, 12). The latter three primers werealso used to selectively amplify humanAlu-PCR products fromhuman-rodent somatic cell hybrid DNAs (annealing temper-ature, 64°C).Alu-vectorette PCR products (12) were producedwith Alu S/AluJ. L1-PCR was performed as described byLedbetter et al. (13).

Verification of YAC Probe Content. For STS verificationminiprep DNA (14) was prepared from individual YACs forPCR analysis. Alu-PCR probes were verified by hybridizationto dot-blotted Alu PCR products amplified from miniprepDNA of individual YACs.YAC End Isolation and STS Development. STSs were de-

veloped using DNA sequences from the ends of YAC orcosmid inserts amplified by vectorette PCR (15). Ends map-ping to 3pl4 (see below) were cycle sequenced (Promegafemtomole kit) with vector primers 1207 or 1208 (15). STSswere also made from exons andAlu or LI PCR products, whichwere cloned before sequencing. All sequences were screenedagainst GenBank using BLAST (16) to eliminate repetitivesequences before designing oligonucleotide primers.

RESULTS AND DISCUSSIONProduction of3p14 Probes. To isolate YAC clones spanning

3pl4 we collected entry probes from a variety of sources (Table

Abbreviations: YAC, yeast artificial chromosome; STS, sequencetagged site; contig, contiguous group of overlapping clones; CEPH,Centre d'Etude Polymorphisme Humain; cM, centimorgan.*Present address: 3rd Department of Internal Medicine, University ofTokyo, Hongo, Tokyo 113, Japan.

4474

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "advertisement" inaccordance with 18 U.S.C. §1734 solely to indicate this fact.

Page 2: High-resolution physicalmapping Alu-hybridization ... · T317, which contains 3pter-p21 as its sole human chromosome3 content (7), wasusedto assign HA54-positive probes to either

Proc. Natl. Acad. Sci. USA 93 (1996) 4475

1). Starting from many sites should facilitate early coalescenceof large contigs and reduce the downstream effort required forextension of contigs and gap closure. Subsequent contig ex-tension was accomplished by developing further probes (STSsor Alu-PCR products) from YACs at contig ends. Altogether205 probes were used in this study. They fall into two heter-ogeneous groups: (i) 92 STSs used for PCR-based screeningand (ii) 113 Alu-PCR or L1-PCR products used for hybridiza-tion-based screening.STS Probes. Fifty-eight of the 92 STSs are new. They were

developed from genes, putative exons, cosmid ends, and YACends, as well as Alu, LI (13), and Alu-vectorette (12) PCRproducts. STSs were generated from three genes: PTPy (18),WNT5A (19), and MITF (20). Cosmids used for STS produc-tion came from libraries made from somatic-cell hybrids DM-1and HA54 and from a chromosome 3-specific library (21).Putative exons were spliced from six cosmids: cCI825, cCI912,cCI948, cCI1416, cCI1430, and D3S1149 (17). YAC ends wereisolated in the course of contig extension from YACs at contigboundaries (see below) using vectorette PCR (15).

Thirty STSs were polymorphic (CA)n repeats used in theCEPH/Genethon genetic map (22) or by Weber's group (23).They tether our physical map to the CEPH-Genethon geneticmap.Alu-PCR Probes. One hundred and sixAlu-PCR hybridiza-

tion probes were derived from somatic-cell hybrids [DG(2)3,DM-1, and HA54], cosmids [DM-1, HA54, and chromosome3 (21) cosmid libraries], or 3pl4 YACs. Templates wereamplified in three or four separate PCRs, each with one Aluprimer or primer set: Alu S/J,Ale 1,Ale 3, orAlu END (9, 12).The use of several Alu primers increased the diversity ofAlu-PCR products. AllAlu-PCR probes were treated as loci inthe sense that single Alu-PCR products were used for hybrid-ization screening of the YAC library. By avoiding the use ofcomplex mixtures of Alu-PCR products (e.g., from a wholeYAC) identification of chimeric YAC insert was simplified: theset of YACs detected by a single Alu-PCR product generallyeither fit in the contig or not.

Library Screening. All probes, both STSs and Alu-PCRproducts, were screened against the CEPH I YAC library,which contains approximately seven genomes of human insert(8). An average of 5.7 YACs was detected per STS. Only 2 of92 STSs failed to detect at least one YAC address: D3S1300and B2-A89. Alu-PCR hybridization screening of the CEPH Ilibrary yielded an average of 7.2 YACs per locus. Sixty-fourpercent of Alu hybridization screens produced one or morereadable YAC addresses. In the contigs that emerged, STS andAlu-PCR loci are interspersed at high density (Fig. 1), dem-

Table 1. Origin of loci

Screening method

Source STS Hybridization Total

Cosmids 17 7 24(CA), markers 30 0 30Trinucleotide STSs 1 0 1Anonymous 2 0 2 87Genes, ESTs 4 0 4 Entry probesL1-PCR products 1 0 1 Entry besAlu-PCR productsFrom hybrids 6 19 25From YACs 2 87 891

YAC insert ends 32 0 32 Walking probesTotals 92 113 205

Probes used to make the 3p14 physical map sorted by source DNA,YAC screening method, and use (entry probes vs. walking probes).Four of the six hybrid-derived STSs are Alu-vectorette PCR products(12). The 17 cosmid-derived STSs include six putative exons (17). Twoother cosmid derived STSs contained short tandem repeats and aregrouped with the 32 (CA), repeats. EST, expressed sequence tags.

onstrating sufficient correspondence in YAC address detec-tion between the two methods to allow contig assembly.YAC Contig Extension. YAC contig extension in the CEPH

I library was accomplished by converting YAC ends at themargins of contigs into STSs and screening YAC libraries orby isolating individual Alu-PCR products from YACs at theedges of contigs and using them to screen YAC libraries byhybridization. In general, hybridization was a faster method forcontig extension. Overall 89 YAC-derived Alu-PCR productsand 32 YAC ends were used successfully in contig extension.Whether PCR or hybridization screening was used, it was

desirable to avoid probes from non-3p14 segments of chimericYACs. Accordingly, YAC end fragments were localized to3pl4 before STS conversion using somatic-cell hybrids HA54and T317. HA54 carries 3pl4 and 3p25 as its sole humancontent. T317, which contains 3pter-p21 as its sole humanchromosome 3 content (7), was used to assign HA54-positiveprobes to either the 3p25 (if positive in T317) or 3pl4 (ifnegative in T317) segment of HA54. We chose the 3pl4segment of HA54 as our target region for YAC coverage.To eliminate non-3p14 Alu-PCR products from consider-

ation as probes, Alu-PCR products from YACs with overlap-ping inserts were analyzed in adjacent wells of agarose gels.Alu-PCR products identical in size and amplified from two ormore overlapping YACs were considered likely to be the samemolecule and therefore likely (given the random origin of thenon-3p14 content of chimeric YACs) to arise from 3pl4 YACinsert.YAC Contig Assembly. After screening the CEPH I library

10 contigs ranging in length from 1 to 5 mb were obtained. Thecontigs were ordered and oriented based on the location of(CA), loci from the CEPH-Genethon genetic map (22). Thefinal nine gaps were closed by screening the CEPH megaYAClibrary, which contains human inserts over twice as long asthose in the CEPH I library (6). Selected Alu-PCR productswere used to screen the CEPH megaYAC library by hybrid-ization to port the CEPH I library contigs to the CEPHmegaYAC library. To fill the gaps between megaYAC contigs,contig extension was done by either YAC end isolation orAlu-PCR hybridization. The two last gaps were closed byD3S2400 and by data from Whitehead Institute/Massachu-setts Institute of Technology (MIT) anonymous DNA markersD3S2478 and D3S2483. The final contig spans the p14 segmentof HA54, which by fluorescence in situ hybridization is esti-mated to be >20 mb (M. Haas and D. Ward, personalcommunication). The outermost genetic markers, D3S1581(telomeric) and D3S1598 (centromeric), subtend 31 centimor-gans (cM) in the CEPH genetic map (22). The contig consistsof 480 YAC clones: 291 from the CEPH1 and 189 from themegaYAC library. One hundred fifty four of 205 DNA mark-ers are ordered unambiguously. The average interlocus dis-tance is 100 kb.

Integration with Other Mapping Data. MegaYAC screen-ing data for most of the (CA)n repeat loci was obtained fromCEPH/Genethon (http://www.cephb.fr/ceph-genethonmap.html) and Whitehead/MIT Genome Center (http://www.genome.wi.mit.edu/) data bases. These data were of consid-erable use in assembling our contig. Full information for theloci that make up this map is available at the University ofTexas at San Antonio Chromosome 3 World Wide Web site(http://mars.uthscsa.edu/DB/).

Distribution ofAlu Markers vs. (CA)n Repeat Markers. Thelocation of somatic-cell hybrid-derived Alu-PCR entry probesand (CA)n repeat probes in the 3pl4 contig is shown in Fig. 2.Both the Alu-derived entry probes (circles, column E) and the(CA)n repeat STSs (column G) are unevenly distributed, withAlu clustering more pronounced. The two classes of probesappear inversely distributed, particularly when contigs derivedfrom data produced by each method alone in the CEPH Ilibrary are compared (Fig. 2, columns F and H). Column F

Genetics: Aburatani et al.

Page 3: High-resolution physicalmapping Alu-hybridization ... · T317, which contains 3pter-p21 as its sole human chromosome3 content (7), wasusedto assign HA54-positive probes to either

4476 Genetics: Aburatani et al.

I I I I I I I

Proc. Natl. Acad. Sci. USA 93 (1996)

I I I I I

£* N

o 0o0)C

L 0 LL C) Cl)i 0otV 00

CD3

0

cmCOCO)a

CY

0)

CD

v-

Cl)')a

CM

w0)r-

79C1196G1299C721D3325F3 267H

18H4 2346 401E6 287H12 534H(7,8)213C1 243

_ .515H. IL1326E12356E1

9A1 513E4__476G 519(E.F)12

00) N 0 CuLOU C - coCO:in(D 1- o

o C U) C U)

co

4t:0)

S cq

N

CM

coN cc3 )MC,M C0

(0

D0)CY

C:)090g8 2~4g210 293D1 141H8

157F5 5 C,9 325591910F_ 181H6 1j3i137 H9 357E10 53

-J.eL ,m_7 2 142(1.F)1 206B35A4 _ 163A 24I5.8 346H9 471H4

459(A.C.D)12 47A4294,A12) 25C7296F.,12)

37801 90H12372F6496D11

393(B.D08o 'C 0

0 .c

CO coC - 0M CsCj~~~ ~~0)cM<)-_ C t_C

0

-.

<cCO)CO0

COC)

a: a c!)00 000

U)

0)0

00)

Ott)

0

CM _ - Ca 0) 2 a

0o.

0

L-W aOD C)Ss C%

It U. T-_ <n in ooa > lte

(cD

4LO

258S7 2j,.4178A3 4,.

189C7

435

r-

COC)

1-_ I

_ a

420A4

c8 872 238 Q83E1

282D3A280B856H7 37.23

419E5 384H6

358A12

CD

0)

Cu(0

0Cl)CD0)<o

o>

0

2

808B10 929A9 8s5 , 757G3 89202760A5 9424 (925j928 9=9 9J191 90AJ2

94483 891g&7 933H(3.4.9) 9574 771 1

757.758H10 798 10 811F3.10 96199,j86907 791E8 872H9.10 971D 9

87&7 88110 6,6 .........932H2

J4a6 9=E4 8_=9 792D9 .. 951 F89gm10 7=,6 7604 ... 871D(11.12} 7 11653E 0 9148 933A9 923,511)6_9__G8 S__ 801.A2 ..._ 9g -A4

77_A5 925C9789, ^i^-r__ 7754D9 9j381.792C 764(F.G)4 737A2

957A5 82101,4 748943010

-co co oocMI I IIc It- mh hIsIS N8SS^8 iCY CM Y c- o O<o°

»o^ z m O Y YQ n ICQCT-QcMso o C Am_ _ _-.-N -.OD r-. z O _ a C;o a Oa u) mNo_Cla roco NCy CS Cy

769G.

8J,7

I 1 1 I

?00C1*

CMN

C)C')0a

0

I-

U)

o6

0_O

4-

ICY

*- S < 6, . 0m WN C) O) CO N-

420A4 95D9 12_2 238B11 34_1 17D2 , 130H11_447F7 3 7_J4,7 H461C1 54_2 _ 13m8G _ 134F11 28"G6

194H1 1 542H 131413 01 144vG3 479F121131H10 546C5 169C7 (449,492D.E)(4,11,12) A272F2 349C5 44C10

9F1 1751H2 5j,_ 2j3_ 5 343B4 371H4 182A6 249L3238H10 203A45,9) =,A(3,10) 3=79,10) 4027 430C2 143C5 425C883E1 994B10 206A,5.9) 218B9 1.A8 3§=9.10) 518 247 476D.G)782D3 388H12 26,A_G)8 64B9 397A11 403B10 166G8 2512 526(.G9

75H2 31 4A,G)7 498,8 27H1 24A5 494A2 183C8237B7 36F9 28B3 186A4161 524A6234G6463E4 264A9 40163

1=9 512FS 29SF4 5,10446Ei7 :376E8 364E3 46 .B2166D5

U)

cu

0a

03

811F127Gajiu

P-

Go _CO

r-0 Nm w

0Q N

91OD6GARAA

762E1184611 ...

927E1714.3 770E682_32 879H10842E6 6,7,8

--807A(3.10)

9.24B12...0.2

706C6

CO

0

I

1-OD

o0o

U)Cu0o

C-CO

w

CDrND

0)0

Cu

1-CDN0)

858A8 770B4

2A5U. 926G 11746E8 965C47614 85687

734F___ 964A1=H.4. . 951Ho10

801AB 774C1267_03 904F 78402.10

7._H8 ,764(B8.9)82,9 743B3

927E 1 849E7

FIG. 1. (Figure continues on the opposite page.)

I

u)

_1

C(

W

811, 8j.D3

999956A12 843E3

9_67E9

N

0

CMC4S

Page 4: High-resolution physicalmapping Alu-hybridization ... · T317, which contains 3pter-p21 as its sole human chromosome3 content (7), wasusedto assign HA54-positive probes to either

Genetics: Aburatani et al. Proc. Natl. Acad. Sci. USA 93 (1996) 4477

I11 I I I I I II III IEIIII I IIIII

· - C,- tL

coc')~ *0)M 111 *<»*'-^-UJ^-<-<OCM^ CQQ CDQ 'tCMCMT-co'^rQ cMCM^'^-^-Q-'-j u m<Co Q C CM

250sP4FA^Lfe 4()5m 4~_87 3J_418D6 65E7 457__E(.5_.f 5C4 402F2223A5

4OD,..m..2 307B9 .130F6 195A7

543_9 459F2 1_6 2--'- 229F11 219D5~~ l~~~~~~j*. 4~~~~32H,10

__4___̂ £ 171 ---1 44__6 4^ -- 2c!2_ 21B4--- s_40 --7

443(A,C)8 .. ___1__________0F_B91,,~A,C)8 JJ8iA 7 ,11sg812 _. __ 0E_..21F4

.,, ___145B1 - 4__5 . .'188052F5 359Fll --

_2..MG 6 4118 237E3 253811 542.4_013 473E4 2_81

-339F2

291810 141_..... .

____________1_ 1 l 5 161E0 CO ,C - C. U)

Cif)~~~~~~~~CMI? CD CM0Ta TC o Wc

1 : </ cWoC) co

cS W (0 0 UCCOU) V Co V - CO) CD Co Nc

CD Ul.t-(ID -0e e0 ~ a - y CO CD c j 0 0 COa0) t _j- O - CO )-

0 o0 o) o)CO CM W QI

88o^8 747H19 939E10,64_̂.&E4 737E4 94,1F1

98c4 711E6 (4A928.93037S3 ,3 ,61_12 95 E7

750~F1 9_4_0.H_8 805010_ 8_1_o.r;.3_

754158.91 - 45452L2...4 2129A-CS 8IA7 9BI2IO7746 961F12

I I IIII I I I I I I

'5 N

_C,._0.A 543{E.023 252C 20 4 4010

4,, B347H..327G24 125B4 339F25

2' §* D ?. . . ...N..2.

OZ..jOQ0CQWflQf O<<u.'-u0Q(o4^S<Z 35 SS2

O~~~~~~~~~31__11 . 16 31 4 3159 75H6 263E1 200G7 7 432_?,__LilA5 420H^^g22 ^^g8^ 404E8 ....4DJ021252195A7 ,^2^ *89F J^j438 292A10 394A12 1J8667^^^

^

£2205 17765 14^12 5g^2 161C2 ___0S1

A^57 347"9 327612 ^S 12584 5J00C2 5^.11

---5C· 25 11 4n1 0 3858 78A535085163F1 1BF·0H , A280AF9409B3 222B6 18103 21!}Hl 1 65H3.4

277H5 418F4 135H1 66A1

.~,a.F2 364C4 26C10 63G2 12888;1141 371G8 43607 328B5

0G3_ ?212E9 370612

o ,- I t ' c' ' CMO~~~~~~~hI'- I', r r »-cO

:It: :1~ ~~ ~ 0 ~ ~ ~~lbI:t : CO 0 U aDNo coN N in

0 00 N NU UJ0

. CO OlbJ -" CMV 0' -"

0

lb lb lb lblb0 lb N N ~~~~~~~~~~~~~~~~~ 0 0 lb 0~~~~ CObN)0C~

N N N 00-0.-o o a) 0

.C.0~co 0) N 0 N co

CD CD CD 00 0)C' S ^ "0M CO CO0 0

LE10 87oE- 7.....15..58_~6p41F -- 1------- ^^- ^BS____7735592205 748 8__2

^___9568_E7^__ 92 5C3..__ 923El 771F,,,9 957FJ2810E3-------------------- 810A3 94866____7E9F

827C2 74L11 9A2 872A10 681183_34 ..67912~1?75fAf7.8l ....... ----876F4 6._._J1 7_25^ 9g_^.. ..9771A1 0

805F12 906F6 74578010} 717C OP11927F12 --9...48--- _________ S6&^ (727.729)C(31j), 70510 6_.34

~913A2 ~755F1 7M 948E9 77 , 3800F8 ~~ ~~~~~829E6 ·········- 7310~5

.............F................._ 81710F19 l 277.,bC5819C4

FIG. 1. YAC map of 3p21-pl4. Two hundred and five markers that detect 480 YACs are ordered in 154 bins. All 92 STS markers are markedwith a solid rectangle at top. YACs are divided in two groups: a CEPH I group above and a CEPH megaYAC group below. Markers used to screenthe CEPH I library are shown above the CEPH I YACs. Markers used to screen the megaYAC library are shown above the megaYACs. A subsetof the markers used to screen the megaYAC library (those aligned toward the CEPH I YACs) were also used to screen the CEPH I YACs. Markersused only to screen the megaYAC library are aligned toward the megaYACs. Stippled lines indicate probable internal deletions in YACs.Parenthetical YAC coordinates indicate ambiguity or contamination. Proven nonchimeric insert ends are marked by black circles.

Page 5: High-resolution physicalmapping Alu-hybridization ... · T317, which contains 3pter-p21 as its sole human chromosome3 content (7), wasusedto assign HA54-positive probes to either

Proc. Natl. Acad. Sci. USA 93 (1996)

Genetic Map CEPH YACsA B C DE F G H J K

1 cM -S1578 *1

-S1289 0

-S1613, -S1236 11* SS1582 IM

-S 1606 0 m

-S1295-S1592 -S1514 a

-S1067-S1766 mm

I-S1313 ysiSiSM g

-S1234 O 3i-S1300 O

-S131239 |

-81312 0

-S1600

-S1287

-S 1285 * _00-

**

-S1228 * IN

-S1217S1233 0

-S1261-S 1 296,

S1566 -S659,-S1562 S1210

-S1598

II.o

m {

SE 0

I

I

i

Mega YACsL M

IIFIG. 2. Assembly of YAC contigs and distribution of probes. Column

A, the 3p21.1-p14 segment of hybrid HA54 with two internal deletionsshown lightly shaded. Column B, genetic linkage map by Genethon (22).Column C, genetic map by others (23). The vertical scale is geneticdistance. Columns D-K, results with CEPH I library. Column D, non-Alu,non-(CA), entry probes (empty circles). Column E, Alu-related entryprobes, shown by filled circles (e). Column F, YAC contigs that could beassembled solely from Alu-related entry probes (black bars). Column G,(CA), markers (filled circles). Column H, contigs that could be assembledsolely from (CA), markers (gray bars). Column J, contigs that could beassembled from both (CA),, markers and Alu-related entry probes.Column K, CEPH I YAC contigs after contig extension. Column L,megaYAC contigs that could be assembled solely from (CA)n markers.Column M, megaYAC contig after integration with CEPH I data andfurther contig extension.

shows the 15 contigs that could be assembled with the datafromAlu entry markers alone. Column H shows the 15 contigs

that could be assembled solely with YACs detected by (CA)nmarkers. Column J shows the 16 contigs that can be assembledfrom a combined probe collection. Only 2 of the 15 (CA)n-derived contigs lie entirely within Alu-derived contigs. Theother 13 (CA),-derived contigs extend (10 contigs) or lieentirely outside (3 contigs) the Alu-derived contigs.Both Alu elements and (CA)n repeats are unevenly distrib-

uted along chromosomes. Alu elements are reported to beconcentrated in R bands (24), whereas (CA)n repeats aresparse in subtelomeric regions (25). The distribution of Alusand (CA)n repeats at the submegabase scale has not beeninvestigated in detail. Fig. 2 shows the distribution of randommembers of these two classes of elements over a -31-cMsegment. The inverse distribution of the two classes of probessuggests that closure of YAC contigs derived primarily from(CA)n markers may be facilitated by use ofAlu-PCR product-derived markers. The success of Alu-PCR hybridization in3pl4, a Giemsa dark band comparatively underpopulated withAlu repeats (24) suggests the approach is sufficiently durableto work in virtually any genomic region (9). Of equal interest,although an inverse distribution of Alu repeats and (CA)nrepeats is observed at the submegabase level, from the per-spective of a genetic map Alu and (CA)n markers are exten-sively interspersed throughout 3p14. The conversion of Alu-based polymorphisms into genetic markers would thus bepredicted to lead to an integrated genetic map in whichAlu-based markers would alternate with (CA)n markers at adensity well below 1 cM.

Note Added in Proof. Two other high resolution physical maps of the3p14 region are described in refs. 26 and 27.

We acknowledge the contributions of the CEPH/Genethon groupsled by J. Weissenbach, I. Chumakov, and D. Cohen, whose YAClibraries and genetic map were essential for our work. We thank H.Kanki, K. Ohira, J. Takeyama, C. Guydan, J. Bernadini, S. Ontiveros,J. M. Andresen, J. Warren, B. Freeman, M. Bulyk, and A. Bany fortechnical assistance. We also thank K. Yamakawa and Y. Nakamura(cosmids), D. J. Munroe (Alu-bubble PCR), and M. Haas and D. Ward(fluorescence in situ hybridization). This work was supported byNational Institutes of Health Grants HG 00299 and CA 17575 toD.E.H.

1. Green, E. D. & Olson, M. V. (1990) Proc. Natl. Acad. Sci. USA87, 1213-1217.

2. Foote, S., Vollrath, D., Hilton, A. & Page, D. C. (1992) Science258, 60-66.

3. Chumakov, I., Rigault, P., Guillou, S., Ougen, P., Billaut, A., etal. (1992) Nature (London) 359, 380-387.

4. Bell, C., Budarf, M. L., Nieuwenhuijsen, B. W., Barnoski, B. L.,Buetow, K. H., et al. (1995) Hum. Mol. Genet. 4, 59-69.

5. Bellanne-Chantelot, C., Lacroix, B., Ougen, P., Billaut, A.,Beaufils, S., et al. (1992) Cell 70, 1059-1068.

6. Cohen, D., Chumakov, I. & Weissenbach, J. (1993) Nature(London) 366, 698-701.

7. Dobrovic, A., Houle, B., Belouchi, A. & Bradley, W. E. (1988)Cancer Res. 48, 682-685.

8. Albertsen, H. M., Abderrahim, H., Cann, H. M., Dausset, J., LePaslier, D. & Cohen, D. (1990) Proc. Natl. Acad. Sci. USA 87,4256-4260.

9. Liu, J., Stanton, V. P., Jr., Fujiwara, T. M., Wang, J. X., Re-zonzew, R., Crumley, M. J., Morgan, K., Gros, P., Housman, D.& Schurr, E. (1995) Genomics 26, 178-191.

10. Dausset, J., Ougen, P., Abderrahim, H., Billault, A., Sambucy,J. L., Cohen, D. & Le Paslier, D. (1992) Behring Inst. Mitt. 6,13-20.

11. Nelson, D. L., Ledbetter, S. A., Corbo, L., Victoria, M. F.,Ramirez-Solis, R. Webster, T. D., Ledbetter, D. H. & Caskey,C. T. (1989) Proc. Natl. Acad. Sci. USA 86, 6686-6690.

12. Munroe, D. J., Haas, M. & Bric, E. (1994) Genomics 19,506-514.13. Ledbetter, S. A., Nelson, D. L., Warren, S. T. & Ledbetter, D. H.

(1990) Genomics 6, 475-481.

4478 Gntc:Auaaie l

- - I I

Page 6: High-resolution physicalmapping Alu-hybridization ... · T317, which contains 3pter-p21 as its sole human chromosome3 content (7), wasusedto assign HA54-positive probes to either

Genetics: Aburatani et al.

14. Dracopoli, N. C., Haines, J. L., Korf, B. R., Moir, D. T., Morton,C. C., Seidman, C. E., Seidman, J. G. & Smith, D. R. (1994)Current Protocols in Human Genetics (Wiley, New York).

15. Riley, J., Butler, R., Ogilvie, D., Finniear, R., Jenner, D., Anand,R., Smith, J. C. & Markham, A. F. (1990) Nucleic Acids Res. 18,2887-2890.

16. Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman,D. J. (1990) J. Mol. Biol. 215, 403-410.

17. Buckler, A. J., Chang, D. D., Graw, S. L., Brook, J. D., Haber,D. A., Sharp, P. A. & Housman, D. E. (1991) Proc. Natl. Acad.Sci. USA 88, 4005-4009.

18. Krueger, N. X., Streuli, M. & Saito, H. (1990) EMBO J. 9,3241-3252.

19. Clark, C. C., Cohen, I., Eichstetter, I., Cannizzaro, L.A.,McPherson, J. D., Wasmuth, J. J. & lozzo, R. V. (1993) Genom-ics 18, 249-260.

Proc. Natl. Acad. Sci. USA 93 (1996) 4479

20. Tachibana, M., Perezjurado, L. A., Nakayama, A., Hodgkinson,C. A., Li, X., Schneider, M., Miki, T., Fex, J., Francke, U. &Arnheiter, H. (1994) Hum. Mol. Genet. 3, 553-557.

21. Yamakawa, K., Takahashi, E., Saito, H., Sato, T., Oshimura, M.,Hori, T. & Nakamura, Y. (1991) Genomics 9, 536-543.

22. Gyapay, G., Morissette, J., Vignal, A., Dib, C., Fizames, C.,Millasseau, P., Marc, S., Bernardi, G., Lathrop, M. & Weissen-bach, J. (1994) Nat. Genet. 7, 246-339.

23. Wilkie, P. J. & Weber, J. L. (1994) Genomics 19, 167-169.24. Korenberg, J. R. & Rykowsky, M. C. (1988) Cell 53, 391-400.25. Weissenbach, J., Gyapay, G., Dib, C., Vignal, A., Morissett, J.,

Millasseau, P., Vaysseix, G. & Lathrop, M. (1992) Nature (Lon-don) 359, 794-801.

26. Gemmill, R. M., Chumakov, I., Scott, P., Waggoner, B., Rigault,P., et al. (1995) Nature (London) 377, 299-303.

27. Hudson, T. J., Stein, L. D., Gerety, S. S., Ma, J., Castle, A. B., etal. (1995) Science 270, 1945-1954.