compilation of dna sequences of escherichia coli (update 1991)
TRANSCRIPT
© 1991 Oxford University Press Nucleic Acids Research, Vol. 19, Supplement 2023
Compilation of DNA sequences of Escherichia coli(update 1991)
Manfred Kroger*, Ralf Wahl and Peter Rice1
Institut fur Mikrobiologie und Molekularbiologie, Fachbereich Biologie, Justus-Liebig-Universitat GieBen,Frankfurter Strafie 107, D-6300 GieBen and 1EMBL, Postfach 10.2209, Meyerhofstr.1, D-6900Heidelberg, FRG
ABSTRACT
We have compiled the DNA sequence data for E.collavailable from the GENBANK and EMBL data librariesand over a period of several years independently fromthe literature. This Is the third listing replacing andincreasing the former listing roughly by one fifth.However, in order to save space this printed versioncontains DNA sequence information only. Thecomplete compilation Is now available in machinereadable form from the EMBL data library (ECD release6). After deletion of all detected overlaps a total of 1492 282 individual bp Is found to be determined till thebeginning of 1991. This corresponds to a total of31.62% of the entire E.coll chromosome consisting ofabout 4,720 kbp. This number may actually be higherby some extra 2,5% derived from lysogenlcbacteriophage lambda and various DNA sequencesalready received for statistical purposes only.
INTRODUCTION
Within this sequence supplement issue we were able to publisha compilation of DNA sequences of Escherichia coli both in 1989and 1990 and asked our colleagues from all over the world foradditions and corrections [1,2]. Ever since the number of newlypublished E.coli sequence data increased substantially (seeTable 1). Thus we were able to add and correct a number ofentries. According to our data a total of 1 492 282 bp is sequencedtill January 1991. At least one third of the nucleotides is publishedtwice. The data presented here may serve as a basis forencouragement to our colleagues to either send us theirunpublished, mostly flanking material or to determine additionallythe sometimes very small gaps towards known neighbouringsequences. This may finally help to produce the Escherichia coliK12 DNA sequence as the first complete sequence of a livingorganism. This compilation is available in its complete formquarterly from EMBL data library together with their currentrelease on tape (ECD). It may also be received on CD-ROMtogether with a service stand alone program for quick databasesearch and direct access to collected sequences.
PREVIOUS AND SUPPORTING EFFORTS
The most famous collection of E.coli data is the linkage mapcompiled by B.Bachmann [3]. These data were recently updatedand we tried to follow this update as much as possible. Threeother groups [4 — 8] started a program to fit the DNA sequencedata directly onto the physical map compiled by Y.Kohara et al.[9]. We preferred the genetic map positions rather then thephysical map coordinates, since we try to include genetic dataas well. In general one may obtain the physical data simply bymultiplying the genetic 'map' data by a factor of 47.2. Thisoperation needs to pay attention to a large inversion within theKohara restriction map, which however is considered in two othercosmidbanks [10,11]. In order to be prepared for merging ourdata with those regarding the Kohara map data directly [4 — 8]we have compared our data and paid attention to all of thesecollection as far as possible. Only the unpublished material (about68 000 bp) available exclusively to us, to K.Rudd or to B.Baum(both Bemesda, MD) is not included into the collection presentedhere, but may be available on personal request. Additions andcorrections of several colleagues are indicated in the main listwithin the comment column (see Table 1 for abbreviations).
PERFORMED COMPILATION
The general scope of this collection is to allow a compilationof all uncoordinated sequence information to finally end up witha complete Escherichia coli nucleotide sequence data bank,including all sequenced mutants. In order to give a visualimpression about the availability of sequence information of E. coliDNA we include an appropriate figure. This Figure 1 is printedautomatically on a high resolution printer and is to scale as faras possible. The extent of the black bars represents the mainlysequenced areas. All sequences with more then 2000 contiguousbasepairs are shown. The final print may not give enoughresolution, thus the comparison with the main list is stronglyrecommended.
Besides the pure sequence information some specified additionsare introduced, mostly if restriction maps could be found in theoriginal paper. We used B.Bachmann's genetic map data to sortthe sequences roughly by a tenth of a minute. Fine assortmentwas by a hundredth of a minute, if the sequences overlap. A
* To whom correspondence should be addressed
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2024 Nucleic Acids Research, Vol. 19, Supplement
hundredth of a minute corresponds to 472 bp, which seems tobe a sufficient resolution. If the sequences were mapped in eitherof the compilations using the Kohara map [4 — 8], we preferredto use their assignment including the respective orientation.
Note, the given map coordinate may vary somehow withindifferent issues of this collection, due to increasing number ofrecognized overlaps. If the numbers are not consecutive withincollected entries, you may find the missing entries within theelectronic ECD data bank only. Numbers higher then 100 withinthis line refer to DNA sequences, which could not be localizedwithin the chromosome yet. The numbers refer to the followingtype of information:
101 —126 Unlocalized structural genes in alphabetical order> 200 Insertion sequences in quasi numerical order (e.g.
200.05 is IS5; 201.50 is IS150)> 300 tRNAs filed as RNA sequences and unlocalized
tRNA genes in alphabetical order of thecorresponding amino acidsunlocalized RNAsreviews and summaries (neglected here)
>400>500
The gene symbols are either according to the Bachmann list orto the respective paper. Several symbols have been changed inthe past for different reasons. Thus the given entry names withinthe EMBL or GenBank entries differ sometimes from the givengene symbol. These differences are indicated as much as possiblemostly using the ' = ' symbol in the last position of the namecolumn pointing to the comment column. If more then two genesymbols are necessary to describe the content either of a singleor a condensed entry, the '&' symbol points to the commentcolumn, in order to indicate that this area consists of more thantwo genes. Thus the two columns should be read as oneconsecutive item. We tried to use each gene symbol at least oncein the first position. Since a fairly compressed form had to beused within this databank, some terms and abbreviations had tobe used and are explained in Table 1.
This third edition of ECD contains a major increase ofinformation by adding the exact coordinates for the performedoverlaps. In order to make the calculations transparent, wepreferred to keep all references with contributions to therespective area. In future issues we may only be able to quoteone reference per data bank entry. However, the full set ofinformation will always be provided on the electronic ECDversion. Structural information besides the DNA sequence dataand some other functional data, e.g. restriction map data,functional analysis data, sequence corrections or sequencedmutations are now also only in the electronic ECD version. MosttRNA sequences are compiled together with their respectiveanticodon sequence. For crossreferences to the tRNA collection[12] see our previous listing [1]. Most ribosomal operons arenot fully sequenced within their 16S and 23S RNA genes. Thusthe compiled sequences are sometimes only analogs. Insertionsequences are compiled using the known copy number withinE.coli K12 strain W3110 [13,14]. A future issue of this collectionmay contain this information at the respective genetic locus.
Strains other then E.coli K12 are indicated. However, therespective sequence data are not included in the final calculation.Names not found within the Bachmann list or not geneticallydefined within the original paper are abbreviated but explainedwith the full name. Undetermined or open reading frames (urfor orf) are indicated mostly according to the original paper andthe resulting protein size. Regions with no specific genetic
function are marked as intergenic or flanking regions. Somereviews on the genetic or functional structure as well as on cosmidbanks are included, but neglected here.
The accession number column gives the first accession numberfor any EMBL entry. This number remains constantly with thequoted nucleotide sequence. It is therefore the most importantcross reference. According to the general databank policies theaccession number will even be part of any condensed entry afterremoving overlaps. Thus an EMBL EC entry may be found viadifferent accession numbers, but each accession number pointsto one individual EC entry only. If the EMBL-accession numberdiffers from the GenBank accession number, the GenBankaccession number is given in the comment column.
The EMBL column gives the EMBL entry name for the givenreference, usually beginning with the identification EC Therespective GenBank entry name usually begins with ECO ,and is given in the respective column. This may be used to findEscherichia coli sequences directly either within the EMBLdatabank or GenBank. However, this identification may also pointto plasmid borne and other sequences (for a list of these entriessee the previous listing [1]).
Note, that the EC or ECO entry name may have been changedbetween different database releases, especially due to compressingoverlapping sequences or to changing preliminary names (usuallythe respective accession number). Numbers in brackets after thename point to the position of this reference within the list ofreferences of the respective GenBank/EMBL entry. Incorporationof these numbers is still incomplete. In order not to depend onthese changes the respective invariant accession number is alwaysgiven in the accession number column.
There are now five columns giving different types of numbersof basepairs:
1) The basepair ('bp') column gives the number of basepairsfound in the reference quoted. The number is mostly consistentwith the respective number given in the EMBL or GenBank entry.If it is clear from the original text, that the given sequenceinformation is used as illustration only and not originallydetermined here, the entry within this column is 0. If the textallows to complete restriction sites the flanking nucleotides areadded, or if it allows to recognize vector sequences, thesesequences are omitted. Thus the given number may differ fromthe EMBL or GenBank entry. This is mostly indicated in thecomment column.
2) The 'offset' column is a control number for ascending orderin the map column and describes the number of nucleotidescollected up to the previous entry.
3) The 'from' column defines the address number of thenucleotide to begin with in the respective collection.
4) The 'to' column defines the respective nucleotides toterminate the respective collection. If the from number is higherthen the to number our program automatically inverses thesequence.
5) The total column gives the number of basepairs added toa total number after deletion of all repeats and overlaps fromdifferent entries. This number is given only once per added areain the first entry after a sort by the genetic map position. Datafrom strains other than E.coli K12 are ignored in this calculation.If there is no overlap to other entries the numbers given in thebp and total column are identical. Adding up all entries in thiscolumn we arrive at the actual number of sequenced basepairsof the entire E. coli genome. The actual number of total basepairssequenced up to January 1991 is 1 492 282 bp = 31.6%
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2025
In order to distinguish between the different entries, e.g.forthis printed version of the data base, an additional one lettercolumn (•) is included as well. T defines the start of a completelycollectable entry. P defines the start of an entry only partiallycollectable. F defines an entry missing in the EMBL data base.C defines any collectable entry other then the starting one.B, J, U, and YV define entries from strains other then E.coli K12.The article column line gives the references for the quotedsequence information in a fairly condensed format and indicatesthe volume and first page of each entry, only. The appropriateyear is added in a special year column to allow the calculationof an annual index given in Table 2. A list of abbreviations forthe respective journals is given in Table 3.
DATA DISTRIBUTION IN MACHINE READABLE FORM
This compilation is available as a flat file (ECD) from the EMBLdata library and is automatically distributed with the each releasethe EMBL data library. In addition, this compilation is availabletogether with a stand alone service program on the CD-ROMversion of the EMBL Hata library, but this version is also availableon disk on request from Gie/3en. Both versions including theservice program are available from the EMBL file server [15].Using this program one may assemble the entire nucleotidesequence directly from the CD-ROM or may extract each singleor collected entry individually.
ACKNOWLEDGEMENTS
We like to thank Kenn Rudd (Bethesda) for his unpublishedlisting, and Peter Stoehr, David Hazeldine and Rainer Fuchs(EMBL) for constant flow of recent database additions. This workis supported by the Deutsche Forschungsgemeinschaft (SFB 272).
REFERENCES
1. M.Kroger, Nucl.Acids Res. 17 (Suppl.), r283-309 (1989)2. M.Kr6ger, R.Wahl and P.Rice, Nucl.Acids Res. 18 (Suppl.) 2549-2587
(1990)3. B.J.Bachmann, Microbiol. Rev. 54, 130-197 (1990)4. K.E.Rudd, W.Miller, J.Ostell and D.A.Benson, Nucl. Acids Res. 18,
313-321 (1990).5. C.Mecligue, J.P.Bouche', A.Henaut and A.Danchin, Molecular Microbiology
4, 169-187 (1990)6. C.Metligue, A.Hdnaut and A.Danchin, Molecular Microbiology 4,
1443-1454 (1990)7. H.Watanabe and T.Kunisawa, Protein Seq. & Data Analysis 3, 149-156
(1990)8. T.Kunisawa, M.Nakamura, H.Watanabe, J.Otsuka, A.Tsugita, L.-S.L.Yeh,
D.G.George and W.C.Barker, Protein Seq. & Data Analysis 3, 157-162(1990)
9. Y.Kohara, K.Akiyama and K.Isono, Cell 50, 495-508 (1987).10. R.P.Birkenbihl and W.Vkdmetter, Nucleic Acids Res. 17, 5057-5069 (1989).11. V.Knott, DJ.Blake and G.G.Browlee, Nucl.Acids Res. 17, 5901 -5912
(1989).12. M.Sprinzl, T.Hartmann, J.Weber, J.Blank and R.Zeidler, Nucl.Acids Res.
17 (Suppl.), r l -172 (1989)13. M.Umeda and E.Othsubo, J.Mol.Biol. 208, 601-614 (1989)14. R.P.Birkenbihl and W.Vielmetter, Mol.Gen.Genet. 220, 147-153(1989).15. P.J.Stoehr and R.A.Omond, Nucl.Acids Res. 17, 6763-6764 (1989).
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2026 Nucleic Acids Research, Vol. 19, Supplement
malFEKBrrnK
rpoBCrrnB
lleShsdRHS I e n VA
rrnH
lacIZTK
80
glgPACXB
rrnD
at qA
E. c o l i K12
4 720 000 bp
January 30, 1991
pheST
narRLUpfcBCDE
uvrCrrnC
Fig . l The sequenced areas within the circular chromosome of Escherichia coli K12. The sequenced areas are calculated in percent of the total chromosome accordingto the main list. All sequences exceeding 2000 bp are shown as in scale black bars at the respective genetic map position. Some prominent markers are included as well.
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2027
Table 1. Databank cross references and abbreviations. Some differences between the two databases or repeated entries, as well as accession numbers given in mostrecent papers are indicated directly.
Gb points to a difference mostly in the accession number between GenBank and EMBL databanks [see also AC line].GbNew is introduced, if the actual issue of GenBank data bank may not contain this more recent entry. However, caused by a certain difference in
updating these entries, they may already be incorporated into the most actual GenBank version. They are always available using theEMBL/GenBank file server.
KR indicates entries primarily found by K. Rudd [3]BB acknowledges the contribution of Bobby Baum, 6607 Pyle Road Bethesda, Maryland 20817, USA. Entries marked by BB are added [new] or
corrected [length, map location, overlap] using a listing compiled by Bobby Baum. His collection may be available as Ec.lis file via theNational Biomedical Research Foundation (USA).
MK [Manfred Kroger] andHD [Peter Rice or other colleagues at EMBL] point to unresolved differences to the databases found by these individuals,char characterization of genetic elementscomp comparison with other organismscorr correction of published datafunc functional descriptiongap small undetermined gapsmap restriction or genetic map datamut sequenced mutationsprom characterized promoter sequencesregul characterized regulatory sequencessumm summation or reviewunpub unpublished material quoted in the respective article or in the database
Table 2. Annual growing of the E.coli DNA-sequence information. This table represents all included sequence information. The final number for total informationexceeds the actual number of sequenced nucleotides (1 492 282 bp) roughly by one third.
Yearannualentries
totalentries
annualinformation (bp)
60020534924984142643420582053011166670
20077362221157128138213175317884415918324643220345724834928599332687526507
totalinformation (bp)
60080511541403224426703104330941294659577512445325226874418445626583839759157643573561898205011855071433856171984920467242073231
1967196819691970197119721973197419751976197719781979198019811982198319841985198619871988198919901991
124310564979
214155759312513715517014817919422911
13710202531354451608112217725234547060776293210801259145316821693
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2028 Nucleic Acids Research, Vol. 19, Supplement
Table 3. List of abbreviations for journals
ABB Archives of Biochemistry and BiophysicsAcMB Archives of MicrobiologyAdER Advances in Enzyme RegulationAgrB Agricultural Biochemistry (Tokyo)AEMb Applied Enviromental MicrobiologyAnMi Annales of MicrobiologyANYA Annales of New York Academy of ScienceBBA Biochemica Biophysica ActaBBRC Biochemical Biophysical Research CommunicationsBChF Biochemie (France)BiCh Biochemistry (USA)Bint Biochemistry InternationalBiRp Bioscience ReportBJ Biochemical Journal (UK)Bioo Bioorganic Chemistry (UdSSR)CJMi Canadian Journal of MicrobiologyCRC Carlsberg Research CommunicationCSHQ Cold Spring Harbour Symposium of Quantitave BiologyCell CellDANs Doklady Akademia Nauk (UdSSR)DNA DNAEJB European Journal of BiochemistryEMBO EMBO JournalFEBS FEBS LettersFedP Federation ProceedingsFEMS FEMS Microbiological LettersG GeneticsGAnT Gene Analysis TechniquesGene GeneJBCh Journal of Biological ChemistryJBac Journal of BacteriologyJBio Journal of Biochemistry (Japan)JCBc Journal of Cellular BiochemistryJGMi Journal of General MicrobiologyJMAG Journal of Molecular and Applied GeneticsJMB Journal of Molecular BiologyMBE Molecular Biology and EvolutionMGG Molecular General GeneticsMiRV Microbiological ReviewsMoMB Molecular MicrobiologyN NatureNAR Nucleic Acids ResearchNNB Nature New BiologyPNAS Proceedings of the National Academy of Science (USA)Prot PROTEINS: Structure, Function and GeneticsPSDA Protein Sequences and Data AnalysisScie ScienceUnpub. unpublished material quoted in the databanksZNfC Zeitschrift fflr Naturforschung Part C (Biological Sciences)
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2029
O. O. O. O. Q «<ogoomp
12 22HQQQ
O.; O.O. CO < ti, O Oi
CO bJtO
£<>»<< HO o. urn <co0 IP« >-J WC0>OEH<r-mO W W U 2 W «DO PlfflOH Q H H 55 W P D ro in 5 U U UO r o h Q U, b. H OT X O r>i n 2 < < <
8 8888888358888H Q Q Q < 0 C « . J Q U b , < 2 2 2 2 2 2 S . i 3 i j i j M H CO O . r i n o h Q U, b . H OT X O r i n 2 < < < n & Q
8888 888 8 8888 8888888888888 8 3258 8888888358888 88S88HUUU I UUU I W I UUMU I UHUUUHHUHHHHH I N I XX XH I UUHUHNHXXUHHU I W WX W W
VO U1 t-l r~ VO rH CT> >O rH fH r~ fO iH CM iH O 00 CTl t P) r-
c* CM vo vo cs ts en vo oo f-i r~ vo o o inooHHomutriNoo oo o iHno H
J JPUUU O OU HU I H U I WW I
UUUUrlOOUI WWWWXWWCd uuuuuuuuu
in r~ in oo r *-i o a ^ o o n ch in oo in oo »H in ro in o m o
x: w oi cn£ cox:
r-imooooocMOcMinminCT^ ^ c/ ^ co r^ o^CT* n r^ in vo CM fHi-H «H CN *H CN fH 0>
CM <N i n
m vo co oo in o CN CM CN in in ^f vo CM in r ^ o o CM CM vo o> oor~i ID CM o co co oo CN r in CM eft vo ro J1 co o m o 03 CA (7»CO TT C M ' H ' H C M r t r H rHtHCN CM *-t rH n rH rH
coVOCM
cnaj r-oflOHH
o c co f~^ vo m rvo rH co o m
rH CO 00HrHrHOIrHrHinrHrH
in corH CA r-i
CM
<inooocooi-iOfan o ooCM m rH
cocnHO
CM
O0m CM <n en rH r~ o i^voPI V CM CO CO CO 00 CKC>rH ro in r^ 00 O rH CO
rH rH
r- ^ oo VO^TCKCKCN CM oa in o> vo o> vo *» oo oo oo o en cooo enCMCOfO C M C O r ^ O * CM O O CM CM 00 Cn *T CM CM t- CO CM O CM P-H H ^ rH CM f rH CO m VO r* CO On CM en in VO r* rH t-t n CO
rH rH rH rH rH CM
rH rH rH CM rH CM rH CM rH in rH CM rH rH rH o| rH rH CM rH rH CM rH rH CM rH CM rH CM CO rH rH rH m CM CM rH rH CM rH (M
itUUfflUOQ£J3CCCIJC CVH rH o a a) a) O O.XI C0fflfflrHrHx:V4XI33l<3-U3XIJJ.pCffl
OOOeCMCM CM Mm n cntm in mm mm a>a>OrHvovo
OOOOOOOOOOOOOOOOOHrHrHrHrHrHrHrHrHrHrHrHrHCMlNCMCMCMCMCMCMCMCMCMCNCMCMCMCMCMCVlCMCMCMCMCMfncncOCO
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2030 Nucleic Acids Research, Vol. 19, Supplement
P, XXX2 0.0.0.
Or- cnOOOOOOO
? £55555h h b a x o U X K & u J.J.J Q «cc «S «3E o.u 0.0, So. «J!
88888888888 888 888888888888^81.,HUUUUHHUHHH I WWW I HHHHUHUHHHHHUHHHHvovocncMr-oovoroncor-invo^aor-oooncovor-^cMiovocovinoen^oooDrHi^mr^cnO't.-HrHenvDvororHinO'ivo^rncMrorMiHcococncoiouwCMCMrtrHCMCMrHinOinrH'tCOCOCnincMrHOOOO't'tmOOr-O'fOOCn
o o o o
£55555338S 38888888MX I XHHHHHHHd r > in en CM mvo m v o t i ocoenOrHp-cncni^rochroenrocncMOOCMCMOvocMvoOlOnnHOOHHOHnnuiNinoooooo
>x!oo _co mac xi .«oin, 30.
S88§8XHU 3HNHHHNXcon in in ** in oo *» vo rH m ineom<*x)vovo«rinocMcor~co
3 vo o o CM r~ o en rH vo
O- Du 0 * bb H 04 0^ C^ n CM f ^ 0u 5 flu 25 ^ SQ Q^ srt p^ j j ^ CK flK P ^ pn ^ fifl ^^ * ^^ gj* i ^ ^^
pc pc CO P^ T*J ^J ^^ ^ ^ C3 OM DC LC CU tC 3C C^ ^ r £M f^ SQ U i i C f C ri^y 3E N^ ^ 3 CQ f ^ —™J —J t"J ^ J ^JO CJ O OO OOUO OO OOOOmOOO OO OOOOOOOfHOOoO OCMOCMOOOO OMUCaUUUHHHHHHHUHXHHHHHHHHHHHHMUHHH I HXHXHHHH I U
U Cxcccap QooooXXDIHo. o. to to
. ooooI XWW3WWWUWWUW I
o en o* 0^ ^ 9^ fft o ff^ 9t ot en CR 0 o 9 9* en en o o en en c en en en crt en en 9 &* en en en en en enIriHHHi-IHrlHHHHrlHrtHH'-tH
O O Ul . C . C U £ « O O 0 Ua oo<«uu aaoc «« « «
Moz<cncaca2cn «nn oo<«uu a
cnMoz<cncaca «tnen coca
VO CO COrH r H r H r H CM CM
XIo o o o 3x:to
C DC a) 0 a id OiO<C CO 2
cocor-r-«*r-rHCMCM
XIW O 0 9 30«« i c c accrja:z o a) « c m2a
rH n vo in en en rHNHrlrlH
UocacQm x x
O 4)ojmc< x «
ooocnocNcninOtHOcnr-ooinot~-ooovo oo in m n incM en r» en <nm t-i CN CM co »»vo r- t- in r-^f fi f-i m t-4 vo oi in t-i ao
Nco vOf-tincoinvo mincOrHcNT (NIMHrtH
cnmrnincNfHoOCNtvoCM rHvO
eftr-CM
o o ^ o O o o v nCM VO rH CM r | H rH CM O4 T» r~ rH
en en r~ CM en oo o r*>rH rH CM CM
VO CMCMrH
IH0>O00 0vr-rlHHCMO r~ cnrHoo i n or-\ iH rn
HHHHHHWrl(VJr(OHr*m r~ o r-r- en CM t
CO O VO rH in rH rH tin en rHr~ co inCM vo CM
^» com CMHIM in
oonr»ontoCM'I1 in
O i nmm
(H^NHHOOOOCIOM/IHHHHlOHHHHnOIHHvo ao n o ^ H H r i i n o CM rH r»rain en ^ <Hm vor~ r~co en CM OO enenN O rtnr/1~ t- r*oo oo rncM
en en in r» eo rHcc 7 0 0 N o ^ o M O c o o } T T n o o o ^ c
rH oovo in roro r - r - ^ «»incr»vOin<»)cMcr>Oforo (*)<«) rHcnovo co Tvooo ino rHrHrHH H ^ r l C M M rH CM rH CM CM rH f) rH rH V CM f f CM rH rH n
cncnr-cNcneoomencnrH****H H CM CM rH
x>BCXQQO - H O C B O V J O O O J J C O O U U O O O O
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2031
S< < n2vOCN 0.Z 2 O O« WH1 1 J O O ^ ^ O Q Bin vom t-
qqqqqqWHHH H H H XQ B O P l V U
H K< <«c<n too. n mui
rHCN « ._. _ ^ , ^ , ._.*< n0.VO O.CNOCO
8888888 88888388 888S5S 88888888...X I HHHXHHHHHHHHHHHHHH I HHHHHXHH I UHHXXX I HUHHHHHHXHH
o aU
88 mumIHHUHUHUH
cn vo o rn co cn cn oCO VO r> VO rH Cn rH 00inovor-r»cNvDcnCMH PinOHOOOnnnnnnonnNHinninHHOMOHriHON
XX I XX|->X|->XXXXXXXXXXXXXXXXXXXXXXXXr7XMoiMuiXXXX
o n O o o O H H OXXXXXX>^>-]XX> om OCNCNOOO
XXKXXtt 'Xu u S. ** Q
S inrHqqqqqq -HCN O OCJO.0. <<.H < mo OH
rt n <** n 2 3 2 2 3 3 < QQ QUCNQ M x X >.H >*vo < ncNOoo H < rti<<(n cocumrt ncof»4 O.ZZO.O<<OTHHfHEHEHt-*KOTOTC/lKVD|HCQZSE 040*>CNO.vOO«fCnfn04QO.OO.DQZOOZ
(NO UUOOUBoOOuSuuGoOOUuiOOoBuflOUrtUOUNNNUOUOUoBuSoOOXHIUHHHHHHHUHHHUHHHHHXHHHHHHHHXHHHXXXHHHHHHHHUUHHI
OH CCjfH
QO
saqxiS^Q
uouuuuuuIUUHHUUUU
o m in vo co 00 O m cn r-CN H rH rH r» cn m VD vo vo co in r* in cocMin n O H O coo 00"» ta\ co r^ncNcnn *f cncNcn VDCN
O f l o
f m cn o r- r r n m m i-* f-t ?n co CN in CN O\ a* in in i^ a*CN OO O O in oim » r- <-< o> ro m cn co cn cn ro o> o cn rocn vo CN co r* r** co ^ r- r CN I/I vo »H VD H (N CO in H VD
ocoin in in o CN »Hr*r4 f-H «H rH
(IQU£O£mSmuoui u
n
crt om •-<VO *H
cn tH m w vo r^ (*> fH ro *-i I-I CT» r-* in o f-4 en in cn in rn in n CN r* r* o *H «-H CN cn CN CNfH VD r-i r- f-H rH vo vo t- r* in r* cn VD r^ vo vo vo *-t CN VD voin in o oo r- t in CN CN
rH rH *H i-H CN rH CN • *~t CN OJ r-t r-i r4 r-\ H rH H r-i CN CN • CN CN • rH f-H CN CNXI SX S3
O O O U U £ 3 U ffl JS X! O O CO O ffl O O M O H £ B 8 X ! 3 X 3 M O U £ £nB<nS<onnnmmai^nmumSDmopncnOBZcoffim^>
cnrHvooocNcot-vocoin vo
co «* rH in»» CNrH CN ro
enn CNI-IVD m
o"»Ocnoi t -
OfnvoHvo
oCN.-H
vorncocn
onnoo<f moo
CN n
f-t cn * * ro o VD rH vo CO CNr~ oo r*> CN o ** co r*> vo co
rH rH CO i n VD
c o o c c o a i HrH O f oo ^ VD VD cn rHoo f «H in ^ cn CN r-i vo
CN CN CN rH
or* cn in in o ro VD o cn ao rHCNCJ* vo vo CN CN cn CN f in CN VD
rH rH rH f ) rH CN rH rH rH <»(
ocn o vo oo vo CNm m CNGOVOCO CM
oromvof inoo
CN CO
I O O H N H H O H M J M J X O H
CNOrHCN CN CN
rHCTlrHCNrHVOOrHCOVDOrHrHrHOmfCOrHvO COOVDCNCNO^ Om
r H m CNrHCOVOMrH VOC71f-H CN OJ m f
<iHOOOOOr-trHccnCN
r-rHinrif IK
m o o Ovocn in r»m r»rH cn in r» co cn cnr H H I VO Cn r H r H CN
F-lrH
OcnomrHovOrHvocor-incocNcoofooc^vo
in CNCN rHCN rHrH CN CN CN CN rH rH rHCN CN rH CN <H rH m rH CN rH rH f) rH n cn cn CNcNOOvocnfinocNvo
rn f CN cn Ol
1
n
io
ooooa
Ml
m3 ftu »
3* *H•P4JB B0 O 1
©XICDm BO-r)
in rHrH•1-1 UrHU 3-rt0 E O•O-rl
mj5 ErH- 4 a0, M 0
i
*4
CN
14 M
« l >
14 I
a
oI rH
n i4Xi mxi no
J o e c o0 -rl -^ O
CO 00rH ftrHo oo
in JJOVO VO. o> •
in Bin
coi a
i a 1 1
o
SI?•u-om i
O p<QDQ W>.D COO O fo O^!J5 aaiDhOOtaftft
4 mE
m*-iEOu
ssT5DIB
-rim
oi • o 5 F r § •< "- ^ ot4 *M un on ^ H n •* *•ftXHxoi<< >oou<<<<rtcowrtcgQ«cococo o 2 o o <<
- - - , . . . . „ ft ft > 4-> ftft<M4J-P-P O ft^ ftUft3ZEO>OlEBBl4ftftOal44J4JID^>TJa£X£3BVl> iSBinf tgBEOOV4EEEEXIOrH«rHIDSBOOrHrHrH3 'OT)J3£XrHrH
O 3 V4 V< M ft-0 «JOCCXo)OlBlB>M1-iOffllD4)MO.V4l.T)l4'H+JfaCCCnCn Cnt-i J< J< >j M ft O< O>
XIX)
O O
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2032 Nucleic Acids Research, Vol. 19, Supplement
I K CQh. . . . _ . . .< CN 5 . 5 Z < "5
X HH HOZQjJjQOJJjijJiOSfiSSaxOxSvTvCL.n,ioooor^HbHPo.z<ooouc5JtnDbuuguQ£>Jo3U-<ja.KiQM 2<2HK«x<QHo,o<p<a<b<Q9xf|<2S OOOmoOOOOOOOOOOOOOOOOOOOOOOinOvoOOOO OOOOOOOOOOOOOOOOOQQQQ
O O O H O O O O O O O O O O O O O O O O O O O O O O O H O M O O O O O O V O O O O O O O O O O O O O O O O O O OHUHHXHHHHHUHHHHHHUHHHHHC4P4HHHXNXHHHH I HUNNNHMHNHNHHHHHNNHNN
OTtJ Z in<e < < m 2 < oi* wm z z
q o o > ; wtoo.o<a < u a . o < 2 t >
CO < OX O Oa S3OIUIMo oowww
OOObbuOO OOOO OOOU OOOOOO OO UcNOOOOinO OO OOOOOOOO O OOUUUMUUHHIHHHHUHHHHUHHHUUUUXMHHHXHUUHHHHHHUHUIWWWWWWWW
Qo
W
oooonnoNOOOOOonQOrioooononox x <-> x > x x I-J x I-J x x x > ••> x x x x
Ow
N o o ofMCM<O>HrHCM
r* co co vo r» ^ *HrHCM
CN r CN o vo <j» o o n
CN rH rH in CM CN n
o^^ co i^ o m^ rH cj» cj in in n *» m oo ^ n n ** CN r CN CK CM «*CN rH rH rH n rH n rH VO rH rH rH rH CN
^ n r r* CN CN n r** c\ oo *-*n cj» eo CT» CN CO O n vo vo oo CM
CM rH CN CN VO rH rH rH CN
IO>HOvHHHH(NHHCOHinH<*r vo in o i - invooorH p- in p- tincncNCN
i in rrHin
O0O0«HHHlHUlHrt!vo m CMo tonn !-••»
VOIOCNcNOim
CN inaom00rH
enr-rHCN
r- ino mCN
rH 5T rH CNVO^ CD Oh rH rH<-i CM t\O*D
a*vo
COrHCNH CNrHCN rH CN CN rH rH m CN CN n CN rH r-i r-t PO r-i CO r-i VO H H H r t CN rHrHCNiH CN rH CN CN VO rH rH
XI XI
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2033
uco r-moo\O r-m CMO mX X
88888888888888828888IWWWWWUWWWWWWWUHXWHWW
cnoooO'*CT>oom>o<T>m«»rocor»r-oO'»mo>CM>rHCMCOU?VOCOr~00<0<X>rHrHOaOrHC-mm
TrHmMO
8858888I HXHXHHHH
JEH<c£
8888 88828888830.H Zl
88 8t_ .,H H H H I H « I X W W X
Dr-
UmWX
uDQuBoouoouuuuouuBBuHHUHMUUHHHHHHHHHNHUH
vOcni0O^>oooDcoo»cocooOt-4<-4v0c~^cMminiOrHaooon«o>or~vDrHri<7lHHHHUIl0«C0^«inc>^<n«Ol0C0Nr-Hn>O4U)0MIIOC0H
rHrHOrHCMOCOOOiHOOrHOOCOOrHCMCMCMOrHmOOmCMOOrHrH
XXXXXKXX^XMMXi-l^XXXXXX^XX^XXXXMXX>-> o m < a *
r I S 05K £"« M rH O < £OJH;HM <<map _ H mcoDgprHdjeq qmm
52222(52 Qo.fifiE-i^pHpbnQr-lo.h^SCQ^'bOKa:ooOooO oooboboboocNobooooOocMoOoo
I WWWWWWW I WWWUWWUWWWXWUWWWWWWXWWWU
ucn
OOWWCNUImo
co CMCMO
I XX
um
tJUI HU
OHMUlrlHCM rH rH fl*
m vow r~ rn o m r»n ooiH o CM*j|*i'Oinr^r^ moCM CM *H CM »H n *H CO
m oo CM xnrrr- noi nmCM r H i H m
o C M *» o CMo> c~CMO OHnmHini-4 CM rH i-l rO<H CMi-4CM CM CM rH CM tH CM «H >H
ooooaiomoocNinoi^»m<»oorr)r-mcT>4»noCM U I H H n H H rH rHrHCO rH CM rH rH rH p« rH CM
IHHOrlOMOM/IHHHHHHOH^OOOHO)'
nuiH^in ID rH o voCO CO r- rH
» rH
UHhHHHHHH
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2034 Nucleic Acids Research, Vol. 19, Supplement
u u
BHHHfc
rH H 2 2 2 X
o.B,gooa5yy i i !q !q
cox
888888888888S 88£38B3g888HHHHHHHUHHHHX I UNXUUHHUUUU
o o o
88888UHHNU
H o p » 4 £ 3 t B
88 8888838888828888888888 3888NU I I HUHHHXHHHMHXHHHUHHUHHN I XHHH
*ooncnCNrHlOlO
_ _ noon•-HOOOOCnOOrHOrHOCNinOOtnOOOOOOrHOOOOOOininrHOOrOrHCNrHOOOOrHCMOrHOrHnrHOrHO CNOOOX X X X X Q X X X X X ^ X X ' D ' - 3 X > - > ' ^ > X > ' - > X > i J > > > « « < X X X M ! ^ X X X X X X « ' - j X X « X > X X X r ) X X I ~
. _ _ _ „ .^ _ _ , , _ _i Q o! PC" OiOoo - - " - - • - • ' - - * - - ' -IHHHZII;^^^ J -
ooooobooOoooOinoooo oOOcSooboboooooObootJoOoooOoooOOOOOoUUUUHHHUHHHHHXHHHMIWMWUWUWWWWUtdWUWWUWWUMUMUWWWWWWWUUUW 2OOUwww
00 00 00 00 00 00 00 00 00 00 00 00 00 cn 00 00 cn 00 c* c* co 00 00 00 00 00 00 00 00 00 cn cn 00 00 00 cn 00 00 00 00 00 00 00 00 00 00 co 00 00 cn 00 00 00 00 cn 00 00 00 00101 Chen cntIHrtHH.
o oo r»cNcninr~ inen CNCNOO V I T i f n * O * rnoo t - * in r~ in cn or~ » cs oouinooro into 00 >*) n vo in tn ij) t-» ^ ^ oo in ** in D *o «H tH ^ en r H
m r o »H in r* oo m CN *O »H in wo n o o o n K i H U !
o^ in*3" CN IN <N r*ro
cMCOU<ea «2UCQ
ao i1 f r-CM "» CN ^H
* CO *O
I (NrHCM <N rH • rH . rH tXI XI
sisi xiorH 3 o 3 ocoi u u o u < H cooirtr
t ooo» 010 <J»CN r» o o m n m\org CM CMICOQVO r»vo i n n mvo r - >o>onvD vo in r» <r> in r» •r-\ r-i • CM CM i-H iH fH CM H H H iH • rH *H i-H fH CM f"H
xt Si an 3 4) j : W J3 01 U O £ O O U ^ H U D O O O O O
a c u < o c « ta ct a) u 05 o) j m « a m ) m m m o aOn
m c a
cr> CN UJt» cn r~riH O
CMD « H r t n CM CN rH r-i rH CO CN CM CM
CTKTI
rH Ul
m c~ CM o oo r-«»* CN cn n r cr» O(NrHrH CN
rHo>r-CNr-ooC
cNt>D0O"*nmincncNCN»-H*iCN CN rH
CMOOC^OO
rH n CN CNcn in
rHrHOOrHrHO00*OrHrH.moo
KIOlMVOOHOir* \D r* ao r-\r~ rHcn t- *
m m r-
oo m cn rH00rH rH
VO rHr-oo r~ m
lt-HrHrHOrHCMCnOOrH*rHrHrHf00 en p- CN *r-- r- cn * m
CNCN CN
* O O cnooO m r- o cnn»oc- cno
* o o ^ n r N O o c
r H CM n CM CM CM r H CM <H r H rH CN W n CM CN rH n * rH rH CM rH CN CN rH rH rH rH n CM CN n n rH CN rH rH *
CO
n
Iuuouo
• p
o
-r|
o
3CM•I -I I I I I
a
o0r-tCO
• t
((J(N^-|
>
sC I
CMr-l
>
<55 ^
m aCO «
™ cOrHOCN
o
uo a
cOm
• p mO r H
aM -\ MO! oo1 OrH 1
OQ 05 PC
X I DO X IOCOO
C (-*H (
r
m*torHo
C
• ac
J CD-rtH CN
cni nr->O
1
) xa> 0 (
H
a(0
J24J
O
c
HCOC430*
1 18 1ff^3
Si
w3 3 B
or i*oiHX
om
m
Si
oaCO
JC
ftc0
1 1 1
10 H
couumcqCO^4 *M ,Jc f!4
c0n9Si
M 0MOirH33 a
oaa o« ~ H
id MO O 1
CDF
m
^J rH J^
CM
rH"MM
1 110, g
O i Q
EtD
tcmattit,JC Q^H CM (Q
oaoa3IP
ike
1M
O
• r )
0O
1 W 1 1
O CO Xp/OXI O
noa 0oaoarH
c«
ft09 3
5Qy*
<30
SZo
4J
0)14J 1
oax•M CO
XIoc
mo
O
oU 1
a
14
Oi CD
OJ3O
>•
oa
mot
( V .
, Jmj rW
aXI
mOo
1 CM 1
H01
,-\
CD * f CO•+J 4J J3
oaoatt
1
u8
Xat
•14• )
x:0
ua4JQ§su
X
o£
CO XI X I
coo oSi C C
CrHrH0 0MT>^ tO VO
corno oX X
aOi 1
us
orf
0.OUU\ 4 V<| V<l J-|
o•9 O O
I B- CVH'.H" id ilj id XI X! ~Orri -H Cb CV< O X 0>P m £" U C O> fi Oi Oi «J i
o o rH CMO c n * m t o n n m o o o men o r t n m moo n c ooo r H O r H O O r H O i n t o o m r - m o o o m o <or-cno>HcMjn* OCN o r - o o o
« r H r H r H CM CM CM CN
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2035
Q Q O.O.noo fc ppDHHSES
as M n «e fUHS£OO
w m x n i* 3Sum
§888 8S8 88888UUUHIUXHIHHHHHa>o*t-oo * * » » o o to to r~ oo o rH CM rH CN CN rH o\ rH r- to in o> rH to co o m H r- rH CM ro *» CN to 00 I - in o CM O co CM r- to rH c> CM in co co *»o c o o i n ooCTip)oi'»'»oa>too«CT>'»cotor»co ~" " ~ ~ . - . _ - _ - -
O X X u u 9C 3C Q 3C V V 35 ^•<3'9OKQi<'MwyZrGXPtXrCP4P><tfr1|9thQQu
88888 88 888885 S888888§888888888888838888IHHHHHIHHIHHHHHXIXHHHHHHXHHHHMHHHHHHHHXHHHHI
riOOri rHCM rHin rHrH rHOCOin inOOrHOCM O O O O H O l
xxbx i xxxxxxxxxxx'->>xx:x i rj« ' —M o n o s o o
lUIHHHOOINOOOO
O n o n i o iOOOOOnNNl'IriOHOOlllx']>ixxxxxxxxribx
x x o i
88^H-8
QQo to to " _
;o-> to to o.k2 ZCTKSO toco p p f< fn q j « « pdtotp d!ijcnoio taw p p f< f< q j «« pmoip 3<2pj io«2p 0.0. Sutdpd zcoxm^O oiqDDDIK-fflHHzS HH >4u MMturnr-o qqqx ox XK I-?IJ HMHD KMQICOK u< <o to tor-
iOO OCM pin 0 0 U 0" tjin in O O O O O UO O O O u GcMino O O 5 O O O 0 O O 8 u O G O 5 O O O 3 8 O O O O OinHUUUIUXUXHHHHHXXHUUUUIHUIHHHHHXXHHHHHHHHHHHHHHHHHHHHHHHHHUX
n i r» m m «* o in CM tonco CM
i o o o ai-t ro <N n o <H r- in n n CN r- <-< to r- cs M CN <t m CM m H oi o n in ft n <*) n in a* in
t U I H U ) ( N t-« r* in ^ w co CM co into t-i toM H
tD tO in tD rH tO r~ rHrH tOCM H H 1 N H H CMrH
O in CMrH rHtOM M CM CMCM
OCMCM
C M C M t O C M f O t O r - r - P i r HCNrHrHCMrHCMrHrHrHf-HCM
^f to to in r* chCM CMrH rHrH
r-r-rHtoin^rHr-inr-mrHrH MCMrH rH rHCM
r - i n tO rHrHrH CM CM
m om coo mm io a an si a
; OQ a ap u u m m u o; m C N II X CO ' "
oi m <o i IXHUUH «i
: o ( p m t o H t n ;> x o <r> >-> h. •->!
,S2! ozX 0
S en o ox:X <a o<soc
ootooonoinor^otcMnoor-o^ocnocMOOOO^toooooinr-oocntor^orHOOincMinoooooocooinocMOOOCM iH mooo OflOOOcNf- in I-I «OOCT> CMOOoocn -HvcNCMtDoo •* m r- *r in cn to o *f oin CM to CN CM CM OA oo CM CM cj* r* <-\ c n n ^ o to c*> ^ o> r* CM n r* in rH ro m r** r r*> »o r oo en coCM CN n rH iH rH t~ CM <*) CM rH CM rH CM r-\ <H U1 CO « 00 tO CM rH O
in o CMtoo>inoo\ GO in r ^f to CM
CM rH ro
ooo^oooiMHjiflOMjutCM CN O> OO CM CM O> r n oo CM O CO
M rH rH r~ rH rH CM
o O O M O O•"f^joo « m « o O f l O to CM in rH «i CM CM to C M D * n N H tor-co co «o to CM o to c o ^ CMCO <T* r ^ p j n r-*m n r - cocor*ChrH^
CM rH CM rH CM rH rH in PI *T 00 ^ CM rH't
ooo^finoocnr r totooo
ooeft
MilOOHtin m•-< CM
HtOrHrHrHrHOtOrHrHrHCMrHrrH r- ooPI CO
00 torHCO
t oi nrH
r-CMCMCMOHro r-P) intOCM^CM rH
cocotDCMrH
OUOOHHtfHHOHHHin *t toin P I r^ in I-H
rHrHrHrHrHinrHr~rrH CDr~ oo
H i no>OrH
O O H
rHCNrH PI
JQ
XQO-r(
Crl0%4•H «4JOMX1
O<HC
I -I I
DlrH
O
a±>
MI M
oo d
id
oo
CMuato
~» *>
n c cn D 03
i S ruoa ~C rH
CMCM-^c ij wi o.• HI< a •<<<<<22^24J4J-P-P
O"O0 - 1 IJ UrH 0) O 0
arH D OO 3 33 O OC O O
CCMCO I
xiaOBc u
•* uo
r- oCMtorHo
H
urnu ao-iox;
co
a
oaa
a
inI I I I I
CN
2o. u
<54J O
o
ao _o u
•y o Scue o i i i
o> o u ; JC MH t»
4 0CQ « H 36C u
O O OtJ <s a
ti) u•P
C C U Oa a 4J H E<< o aui
Q G T3<<«QU -OO CS zoomxco" • • -
K ,Q -*4« C C Hn««j«j4J.p«x:x:tT>D<e
fOCN•J O
0 0.
4g)
30<
-r(
3 arH-Hox;c
o-r< >
e
iXIp i
c-mto
mH c aO-rl
xio S> t O M CM 01
S tO-r| O> "»
H > aoo~i 3 oQfl'O O
00 C
2
un o
aoa
I I I 4
OVI
«o °(D- rH OVrl-rt <W+1 S XI TJ "O rH B O >t >t-^ - . ,
S o o c a-p U « « « O M tncnc CBCPSX;
hu t! & •
CM I XI I
O 00
a
8o
a
04J
O t»LO l t I C
^
U)
U
a 0> M l7> cn x xj M o o.x* xi xi xl zLt r i V4 O ^ Q U M O 0 Q D Q K3 PUD 3 fl* ««S<UXIXIXI4J
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2036 Nucleic Acids Research, Vol. 19, Supplement
mmx S U> 3 CO d. 4.
DO] Mjunbba AOIUXO: WUUUO<I33H<NDZ
888888 88888888888881,88HUWWWH I HHHHHHWMMMMWWXWM
CDHzoDO;
8 888Cd I M H HI I H I H M H H M H I H H H H H H H H H H H N H X U H I I I N I H H H H I H M U H O I
«O P-Oinvo mm oncM.o i n n c o VIP-UJr ) 0 O « « 14 CM CMO n in CM o n m
JJuuub vooaooo< »42 P* K P* O (n^^lAliiO O O O O O fHUJU>p-P-O O O O O O infHfHfHfHHHHNHH I XXXXX I Ir-i ^ oo co p- oo m <n CM o *o u> p-com cMCMino I O I C O O O O CMm CM en en en p-** en en *» in m CMH O O O OCMOiHVOVO P- I- CMOOOOOOHIOHHHH in
in inX
_ p. _O on 05« 14P-P,
uiOooOorjOouuuOrJouOOOuouucNUODouuOUcNiDO uuuu omuXHHHHHHHHHUHHHHHHHHUHHHXNNHUMUHHXHHIHHHUUXH
XXXXUMX I j>>>X^3XXXXXX I X
O.P.P-CMCM
JOOOUOOOUOOU inHUUUHUHHHHHH I X
xxm
OOCMCMVOCMfHCMentHOCMCMCMfHCOOeneOfHeOinCMCOinCOP-OOoo
CMfHtHfH -fHXI
m 300 CQ acQ
3 o u u u u o x : x x:x: ox: « x;x: w ox: o0,(d<a«f>iitif4uuttuu<auc«ouci;i<<domi(tefflffloaamoamai<caaacaffl<a)<zmm>m
como*»COCM
oiin n r - CM
*f oo fH vn eft o> <* enin oo t fH fH m fH p-
fH fH in m CM rn in
m vo entHooofHCMCM
M fH p* fH p» p« CM ID en p- CM on en CMen \ovoo ' ten p-oo f H o o m m p-
.HfHfH rH m en fH fH fH fH fH
Oen o
inen
ooOfHminvo p-p-en
fH *»
O ^ o ^ c jcM\DcMinp-enfH f>) fH
ccMCMOen^oocM oo*»CM in oo o» vo f oo cMfHCMCOfH fH p- CN
aooo
lfHfHfHfHfHCMfHfHcnin^ffHfHOP"C0enO0nCJ0HfHfHOfHTen o enoof »o oo CM o "» m *»m CM iHoo m CM tH"»<*en CMen fH U> CM * *
in min CM
oo en00 CMfH en
a>i nCM
OCMen
i H en H H CM xo CM fp- rnp-fHen H H O
cn*»in
iH fH fH O fH f
in oo on oo en oCMP-
1TJ
fH U) * »* iH CM fH CMtHtHCMnifH fH P- CMCM
• w14 fHfnm ^
<M in" M OHrl ^|*O **»—1
CM C
an> oXI » P-
^ eD iop-on BrifHCM 3 < M
<M VI V4VI *4 0
*H 0a) »O> »>X C»<H-OHZ Oitd fl2 •
18D in
O H O>-.«) fH•J >
» CM
2; B4J"«
Vi VlVI Vl
o uM M
a a•U-U
3X
aXI
a a1 ftftOH 1 1 1 1
ID
OOfSrtCuSxo a AVi Vi Vi
s0oK1iij
&
0•—I
o
o
t 1
3s
noaj -X)
o*oH
a1 1 8
MM
a.
en OQ oa
XI 0QUPO
c 5•H O
co
f HOh j
1
u0O
cVl 1
>i>iviau
1 1
0)
a6*
4 O 1
oaoa
1c
cVlMO
•MIS 14
14BB "Mao• •oJ C4J aOXI rH
u aO **4
-H
oa<ejJ'TJ+J'M
XI XI XIu u uc c c
aa-H -H -H<2 m m HDC on on enO **o *D *tHfHCMfH C O
O 3 O ftf-j-Hrj OX
h U XIC Dt OU O < OCOtnO*MH2HD O O
(0 S 4 ^ M O*tr>C4J CfH 3 uo n «J » »H . JJ<m«l(Hi< O M-HcztztsaeS HSM4J vi i
mom amrn o \o OCM iS fH C
Ql J^VI B VI
U O O O B < Q r <« H C <M a O B XI
4J
a
CO
-Hi-l
ou*H 1
CQCQ
3C O•H Co
on Oen Vi
o a
*JMHa « g•HUEa a
viS SM 4 JO >H
1 | tjtj|(O |
— CM <
S vi 2? *^ B
XI XI XI(IUUc c c
fHCMCM00 P-P-m *o **0fH tHfH
o o of"3 l-j l-j
a
s•HVI
I m
•
aoaz<OJJ
paoa
Jo
i i
Ol
H4J 1
X
ID•C
1 1 Ul 1
sN8COP.
Ch4O
-enaXIo
+
•i non•M
aCOM
£ Vi<M O
I
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2037
CO M M
OXKey
SBHIIH
;MP3Z D9PCKKSUMMMOT
UOlQCLDifiiU(O2C by D HMO *)MOOo.o;rHBL,HaiSi-o.piai-!xf«uuu-; xo! u u h u n n l y 2 3 PU S 2WS a. a. O a! oi ai«rt s
_--J8£C;88CM88888888888888 88 8888888888 8 8 888 88888888 8UUMUHXXHHXHHHHUHHHHHUHHH I HH I HHHHHHHHHH I H I H I HHH I HHHHHHHH I I H
XHOXO3H
OCMtcMCMrHcMrH^^CMCMiovDncMflOrH^OOCOCMVOvOrHOOCMCNWrH
O O U O f t , g h e 2 & « 0 .CJUOOOCNCMOUCNUUCJOOUUUUHXXHHXNHUUHHHHHHUHHHXHHO One* Cnp-
nnhiaunzu«MXP<znzHpfi<iiOKK KQMHSHUUUUUUUOUUuOiilUUUUObUUUUU UOulOUOHHHHHHHHHHHHXHHHHHHHHHHHIHHXHHH
r- o vo vo cn n n oo <n t n t P» ui m n n >o n tj\ r» n 1-4 n CT> O >O n en CM PItN n o\ CN CN in in \o cnin co n n 00 o\ r~ co 00 r» *» CM n CN O r~ CN H m r- \o CNHH^MtmininiOMOOKjHiouiuiuio «» m m o m >o vo fr-t •-( r~
u iu iu inn inu io«»« r»<Dcooc> CM CO inn cMtH
F-lrHrH
p-r-p-p-nntrHcncnooocninineo
vV u i £ u o UJCJC U (Oc a u u m m « u u «K<. 1 co 3 :oo u u(BDDDQcococooatocn
CMCMW cMrHcMOOno co n rH vo oo n n w m rH cn <n cn CM n n rHP- P - m rH On P-rH rH Cn VO CO P- U1 rH VO VO VO O» m OOP-rHCMPIO
rHCMrH rH rH CM •-< rHrH • rH rH CM rH rH CM rH CMXI XI
COCO OJ3 OC0 3 O UJ3 03 O 3 CO O CO H H i l CD m W
MOlOOnOHtMOHOHa\ m m n ^ m o r^cri oo m toocn n
m CM rHCMCMCO VOp- no
VO
Or~ncM
von incN CMcMcno m•-t oovo
f CM PI CD o P i t cn vocomoovo in CM t i n ncM o OOCMCMCAI H I H I H rH CNn CM tH CM CM 00
O00 HO>cMcn ncM
vo
CM en o vo *T oo rH CM oo o •-* n CM «H O O O •-* o t o *-tenco M O i o n c u i ovocnr^cMvovo c n t m nt o covorHmtoo cn in <n >H-H cr> •-• M ^ H M
CM rlHHriCNPI OHUlHtOnM HPI
nincMcMtnorH 00
on cn voco inn mrH OCOCMCMCn OCM CMrHCMCMn m
ooo HCMCP n
vooocn vo vo r-r- rHo mm in
4f-HOVOf-HCMrHVD«-Hfm vo n o vo•H 00 OOrHCMrH CM w-tr-lm
oop.0 0 CMCM
conoooo
cnOCM
voVO
vocnp-ooo m n nnrnnntoco rH CM coot~i vo p* cn n t ft n r^ n
rHrH
IOOHHHH
pa ca pa *J ao o
! ^a o o O«H4Jvoo o o o c a>, • • • o t oOHHHOrH4J I B I
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2038 Nucleic Acids Research, Vol. 19, Supplement
m<UOOUJQC1Q g
OCM 0 0uzuu
Pun H OO(on<<<uwin mm rax
888888 8888§8 88HHHHHHI IHHHHHMIHHI
10 OU CM H CM rH 03 U— - — H a. zm >-> HCM^CH* rio H co
CMCDOCD oo o mm ct <-t VD rH o <*> t» r- r> oo
lOcMrHin UWCMOO r»r»r~CM r»oNnHOoamrtinnnoiOinO O o o m n n i H O
WmOJMOlHLiHPPpUKOOrjrlrilCrln
jsb§is5sSsSi8sI HHHHHHMXHHHHHH
O»CMHOooK X I
O
OI H I
ooarHo88888888888i888888888888§§8 88HHXXHHHHHHHNUUHH>4HUMHHHUUUUHHNHU I W W
D N 0 I OrHrH M T I H O O r- MOO CM r-en r» rn n r»r» m« in«> nHnn^oiou i CMC- mmcMn o v n o o n H n n mHoooooocMtMiHcMOr* O O O H H H UICM CMCD*»oootrinnnnnooooooooooHOOOriooooooHnnnH
3 r lHNM rH
xmxmDtdCM
a,CO «C Om a ^ i m
OD~0oS5gooSB3655c5885SieSoSoSS8BSoogHHXHHHHHHHHHHHHHHHHUHHHUHHUUHHHHHUEd I
>n r-o> CM co CM on n moo vo ini-CM'ycM tHCM wc~nooco«3 inincMO> CM>n H«H rooom » < » H n » i o ^ n m n inmr»cM^ n ** CM
in *n in
O c n CMCMCM On *
rH PI 00
I D C M C M O CM>O cMn r«(M i n n O V D i H * cMincMO inCM CM vo n i -co r - I H n en co en vo en CM in •» * en * * *r vo co CM m >o CM CT> C I * H C
CM VO rH CMrH rH
c*cor ^ CMin in
OOHHOOOHrlOOOHHHHHHHHHHOMlOOOHHHHHivomvocM o oncM r-ooooo^r oo con ^*-i CO rH i n CM
in *»
C M ' *
m
en
O Oooao
oCM
CMSOrH
ni n
CMIOOO»*ooe»)CM
rHCMin
CM n m moo en ID fiCM CM 00 PI
CMflCMCMrnrn
cn
m
CMCM r H P I r H CMP) rH CM CM in m
aOuo
oa{?
- 4
a)
UO
o
4J
mommCM
UHu -
si
Iswm ok.S«4 C4J(C « ]C-4 I D>B
?5
O
0
I•H CM
I
VDOoOP.a
co
CQrH
nmcam
C^i
CQ CO0 3 OQ
12c c
a ximoo3 C C
xiximxioo mo oximo
a c
XIXI mo o m
i n I4J I I I CV4
O«BPl C*» CrH
mo hO q p rH 14 CUn B i
I I I
(Qma
Ua iK I
. -B I
>XICO
a -
m c
8*
CM i n o inn oo
VD VDCMvOVOrHrHrtrHrHO O " O O
r51? QI rH<MI 0 4d UH a 14
oo an C->oo co artrHrHOO
o)rt 0.
ox:o
<r — .oorH X <P<0«H 8 fflrHH i p , TJOX c £7 o I-J
o -cm* ao u*j ao B-H 0 B »cc aBrf-• OrTHfVO
MO»4OHO O H O PII-3 «C
Qi3i7<
ggoHa p<an aw [7 {7
< rtCO*-!
rt asCM<H 1W4J
3 ~ ~J
CMCMrH rH
w w
to -rfmQ3cM-M2 144J U
'I" - -p< HH17 »Hr4 01 0>
a
• O i l
rHrHB a
aXI
r-co3t~rH-O a
3 ~«4JOrHson«HO
%t<u au u tsO O O I
ia ts IB
o o <M- I -HU V<• C H H O
Q
it)6i3i)D4JC4JOiUDiaoiaXj
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2039
X X*» o. u<<< < x x x w co< < < H cq p.r-mo: mm gicuo, E-I2QPM<< m o SS9x m<m S o nv><i < 3 5
J8888 §§si888§§i§i§i8i§§88888888 §i88i§88S8ili§8888888888IUUUUUIHUXHHHHHUHUHPaHUXMHNHHHHHNHINNHHHHHHXHHHMHHHHNNHHHMNI
o r-i o CM r» *» O o r i n o « >
ro co vo rH co H O n
NT CMrHMOd! <MM r~m< fflMX 0. 0* fn « Q P.* O X CO. X CM O"
h Q
HUUHUH I WWW
pcMOUOOOU
r-ir-ir-ir-ir~ir-ir-ir-i
co r~ IH r- ouiHHO
-ICt O) <N r-1 i-l CN CM
or: u] o-cc
rHCM -CMXI
m ox: 3 x: u o oa o o <D a)«) m < o
d mmid S u£ x « co: o <a pn oca o o n eSmn3
(N OCMn m votco IHCM
co r*ao r*CMi-ICN CMCO
o> vo co i-i i nmm or - co
CM r-i mr i r -O CM 00
rHCMOO)
\o in I- WtN O f- VOVOCOOr-lHO r~cocM>ovD
•H CM'H'H'HCM
oo^r r-ooo VDin i t I-I m <H CM oo
COCMO >oin r~o I O O inco coCM r-i CM CM
cor~inococnvooCOi-Ht-OCMCMi-m1
CO«t tHCM Cft CMiH
CO VDCMCOrHr-i aocn>ncM
CM CO O 00 CM m r» *t >H inininHOnoHCM rH rH CM CM r-l CM
1-Hr-rHCft^rHrHrHfin CMo nrHCM
00CMr^
vocftcocor- TCM t m
nrHCM o n CMCO t 0*t rH rH CMCM i n ID COO CM
rH rH
CM00
moi n
o
nrH
IHnriHOHHHHfflOinOO CM lO I -
o « vo in coCM CM CM
rH rH .-I rH CM iH t-l H CM CM CM iH CO CM CM CM rH rH CM CMr O i o f o o o o o n
CO *» rH CM O> CM H rH rH rH rH CM rH rH rH rH CM
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2040 Nucleic Acids Research, Vol. 19, Supplement
PICM
zzzzz
8888888H88888888888888888888§88888888888888888 88888 28 888888
. _ . . . . ..._IHOOOOOOHHOCMOOOOHOP|f— — - _ .X<r>XXXXXXt<*?XXt<XXKXXX*iXX>X>>'?>X>XXXX»lX^*iXXX I XXXX>-> I
z mW N ts) W
w w o.< omcomomooxSbuDbbOOoDorjbootjuDcJOoODSoDDbbcJabDDQUObDOD DoOOD OououoouuuiNHHHHHHHHHHHHHHHHHHHHHHXHMHHHHGHHHHHHHHHHIUHHHHIHHHHHHHHHUU
CM PI H
«JC O V j S O U O U 0 O$m_m^<^%xaampmxmmn^$c^xmxmmxmx^miiamm^n^'t<'iimmm5iDXXX'im^^oxoxtiri
H«M tom u i ^ o o < r H < * O M n PIOOV o H o ** o OT» r-o oo x> o PI m PI Pir-o>o **oo>o vo o in oo oo >o in C * P I P I O PIO> *» t CMH* *m PI VO H tOPICM H PIHCMCM CMH H N H r l * CM VO
1 H H H H O O CM yO O 1/1 H H CJ* H f—I H fH H O H H PI H H O H H H O f** H H PI H H H O in O H ^ O H H Cf* H H Ot 00U1H00 fXOPlOCM 0"> H O H O O 00 00 Ul CT1U1 H O CM CMr» P I vo C M O O i - P i i n o» m m CM C M P I ^ * CM m CJ> in in HP I H C M i n P I P I H ^ t H P I ^* H ^ H P I
CMO mcM<OOOCMCMCO00CMU) H O H
PI r-00 o> o PI PI
mCM HPIHP IH HU1PIP1 H«»H t Ul H CM
xi vi xt XIXI xi ximm xixt xio v i u o o o o m m o o o
81
PI K a c m mi£> H PI o C 0 ^uiomO 0 i *S C O t H O i 9 l in fH O +J • Cft O* H QTt
4 X) >0M> **>U 3 fHH C Hffl g, O O «)O& C T-> •">0 3 O <t
tP O 9S O Wm VJ c < H 8 H M
O - I « +J «•H*J "O C C C C
M e m C H - H I ? « <9 > 0 '" s« '" H p . . _ H O S H _ 2 * 1 " °
O • O. C << 0 . Z H Z OOfH > -o >-i *J p -p B •« A f O t d j p , o i i j j a u o <p o a aj<Cffl<o>
I I K U I I I i l l A l Oil O i K I M t j i i £ « i l MSB I I I I I 4 3 I I I U I I I H O I - H g t O tf - • Pi V4u> sj M iam«a a m to to - m<a ooi ismu
>J -y t o o PIH . _ _ _ . . oi . ovo PI-HQQ O a O • -I-I H CM tu < OO fjv O O >MXIH CMCOf fit < O Li O Ot UCXViK a ft a fH -C-C MM M O M-^ OB • CO M » (n » C >0U «) O H H • M H « M • VI 0 Vl OV- v pz - z uu? an -rt - MXXOX: - » a x xx m * > vi:
"iNueammafa<2i<ifCi<3HDa >m<3uusuunaoaax >•<'<•< <si<ei1 -Qi<0|mw«« js9<i<si (O O H S , > > H I ( gVd AV33HO oo-pogniCBjJHvS^^^^MiTJ^koal^t?^ vi ;rjC B 4J u *> Ori B.B a * . - .K 1_
• f<HfH>,£,eceec IXCXX-H CHijimHtiMOHHHHHeiiuiixBSs^iOHdHfflj j -oa* j i i i tHitcmD^^Wb>MT3lJ4JWXia04aiJ'3&VlMVlO>VlVlJ<^^^^^VlW4J4JM4J£OTJOpauaiBB3^ ftVlVl«Vi-HLn
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2041
aCM p i SB SB 00 Z !
OUcnO (
l < n < K K O U(;hMWX3fi«Ha,
)O UQUO XOXi22uu23°o8HuoSSyS|SSS2£^*5m2«m«gwa;«(>iaiiio;S,2. b=S.««nSSoiigrt:<MMUufi,xsiOOODOOOOOOuOOOuOOOOOOOOOOOOOOOOOOOOOOOOD* OOOOOOOOOOOOO^OOOOOOOOOHOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO* SOOOMUSOOUOUUHOOOOUHHUHXHHHHKHHHUHNHHHUNNHHHHHHHHHHHIilHNHUh I HHHNZHNHHHHHHKNHNH
O r H o i n v o < o i i v D C M O i n c o i n . . c M O H H _ o o H o v o r o o o o H > c M r B N o o o c M » M oOOOOOHOrlOOrtHOnOHOOOONOONPIriOOrtnuiPIOOOOOHHIINOOOONOOOHHONnHrlHOO>^>xxxxscxxxxxJExx«>>;>xx^jdJGX>^xxxa:>>>^xa:acuxxM>^5E^«xxxxjcxxa:x^x
32O
O« x < co m <
C-.^HtflOi'-jJJiJC-,-u ooco co to m < A rH w m m M
<<U PPPD!____JPSr~Wff!pSniC4Ci<0- &a ~
O OOOOOOOOOOOOOUOUOUOOUtJOooDoOtnOOOO OOO (BE-U I UUUMUMUUCdHUHHUUHUHHUHHUUUHUfdXUUUH I H N H t l l l H H H H ^ H H H H H H H H H H N U H
OB Cin rH r~ inco r~vovDrHt~rncMrHrHeDvocMvocNrHOror-inor~ocMinmcN(nrocNroiHto<_)cNinor.oorooa>rHincMOOvD^rHincMCMa*CT* CM »H r- r~ tn^nc inHcotDnnooooor^^ f f io r j^o tHf - in i / ) toiohvoor* r~ w CN »H (N fH (*i m ao CT» *O r* *o O H O > OrH rH CN " * t VO »»CM * * f ("1 >X> T» 00 <H iH iH ^» i n U> >H H If l CM O Ul O< CM rH CO«»
Hrt H CM <H iH
1-4 o ^ crt r-* \o t• CMrHiH CM
o o o »in in in in I-H r c» t-* v© ro *o n ^f co r~ iCM CM CM CM
x; o 3u m o « g act 1aa BJ5 S X c 3 :
to JTXIXIX: 0 0 a- i o ©CBCDOWO
CM r-l CM CM
am u x:
* H r* r- 03 vo fH vo *-irH i H CM CM CM
o o ox: x:mcouuK
CMCMO00CMIMHHH«
O00
n
OO0 0 o CNi-ICN in CNrocN >n o
ao in oO» i-t 00
O
>*(NHHHnO Mnr^<»<fMHeoHriHini(iinuntHiM»ii<)M CMrH rH CN W CM PI CN iH rH r~ CN iH rH CN
D r- m in o CM eft *0 co r* m oin non4r4Hi£NOin o in
VOCM
CMC)coco
CM
CM n um i n
^-\
O Nin rH{ * )
vorH
i_r- cof-HCM i nt-\ rH
COCN
OHHOHHiVOVO
IHOOOHIO r o ro o «H in coro r-t m r- ff\ ro in r*CO f-H CN «H ^ ^ in
I-H co T-t r-< 00 vo vo r-vooinonaiaiOH CMOOO•H CM o o H m in r-ocn
HHriH CMCNCM
<-< n o * * o cnff>oommm >o coin CT> >» eo vo r~ cn vo en r- rH mmm mincod) cnocMm rH rH
CM rH CM rH f rH CM CM
OO rH >,>,
LOJJ 4JlO-rl-r|rH 3 3
o c c
oca im c Ovo& HJ5rH
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
2042 Nucleic Acids Research, Vol. 19, Supplement
tA JE W fc« 0* Pi O I
8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 2 8 8 8 8 0 8 8 8 8 : . __^WUUWWWWWWWWWWWWXUMWWWWMWWWIWWWWWWWHI
M Ma wtu «2CQ QO.
8
oo VOCN >oio vo«» i-ito.-io r- 1 mm u i n n r - r-*coen otoatcMOcoaoontotocNtoon
8 8 88 88888888882888I M I W I H H I H H H H N H H H H H H H H H
atrocMOto*m^cNc~oOiHcncncocor»t»)i-ict<oom
_ JUUUUUUUUUUOUUUOmUUOUUUUUUHHMHMHUUUUUHHHHUHXHHUHHHHHH
j in O o o r - r» 5f cn O co o to m oo m iH CM CN CM 00co co oo oo oo oo oo oo oo oo oo oo oo oo oo cn oo cn at at oo oo oo oo at oo cn oo oo oo oo oo oo oo oo oo
t cn en at at at at cn at cn cn cII-IIHIHI-II-II-II-II-II-II-II
iHHHHnno^p-ototr-noot^WHHto
(,n^yh.Q.ft,p<fti<S3w>x«E'Dt.tiUmOmOUOUUUCjOOOUOUCOOOOmHXHXHHHHHHHHHHHHHHHHHXo o o o n f o o o00 01 00 @l 00 03 V 00 GO 00 00 00 00 00 00 00 00 00 00 03 00 Ot
** m r- o <T> at »-* 00 n CTt fo oo »H f*> oo oo en m n oj c» m at rH CN *H CM r-* ro fo oo CN en *D nV O H O M ^ H tooooo HIOH
^ CN CN CN CN CN iH H CN
oOQx
CNCN--ICN CN-H CN <H
x:x: x •-• x: u to JC ui cauumUHUo :«u< im ca x m o co ca z OQ z oh i b ' O ' 3 o H o <
CN rH i-l CN rH fH
.c u u o JS o om cn oo in < m m CQ
CN CN »n M
o w wO « «<co<
r-*-H CN m c*> or-oo o ^ o i
• i-l rH CN r-( O)
o mocoueoS o n z z' a x ' c p
omO
m oo at to r-<H r i i-lrH
or-cNOtf)CM
c~ a tm\o
o m<N rn t- torn H O cNOtooo*O ao m H O I*> CM at at CM *fr-l i-l i-IPI O CM CM CN m
m CN CN CN r~ m at H oo ooro CM r-^ at f*> H to to m
CM i-l (NHHrt
HODHatn^moocMOiH i-l r ) CO CM
tocn
IO cn•-I at
CM
mtooooocoioo 03 CN
CN f ) O
CNf-tO<HrIPl
CMatCN
CNiHriiHrHtO iH iH CO CM H CM iH iH «H i-l CO iH
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Nucleic Acids Research, Vol. 19, Supplement 2043
O kSXCOSOQ«QQ «rfxo6oi
.diginZHUHHi
i nOrt-ioiO
UlrHrHrH rHCNiO«Ka.t-lOW(-,ZZ2222U
ooooooCD IN mxtSSSSSS( N C - I O C N S E C - I C - J C - I I - I O H P H
Z
\J M W CO Z H w U U U U U U W K W DC JM W H W W W W H tf rf CU CN tn w fC t-H *^ M M
888§888§888888§88888888888§ 88§§88888888 S 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8HUHHHHC4HHP4HHHHUHUHNHHHHHHHHMM I HHXHHHHHUHH I
H0lNl/inOrtHN(N(MC»OOOnOl-OO<MM7lffl0>-l«'-PHn-l*1P1
ux>x>r.xxxxxxxxx>xxxxxxxxxxxxxxaSxxxxxxxxx
I HHHHHHHHHHHHHIilHHUXUH
i c 0
oo v_n» rH o CN H rH PI o P-PI rH rH a> *»• I M I O H HooooooooooooriooriooX>>XTkZXMX>ikZXXx
rHrHrH OCM-IOlOI<4<PQ(O -> rH rH rH rH CN |. o u U < a a «PSX to H £HH E_ (_ B o.
ocQQoirf x o o o O o o a _-_-_-£I J U U I O U Z H U U H U U U U
X O iKXfn H
K t X
z > P < n } JO OOOOOO*a.CNOi-ltK XX XX XX O
l 2 C l 0 C a 0 0
HO0< CN O
IHO-I s>. Z
OUUUUUUUUOUUUU UUUUUUUUUUUUUUUUrtUUUU
UHHUNHHNHHHHNNIHHHHHNHHUHUUHUHHXHHHHHHNHH
Q H X W « i M M , . - - , , _ ^
OCNOWOOMWOOOCOOUOOUOOOI EdXHMHHHrHHHHrHHUUUUUUU
u ) u ) h rH a* **•P-to vo\ovo 10»o *-*PI »oCN incN
<»-* t-l f-* tH tH rH
CK CJl W "H "H <J\ CN <O U> U1 ' t 00 ' t U1 U> CT> CO tH 1/1 »f t -^ O U 1 H r > fH 00 *H Ot CN rH rH l i l
01 CM vo r* u i n h
oo CN »-4 KO GO *-f ro <y» irt n i/i f-« CM in cn CN oo «H oo o oo m i/> m in in in vo CN oo f-< i/i O <NOl^MOOlOlOO CM HHrlO) OU>OHHHHHHHHt~HUIOr>0CN >H rH rH iH CO CN <N •-< CN rH CN rH CN rH rH
CNCNCNCNCNCN
o o m m cn oUfficnxx DO
men ix^ x
n x: ox: o j s o o w s o o o o o o D< < 3 CB< coca oa< 3 < 2 < < 2 S o goo mm z c mm mm men c
OZI
n: cc.ee.
< < 3ZZZ
o o 0505« 05 woe a: « cso < o>coca oa< 3 < 2 < < 2 S o goo mm z c mm mm menII'ZZiZZZZZIZIIIOBbfiiii
oamw
m r a\ *-iHOtHnOHHHnn
rH CN rHO r H O
<w 2 « t»< 2 2 2 xZhlOOZZZZb
ooooooco»»m»»oooooinr r iOrHin oomcNco minm rcCN MD*O cn^cn ooo^rO rHCNrHCN rH rH U1
in cNcno n CNCN
aomcNOOrHaincocoHOoo4e(DcMOinonin040Oinu>«r«>otcMnronnaHaiuii/iOHMHnoocN9ain mcN r - t o in o UJ 00 CN rH m CN r» orort co mat o i o n vooom M P I H I D H U I IOIXJ n m CN o> >o CN '»'» oof oo> (DOMOOro » 0 0 m vo aooo IHCNCNCN «^r into r~vo oorH 01 oono*nMnN<HHonfnnHin«o rHrH
f-i *t f*l rH rH rH rH rH rH rH frj rH (N rH rH rH rH t-i w-i r-t r*
ilTlCQr-tr-tr-ir-ir-t
vo m inCN CN in
tHHHHHHHHHHHHHHHHHHHi IHHHrlrtrtHHHHi
4-1rlHrlC^OHrm inrH VO
CN CN CN CN U) rH CN rHO OS >O "* PI P-rn CN't rH rH CN CO CM rH PI »» O1*1 rH CN f-ir-ir-tr-i r-t t-t *-i *-*
per
•do.
c•A
inP ICN
P IX
uzurf i
oB
CNP IVOi-H
O
omVIoa
oB
COrHP-r H
o
1
mm
new
xxaao . oE B B
*^ f-i r-\•HOOC*l ^D ^DOrHrHO O O
1
Ima
•O
O
I IB rH~ I O I
i_ 0
o m •cd m JJ4J 0P10 S VlrHc 5 aw+> B >_B HO.>t O i0> -rlrH
Vl U
% S -S OCN•P CN1 amrH * l (7 l>i ocNincno « (_•+J VO rH rH rH rH CN 8 1_ l o a s o o o 4J vi
3X1 -r)0 1 < . • _ • - _
a-a'«xixixtxix.xfdd>cr>c « o_c i
0.
m
OrH
•
D)rl
CNP -
cr. W« H
(DOW0 OrHa «_« 4 >•a «-rt£41 0.am
&s0 ftB 0-H Vl
^
ft
Ml_l
p. q]
VI
ote
60Vl
ao0
o4J
oB0o>0aa5VI
& .
B0
VoB -r.
1 1 1 1
_D
O0
fta0-r|
1 Xt 1
B
a0
PC 3 d-5 ffl E
B
X0
o
3i0ao
xtUB
vOCN
• • < -
OOO•rlrHa x
> i OX
VI•0 0
r01 *>
-HX0TJau
3 rf-rl
oB
uB eg3 O
1 o<8 oo u
ao
>,a aoU
B3
ou
0oB
oo
oCMOP-PlrH
Om0
r H
o• r |
_
aJ JO>BO
rH 010- . >,a ao oo oB B
O O OrH O Om
_fA
ftB0
r H
1I
Vl
us
aO U >
OOincorfrHrH CNPTtinrHP) rHrH SB
0B0Cn
0
0>
Bin
i O
rHPI
2 23t
3 3 0
0B
-HVl•O10B0
sel
51 D
S
K^ i
O r Hj ' a f ^ a o a m o o n - i a K O - e . B Vi « wa
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018
Downloaded from https://academic.oup.com/nar/article-abstract/19/suppl/2023/1058067by gueston 15 February 2018