table s1. list of 2110 prokaryotic genomes used in this ...oksana/phd_thesis/supplementary... ·...

73
Table S1. List of 2110 prokaryotic genomes used in this study. Each genome contains information about genome size, number of genes (NCBI-genes and Prodigal-genes), and fraction of genes without hits to any database, containing Hidden Markov Models. Table is sorted by Superkingdom and Organism name. Bioproject ID Superkingdom Organism name Genome size (bp) Genes retrieved from Genbank files Genes predicted by Prodigal Fraction of genes without any hits to any HMM database PRJNA66875 Archaea Acidianus hospitalis W1 2137654 2329 2484 26.03 PRJNA51395 Archaea Acidilobus saccharovorans 345-15 1496453 1499 1503 18.72 PRJNA43333 Archaea Aciduliprofundum boonei T469 1486778 1544 1551 16.38 PRJNA57757 Archaea Aeropyrum pernix K1 1669696 1700 1720 21.67 PRJNA57717 Archaea Archaeoglobus fulgidus DSM 4304 2178400 2407 2518 16.28 PRJNA43493 Archaea Archaeoglobus profundus DSM 5631 1563423 1823 1873 24.19 PRJNA65269 Archaea Archaeoglobus veneficus SNP6 1901943 2090 2153 19.09 PRJNA58711 Archaea Caldivirga maquilingensis IC-167 2077567 1963 2077 20.79 PRJNA58601 Archaea Candidatus Korarchaeum cryptofilum OPF8 1590757 1602 1683 16.59 PRJNA61411 Archaea Cenarchaeum symbiosum A 2045086 2017 1923 39.92 PRJNA75119 Archaea Desulfurococcus fermentans DSM 16532 1384116 1421 1467 21.26 PRJNA59133 Archaea Desulfurococcus kamchatkensis 1221n 1365223 1471 1419 23.22 PRJNA62227 Archaea Desulfurococcus mucosus DSM 2162 1314639 1345 1371 19.66 PRJNA40863 Archaea Ferroglobus placidus DSM 10642 2196266 2480 2599 20.10 PRJNA162201 Archaea Fervidicoccus fontis Kam940 1319206 1385 1345 19.49 PRJNA50305 Archaea Halalkalicoccus jeotgali B3 3698650 3873 3929 28.89 PRJNA72475 Archaea Haloarcula hispanica ATCC 33960 3890005 3859 3908 26.91 PRJNA57719 Archaea Haloarcula marismortui ATCC 43049 4274642 4240 4456 30.57 PRJNA61571 Archaea Halobacterium salinarum R1 2668776 2883 2771 29.15 PRJNA57769 Archaea Halobacterium sp. NRC-1 2571010 2605 2694 28.98 PRJNA167315 Archaea Haloferax mediterranei ATCC 33500 3904707 3863 3949 28.10 PRJNA46845 Archaea Haloferax volcanii DS2 4012900 4018 4024 27.52 PRJNA54919 Archaea Halogeometricum borinquense DSM 11551 3944467 3898 3952 27.96 PRJNA59107 Archaea Halomicrobium mukohataei DSM 12286 3332349 3349 3387 28.31 PRJNA72619 Archaea halophilic archaeon DL31 3643158 3476 3754 31.09

Upload: lykiet

Post on 15-Sep-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

Table S1. List of 2110 prokaryotic genomes used in this study. Each genome contains information about genome size, number of genes (NCBI-genes and Prodigal-genes), and fraction of genes without hits to any database, containing Hidden Markov Models. Table is sorted by Superkingdom and Organism name.  

Bioproject ID Superkingdom Organism name Genome size (bp)

Genes retrieved from Genbank files

Genes predicted by Prodigal

Fraction of genes without any hits to any HMM

database PRJNA66875 Archaea Acidianus hospitalis W1 2137654 2329 2484 26.03 PRJNA51395 Archaea Acidilobus saccharovorans 345-15 1496453 1499 1503 18.72 PRJNA43333 Archaea Aciduliprofundum boonei T469 1486778 1544 1551 16.38 PRJNA57757 Archaea Aeropyrum pernix K1 1669696 1700 1720 21.67 PRJNA57717 Archaea Archaeoglobus fulgidus DSM 4304 2178400 2407 2518 16.28 PRJNA43493 Archaea Archaeoglobus profundus DSM 5631 1563423 1823 1873 24.19 PRJNA65269 Archaea Archaeoglobus veneficus SNP6 1901943 2090 2153 19.09 PRJNA58711 Archaea Caldivirga maquilingensis IC-167 2077567 1963 2077 20.79 PRJNA58601 Archaea Candidatus Korarchaeum cryptofilum OPF8 1590757 1602 1683 16.59 PRJNA61411 Archaea Cenarchaeum symbiosum A 2045086 2017 1923 39.92 PRJNA75119 Archaea Desulfurococcus fermentans DSM 16532 1384116 1421 1467 21.26 PRJNA59133 Archaea Desulfurococcus kamchatkensis 1221n 1365223 1471 1419 23.22 PRJNA62227 Archaea Desulfurococcus mucosus DSM 2162 1314639 1345 1371 19.66 PRJNA40863 Archaea Ferroglobus placidus DSM 10642 2196266 2480 2599 20.10 PRJNA162201 Archaea Fervidicoccus fontis Kam940 1319206 1385 1345 19.49 PRJNA50305 Archaea Halalkalicoccus jeotgali B3 3698650 3873 3929 28.89 PRJNA72475 Archaea Haloarcula hispanica ATCC 33960 3890005 3859 3908 26.91 PRJNA57719 Archaea Haloarcula marismortui ATCC 43049 4274642 4240 4456 30.57 PRJNA61571 Archaea Halobacterium salinarum R1 2668776 2883 2771 29.15 PRJNA57769 Archaea Halobacterium sp. NRC-1 2571010 2605 2694 28.98 PRJNA167315 Archaea Haloferax mediterranei ATCC 33500 3904707 3863 3949 28.10 PRJNA46845 Archaea Haloferax volcanii DS2 4012900 4018 4024 27.52 PRJNA54919 Archaea Halogeometricum borinquense DSM 11551 3944467 3898 3952 27.96 PRJNA59107 Archaea Halomicrobium mukohataei DSM 12286 3332349 3349 3387 28.31 PRJNA72619 Archaea halophilic archaeon DL31 3643158 3476 3754 31.09

Page 2: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA68105 Archaea Halopiger xanaduensis SH-6 4355268 4221 4264 28.60 PRJEA162019 Archaea Haloquadratum walsbyi C23 3260476 2987 3090 31.64 PRJNA58673 Archaea Haloquadratum walsbyi DSM 16790 3179361 2862 3071 31.42 PRJNA59189 Archaea Halorhabdus utahensis DSM 12940 3116795 2998 3045 25.38 PRJNA58807 Archaea Halorubrum lacusprofundi ATCC 49239 3692576 3560 3643 29.89 PRJNA43501 Archaea Haloterrigena turkmenica DSM 5511 5440782 5113 5225 28.50 PRJNA57755 Archaea Hyperthermus butylicus DSM 5456 1667163 1602 1791 23.99 PRJNA58365 Archaea Ignicoccus hospitalis KIN4 I 1297538 1434 1490 25.14 PRJNA51875 Archaea Ignisphaera aggregans DSM 17230 1875953 1930 1971 27.30 PRJNA66329 Archaea Metallosphaera cuprina Ar-4 1840348 2029 1992 24.20 PRJNA58717 Archaea Metallosphaera sedula DSM 5348 2191517 2256 2391 22.40 PRJNA63623 Archaea Methanobacterium sp. AL-21 2583753 2493 2516 18.45 PRJNA67359 Archaea Methanobacterium sp. SWAN-1 2546541 2397 2445 20.69 PRJNA45857 Archaea Methanobrevibacter ruminantium M1 2937203 2217 2198 20.66 PRJNA58827 Archaea Methanobrevibacter smithii ATCC 35061 1853160 1795 1774 18.91 PRJNA59347 Archaea Methanocaldococcus fervens AG86 1507251 1581 1625 13.94 PRJNA48803 Archaea Methanocaldococcus infernus ME 1328194 1441 1469 9.52 PRJNA57713 Archaea Methanocaldococcus jannaschii DSM 2661 1739927 1770 1863 17.04 PRJNA42499 Archaea Methanocaldococcus sp. FS406-22 1773136 1816 1855 16.54 PRJNA41131 Archaea Methanocaldococcus vulcanius M7 1761737 1742 1752 17.26 PRJNA61623 Archaea Methanocella arvoryzae MRE50 3179916 3085 3112 21.64 PRJNA157911 Archaea Methanocella conradii HZ254 2378438 2455 2516 20.48 PRJNA42887 Archaea Methanocella paludicola SANAE 2957635 3004 3051 22.11 PRJNA58023 Archaea Methanococcoides burtonii DSM 6242 2575032 2272 2587 19.51 PRJNA58823 Archaea Methanococcus aeolicus Nankai-3 1569500 1490 1509 10.47 PRJNA58741 Archaea Methanococcus maripaludis C5 1789046 1822 1886 15.86 PRJNA58947 Archaea Methanococcus maripaludis C6 1744193 1826 1872 17.79 PRJNA58847 Archaea Methanococcus maripaludis C7 1772694 1788 1838 15.36 PRJNA58035 Archaea Methanococcus maripaludis S2 1661137 1722 1749 11.93 PRJNA70729 Archaea Methanococcus maripaludis X1 1746697 1850 1873 16.01 PRJNA58767 Archaea Methanococcus vannielii SB 1720048 1678 1738 12.91

Page 3: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA49529 Archaea Methanococcus voltae A3 1936387 1717 1718 19.71 PRJNA58785 Archaea Methanocorpusculum labreanum Z 1804962 1739 1825 16.05 PRJNA58561 Archaea Methanoculleus marisnigri JR1 2478101 2489 2484 18.60 PRJNA49857 Archaea Methanohalobium evestigatum Z-7303 2406232 2254 2406 17.85 PRJNA47313 Archaea Methanohalophilus mahii DSM 5219 2012424 1987 2018 13.51 PRJNA52695 Archaea Methanoplanus petrolearius DSM 11571 2843290 2785 2814 22.50 PRJNA57883 Archaea Methanopyrus kandleri AV19 1694969 1691 1812 23.89 PRJNA58815 Archaea Methanoregula boonei 6A8 2542943 2450 2523 19.87 PRJNA66207 Archaea Methanosaeta concilii GP6 3026645 2850 3045 22.19 PRJNA81199 Archaea Methanosaeta harundinacea 6Ac 2571034 2371 2481 19.37 PRJNA58469 Archaea Methanosaeta thermophila PT 1879471 1696 1808 16.15 PRJNA68249 Archaea Methanosalsum zhilinae DSM 4017 2138444 1976 2031 14.15 PRJNA57879 Archaea Methanosarcina acetivorans C2A 5751492 4540 4853 23.69 PRJNA57715 Archaea Methanosarcina barkeri str. Fusaro 4873766 3625 4006 20.51 PRJNA57893 Archaea Methanosarcina mazei Go1 4096345 3371 3447 19.13 PRJNA58407 Archaea Methanosphaera stadtmanae DSM 3091 1767403 1534 1550 15.79 PRJNA59193 Archaea Methanosphaerula palustris E1-9c 2922917 2655 2777 19.39

PRJNA51637 Archaea Methanothermobacter marburgensis str. Marburg 1639135 1757 1748 13.32

PRJNA51535 Archaea Methanothermococcus okinawensis IH1 1677455 1595 1649 14.21 PRJNA60167 Archaea Methanothermus fervidus DSM 2088 1243342 1283 1319 9.49 PRJNA67321 Archaea Methanotorris igneus Kol 5 1854197 1772 1807 14.19 PRJNA46245 Archaea Natrialba magadii ATCC 43099 4443643 4212 4318 31.16 PRJNA171337 Archaea Natrinema sp. J7-2 3793615 4302 3805 35.12 PRJNA58435 Archaea Natronomonas pharaonis DSM 2160 2749696 2852 2849 25.36 PRJNA58903 Archaea Nitrosopumilus maritimus SCM1 1645259 1795 1944 29.69 PRJNA58041 Archaea Picrophilus torridus DSM 9790 1545895 1535 1592 15.45 PRJNA57727 Archaea Pyrobaculum aerophilum str. IM2 2222430 2605 2631 31.13 PRJNA58409 Archaea Pyrobaculum arsenaticum DSM 13514 2121076 2298 2490 28.30 PRJNA58787 Archaea Pyrobaculum calidifontis JCM 11548 2009313 2149 2291 26.87 PRJNA58635 Archaea Pyrobaculum islandicum DSM 4184 1826402 1978 2176 27.66 PRJNA58421 Archaea Pyrobaculum neutrophilum V24Sta 1769823 1966 2050 26.72

Page 4: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA84411 Archaea Pyrobaculum oguniense TE7 2452920 2835 2942 34.22 PRJNA82379 Archaea Pyrobaculum sp. 1860 2467972 2827 2804 33.58 PRJNA62903 Archaea Pyrococcus abyssi GE5 1768562 1784 1902 12.72 PRJNA169620 Archaea Pyrococcus furiosus COM1 1909827 2064 2085 17.31 PRJNA57873 Archaea Pyrococcus furiosus DSM 3638 1908256 2065 2077 16.44 PRJNA57753 Archaea Pyrococcus horikoshii OT3 1738505 2061 1873 22.46 PRJNA66551 Archaea Pyrococcus sp. NA2 1861320 1980 1992 16.77 PRJNA167261 Archaea Pyrococcus sp. ST04 1736885 1748 1822 12.94 PRJNA68281 Archaea Pyrococcus yayanosii CH1 1716818 1865 1872 17.93 PRJNA73415 Archaea Pyrolobus fumarii 1A 1843267 1986 2001 28.62 PRJNA45893 Archaea Staphylothermus hellenicus DSM 12710 1580347 1599 1623 24.89 PRJNA58719 Archaea Staphylothermus marinus F1 1570485 1570 1640 23.43 PRJNA58379 Archaea Sulfolobus acidocaldarius DSM 639 2225959 2223 2355 23.07 PRJNA162067 Archaea Sulfolobus islandicus HVE10 4 2655201 2723 2929 23.87 PRJNA43679 Archaea Sulfolobus islandicus L.D.8.5 2748647 2949 3114 27.49 PRJNA58871 Archaea Sulfolobus islandicus L.S.2.15 2736272 2738 3071 25.24 PRJNA58849 Archaea Sulfolobus islandicus M.14.25 2608832 2609 2879 23.29 PRJNA58851 Archaea Sulfolobus islandicus M.16.27 2692402 2657 2956 23.80 PRJNA58841 Archaea Sulfolobus islandicus M.16.4 2586647 2736 2850 24.79 PRJNA162071 Archaea Sulfolobus islandicus REY15A 2522992 2644 2816 24.19 PRJNA58923 Archaea Sulfolobus islandicus Y.G.57.14 2702058 2905 3050 26.43 PRJNA58825 Archaea Sulfolobus islandicus Y.N.15.51 2854410 2900 3256 26.45 PRJNA167998 Archaea Sulfolobus solfataricus 98 2 2668974 2679 2975 25.59 PRJNA57721 Archaea Sulfolobus solfataricus P2 2992245 2994 3280 23.56 PRJNA57807 Archaea Sulfolobus tokodaii str. 7 2694756 2816 3005 26.35 PRJNA54733 Archaea Thermococcus barophilus MP 2064237 2265 2231 17.75 PRJNA59389 Archaea Thermococcus gammatolerans EJ3 2045438 2157 2172 20.74 PRJNA59043 Archaea Thermococcus onnurineus NA1 1847607 1976 1994 17.00 PRJNA59399 Archaea Thermococcus sibiricus MM 739 1845800 2037 2002 16.76 PRJNA70841 Archaea Thermococcus sp. 4557 2011320 2133 2117 19.44 PRJNA54735 Archaea Thermococcus sp. AM4 2086428 2229 2273 21.88

Page 5: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA168259 Archaea Thermococcus sp. CL1 1950313 2017 2079 20.12 PRJNA58563 Archaea Thermofilum pendens Hrk 5 1813393 1876 1931 23.88 PRJNA167488 Archaea Thermogladius cellulolyticus 1633 1356318 1414 1421 20.00 PRJNA61573 Archaea Thermoplasma acidophilum DSM 1728 1564906 1478 1572 15.70 PRJNA57751 Archaea Thermoplasma volcanium GSS1 1584804 1526 1640 17.72 PRJNA74443 Archaea Thermoproteus tenax Kra 1 1841542 2051 2090 24.63 PRJNA65089 Archaea Thermoproteus uzoniensis 768-20 1936063 2186 2198 24.75 PRJNA48993 Archaea Thermosphaera aggregans DSM 11486 1316595 1387 1410 21.81 PRJNA52827 Archaea Vulcanisaeta distributa DSM 14429 2374137 2493 2570 27.77 PRJNA63631 Archaea Vulcanisaeta moutnovskia 768-28 2298983 2320 2508 27.65 PRJNA58167 Bacteria Acaryochloris marina MBIC11017 8361599 8383 7745 31.75 PRJNA59279 Bacteria Acetobacter pasteurianus IFO 3283-01 3340249 3050 3151 18.32 PRJNA158377 Bacteria Acetobacter pasteurianus IFO 3283-01-42C 3247995 2984 3074 18.39 PRJNA158373 Bacteria Acetobacter pasteurianus IFO 3283-03 3339669 3048 3148 18.33 PRJNA158381 Bacteria Acetobacter pasteurianus IFO 3283-07 3338426 3047 3149 18.32 PRJNA158379 Bacteria Acetobacter pasteurianus IFO 3283-12 3336990 3046 3150 18.33 PRJNA158383 Bacteria Acetobacter pasteurianus IFO 3283-22 3339649 3048 3148 18.32 PRJNA158531 Bacteria Acetobacter pasteurianus IFO 3283-26 3339683 3048 3149 18.32 PRJNA158375 Bacteria Acetobacter pasteurianus IFO 3283-32 3337040 3046 3147 18.31 PRJNA88073 Bacteria Acetobacterium woodii DSM 1030 4044777 3473 3632 12.77 PRJNA51423 Bacteria Acetohalobium arabaticum DSM 5501 2469596 2282 2348 9.09 PRJNA58901 Bacteria Acholeplasma laidlawii PG-8A 1496992 1380 1389 17.59 PRJNA59899 Bacteria Achromobacter xylosoxidans A8 7359146 6815 6749 10.66 PRJNA43471 Bacteria Acidaminococcus fermentans DSM 20731 2329769 2026 2055 10.93 PRJNA74445 Bacteria Acidaminococcus intestini RyC-MR95 2487765 2404 2369 17.56 PRJNA59215 Bacteria Acidimicrobium ferrooxidans DSM 10331 2158157 1964 2064 14.90 PRJNA58447 Bacteria Acidiphilium cryptum JF-5 3963080 3559 3642 12.75 PRJNA63345 Bacteria Acidiphilium multivorum AIU301 4214744 3949 3897 12.85 PRJNA70791 Bacteria Acidithiobacillus caldus SM-1 3237599 3186 3248 22.26 PRJNA67387 Bacteria Acidithiobacillus ferrivorans SS3 3207552 3093 3246 20.52 PRJNA57649 Bacteria Acidithiobacillus ferrooxidans ATCC 23270 2982397 3147 3045 22.48

Page 6: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58613 Bacteria Acidithiobacillus ferrooxidans ATCC 53993 2885038 2826 2908 17.49 PRJNA59127 Bacteria Acidobacterium capsulatum ATCC 51196 4127356 3382 3368 18.39 PRJNA58501 Bacteria Acidothermus cellulolyticus 11B 2443540 2157 2192 11.57 PRJNA42497 Bacteria Acidovorax avenae subsp. avenae ATCC 19860 5482170 4737 4779 11.53 PRJNA58429 Bacteria Acidovorax citrulli AAC00-1 5352772 4709 4877 17.49 PRJNA59233 Bacteria Acidovorax ebreus TPSY 3796573 3479 3523 9.51 PRJNA58427 Bacteria Acidovorax sp. JS42 4585154 4155 4336 13.37 PRJNA61601 Bacteria Acinetobacter baumannii 3477996 3577 3547 21.18 PRJNA158677 Bacteria Acinetobacter baumannii 1656-2 4023106 3824 3836 18.37 PRJNA59083 Bacteria Acinetobacter baumannii AB0057 4059242 3801 3879 19.06 PRJNA59271 Bacteria Acinetobacter baumannii AB307-0294 3760981 3458 3464 14.87 PRJNA58765 Bacteria Acinetobacter baumannii ACICU 3996761 3759 3780 17.72 PRJNA58731 Bacteria Acinetobacter baumannii ATCC 17978 4001457 3803 3945 17.22 PRJNA61637 Bacteria Acinetobacter baumannii AYE 4048735 3789 3793 16.79 PRJNA162739 Bacteria Acinetobacter baumannii MDR-TJ 4042440 3811 3836 18.20 PRJNA158685 Bacteria Acinetobacter baumannii MDR-ZJ06 4011434 3888 3797 19.32 PRJNA83123 Bacteria Acinetobacter calcoaceticus PHEA-2 3862530 3599 3580 13.14 PRJNA50119 Bacteria Acinetobacter oleivorans DR1 4152543 3874 3853 15.67 PRJNA61597 Bacteria Acinetobacter sp. ADP1 3598621 3310 3275 13.70

PRJNA58891 Bacteria Actinobacillus pleuropneumoniae serovar 3 str. JL03 2242062 2036 2076 6.76

PRJNA58789 Bacteria Actinobacillus pleuropneumoniae serovar 5b str. L20 2274482 2012 2129 5.77

PRJNA59231 Bacteria Actinobacillus pleuropneumoniae serovar 7 str. AP76 2345435 2142 2199 7.30

PRJNA58247 Bacteria Actinobacillus succinogenes 130Z 2319663 2079 2139 5.05 PRJNA158169 Bacteria Actinoplanes missouriensis 431 8773466 8125 8080 20.16 PRJNA162333 Bacteria Actinoplanes sp. SE50 110 9239851 8247 8265 22.84 PRJNA58951 Bacteria Actinosynnema mirum DSM 43827 8248144 6916 7061 19.06 PRJNA80859 Bacteria Advenella kashmirensis WT001 4423879 3933 5114 13.11 PRJNA168181 Bacteria Aequorivita sublithincola DSM 14238 3520671 3140 3200 18.04 PRJNA64757 Bacteria Aerococcus urinae ACS-120-V-Col10a 2080974 1726 1943 12.05

Page 7: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58617 Bacteria Aeromonas hydrophila subsp. hydrophila ATCC 7966 4744448 4122 4198 8.80

PRJNA58631 Bacteria Aeromonas salmonicida subsp. salmonicida A449 5040536 4437 4789 14.17

PRJNA66323 Bacteria Aeromonas veronii B565 4551783 4028 4079 10.50

PRJNA80743 Bacteria Aggregatibacter actinomycetemcomitans ANH9381 2136808 2058 2112 10.96

PRJNA41333 Bacteria Aggregatibacter actinomycetemcomitans D11S-1 2204665 2285 2152 14.24

PRJNA46989 Bacteria Aggregatibacter actinomycetemcomitans D7S-1 2309073 2255 2251 15.09 PRJNA59407 Bacteria Aggregatibacter aphrophilus NJ8700 2313035 2231 2200 13.47 PRJNA57865 Bacteria Agrobacterium fabrum str. C58 5674258 5355 5284 11.65 PRJNA58269 Bacteria Agrobacterium radiobacter K84 7273300 6684 6835 10.68 PRJNA63403 Bacteria Agrobacterium sp. H13-3 5573770 5345 5219 11.77 PRJNA58249 Bacteria Agrobacterium vitis S4 6320946 5389 5788 12.28 PRJNA58985 Bacteria Akkermansia muciniphila ATCC BAA-835 2664102 2138 2190 20.24 PRJNA58169 Bacteria Alcanivorax borkumensis SK2 3120143 2755 2826 9.51 PRJNA49953 Bacteria Alicycliphilus denitrificans BC 4835713 4542 4597 11.71 PRJNA66307 Bacteria Alicycliphilus denitrificans K601 5070751 4696 4798 9.94

PRJNA59199 Bacteria Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446 3205686 3084 3153 18.65

PRJNA158681 Bacteria Alicyclobacillus acidocaldarius subsp. acidocaldarius Tc-4-1 3124048 3208 3116 18.99

PRJNA59251 Bacteria Aliivibrio salmonicida LFI1238 4655660 4284 4492 18.31 PRJNA168180 Bacteria Alistipes finegoldii DSM 17242 3734239 3110 3231 23.97 PRJNA45913 Bacteria Alistipes shahii WAL 8301 3763317 2563 3130 22.52 PRJNA58467 Bacteria Alkalilimnicola ehrlichii MLHE-1 3275944 2865 2902 9.05 PRJNA58171 Bacteria Alkaliphilus metalliredigens QYMF 4929566 4625 4838 20.44 PRJNA58495 Bacteria Alkaliphilus oremlandii OhILAs 3123558 2836 2978 13.86 PRJNA46083 Bacteria Allochromatium vinosum DSM 180 3669074 3220 3262 11.59 PRJNA67349 Bacteria Alteromonas sp. SN2 4972148 4371 4364 15.60 PRJNA47083 Bacteria Aminobacterium colombiense DSM 12261 1980592 1876 1918 10.62 PRJNA41053 Bacteria Ammonifex degensii KC4 2157067 2080 2207 14.39

Page 8: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA158689 Bacteria Amycolatopsis mediterranei S699 10236779 9575 9517 16.82 PRJNA171830 Bacteria Amycolatopsis mediterranei S699 10246920 9222 9523 15.56 PRJNA50565 Bacteria Amycolatopsis mediterranei U32 10236715 9228 9515 15.55 PRJNA67253 Bacteria Amycolicicoccus subflavus DQS3-9A1 4863490 4705 4567 16.66 PRJNA58043 Bacteria Anabaena variabilis ATCC 29413 7105752 5706 5939 21.83 PRJNA168323 Bacteria Anaerobaculum mobile DSM 13181 2160700 2017 2049 8.53 PRJNA59219 Bacteria Anaerococcus prevotii DSM 20548 1998633 1806 1837 13.53 PRJNA62245 Bacteria Anaerolinea thermophila UNI-1 3532378 3177 3180 16.56 PRJNA58989 Bacteria Anaeromyxobacter dehalogenans 2CP-1 5029329 4473 4518 17.54 PRJNA58135 Bacteria Anaeromyxobacter dehalogenans 2CP-C 5013479 4346 4469 16.68 PRJNA58755 Bacteria Anaeromyxobacter sp. Fw109-5 5277990 4466 4601 16.49 PRJNA58953 Bacteria Anaeromyxobacter sp. K 5061632 4457 4485 16.50 PRJNA42155 Bacteria Anaplasma centrale str. Israel 1206806 925 1021 22.15 PRJNA58577 Bacteria Anaplasma marginale str. Florida 1202435 942 1011 19.41 PRJNA57629 Bacteria Anaplasma marginale str. St. Maries 1197687 949 1015 19.86 PRJNA57951 Bacteria Anaplasma phagocytophilum HZ 1471282 1352 1295 38.61 PRJNA59135 Bacteria Anoxybacillus flavithermus WK1 2846746 2832 2943 12.09 PRJNA57765 Bacteria Aquifex aeolicus VF5 1590791 1553 1775 11.90 PRJNA49489 Bacteria Arcanobacterium haemolyticum DSM 20595 1986154 1731 1850 15.95 PRJNA158699 Bacteria Arcobacter butzleri ED-1 2256675 2158 2165 12.82 PRJNA58557 Bacteria Arcobacter butzleri RM4018 2341251 2259 2287 13.79 PRJNA49001 Bacteria Arcobacter nitrofigilis DSM 7299 3192235 3126 3146 14.72 PRJNA158135 Bacteria Arcobacter sp. L 2947662 2848 2879 16.38 PRJNA58231 Bacteria Aromatoleum aromaticum EbN1 4727255 4598 4510 18.98 PRJNA53509 Bacteria Arthrobacter arilaitensis Re117 3918192 3439 3721 18.59 PRJNA58109 Bacteria Arthrobacter aurescens TC1 5226648 4587 4821 15.44 PRJNA58969 Bacteria Arthrobacter chlorophenolicus A6 4980870 4590 4644 20.13 PRJNA63629 Bacteria Arthrobacter phenanthrenivorans Sphe3 4535320 4131 4242 14.71 PRJNA58141 Bacteria Arthrobacter sp. FB24 5070478 4506 4628 15.48 PRJNA78011 Bacteria Arthrobacter sp. Rue61a 4968046 4474 4609 13.77 PRJDA42161 Bacteria Arthrospira platensis NIES-39 6788435 6630 6263 35.77

Page 9: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58297 Bacteria Aster yellows witches'-broom phytoplasma AYWB 723970 693 686 31.91

PRJNA55641 Bacteria Asticcacaulis excentricus CB 48 4308776 3763 3834 15.10 PRJNA59195 Bacteria Atopobium parvulum DSM 20469 1543805 1353 1362 12.19 PRJNA61603 Bacteria Azoarcus sp. BH72 4376040 3989 3971 8.99 PRJNA58905 Bacteria Azorhizobium caulinodans ORS 571 5369772 4717 4813 10.18 PRJEA162161 Bacteria Azospirillum brasilense Sp245 7530241 7848 6886 21.63 PRJNA82343 Bacteria Azospirillum lipoferum 4B 6846400 6233 6023 16.21 PRJNA46085 Bacteria Azospirillum sp. B510 7599738 6309 6629 13.50 PRJNA57597 Bacteria Azotobacter vinelandii DJ 5365318 5051 4822 13.62 PRJNA53535 Bacteria Bacillus amyloliquefaciens DSM 7 3980199 3922 4033 12.71 PRJNA58271 Bacteria Bacillus amyloliquefaciens FZB42 3918589 3693 3755 10.08 PRJNA158133 Bacteria Bacillus amyloliquefaciens LL3 4001985 4228 4078 17.30

PRJNA84215 Bacteria Bacillus amyloliquefaciens subsp. plantarum CAU B946 4019861 3823 3917 11.41

PRJNA159001 Bacteria Bacillus amyloliquefaciens subsp. plantarum YAU B9601-Y2 4242774 3989 4154 13.73

PRJNA158701 Bacteria Bacillus amyloliquefaciens TA208 3937511 4089 4001 15.86 PRJNA158881 Bacteria Bacillus amyloliquefaciens XH7 3939203 4190 4001 17.28 PRJNA165195 Bacteria Bacillus amyloliquefaciens Y2 4238624 4238 4162 17.46 PRJNA58083 Bacteria Bacillus anthracis str. 'Ames Ancestor' 5503926 5486 5748 17.80 PRJNA59385 Bacteria Bacillus anthracis str. A0248 5503926 5291 5744 16.96 PRJNA59303 Bacteria Bacillus anthracis str. CDC 684 5506763 5896 5754 18.03 PRJNA162021 Bacteria Bacillus anthracis str. H9401 5495471 5791 5741 17.46 PRJNA58091 Bacteria Bacillus anthracis str. Sterne 5228663 5287 5461 14.07 PRJNA59887 Bacteria Bacillus atrophaeus 1942 4168266 4186 4108 14.96 PRJNA43329 Bacteria Bacillus cellulosilyticus DSM 2522 4681672 4266 4323 18.42 PRJNA59299 Bacteria Bacillus cereus 03BB102 5449308 5606 5501 16.87 PRJNA58753 Bacteria Bacillus cereus AH187 5599857 5783 5671 17.61 PRJNA58751 Bacteria Bacillus cereus AH820 5588834 5810 5674 17.21 PRJNA57909 Bacteria Bacillus cereus Ames 5227293 5330 5462 16.11 PRJNA57673 Bacteria Bacillus cereus ATCC 10987 5432652 5844 5484 17.80

Page 10: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA57975 Bacteria Bacillus cereus ATCC 14579 5427083 5255 5533 14.97 PRJNA58757 Bacteria Bacillus cereus B4264 5419036 5398 5334 15.19 PRJNA50615 Bacteria Bacillus cereus biovar anthracis str. CI 5486649 5558 5581 16.25 PRJNA58103 Bacteria Bacillus cereus E33L 5843235 5641 5847 15.22 PRJNA83611 Bacteria Bacillus cereus F837 76 5288498 5468 5353 15.50 PRJNA173403 Bacteria Bacillus cereus FRI-35 5382319 5435 5413 15.04 PRJNA58759 Bacteria Bacillus cereus G9842 5736823 5857 5727 18.47 PRJNA82815 Bacteria Bacillus cereus NC7401 5552031 5761 5605 17.38 PRJNA58529 Bacteria Bacillus cereus Q1 5506207 5502 5544 15.56 PRJNA58237 Bacteria Bacillus clausii KSM-K16 4303871 4108 4308 12.54 PRJNA68053 Bacteria Bacillus coagulans 2-6 3073079 2971 2979 14.74 PRJNA54335 Bacteria Bacillus coagulans 36D1 3552226 3290 3363 13.63 PRJNA58317 Bacteria Bacillus cytotoxicus NVH 391-98 4094159 3844 4101 14.13 PRJNA57791 Bacteria Bacillus halodurans C-125 4202352 4066 4081 12.74 PRJNA58097 Bacteria Bacillus licheniformis DSM 13 = ATCC 14580 4222597 4179 4304 12.73 PRJNA58199 Bacteria Bacillus licheniformis DSM 13 = ATCC 14580 4222645 4196 4301 12.15 PRJNA48371 Bacteria Bacillus megaterium DSM 319 5097447 5124 5191 15.19 PRJNA15862 Bacteria Bacillus megaterium QM B1551 5523192 5629 5588 17.06 PRJNA159841 Bacteria Bacillus megaterium WSH-002 5075293 5274 5219 17.85 PRJNA45847 Bacteria Bacillus pseudofirmus OF4 4249248 4335 4258 19.28 PRJNA59017 Bacteria Bacillus pumilus SAFR-032 3704465 3681 3699 12.41 PRJNA49513 Bacteria Bacillus selenitireducens MLS10 3592487 3255 3364 13.14 PRJNA162189 Bacteria Bacillus sp. JS 4120406 4240 4020 13.69 PRJNA62463 Bacteria Bacillus subtilis BSn5 4093599 4145 4048 11.85 PRJNA173926 Bacteria Bacillus subtilis QB928 4146839 4034 4133 12.27 PRJDA38027 Bacteria Bacillus subtilis subsp. natto BEST195 4097429 4385 4170 15.78 PRJNA51879 Bacteria Bacillus subtilis subsp. spizizenii str. W23 4027676 4064 3991 11.97 PRJNA73967 Bacteria Bacillus subtilis subsp. spizizenii TU-B-10 4207222 4297 4150 14.57 PRJNA57675 Bacteria Bacillus subtilis subsp. subtilis str. 168 4215606 4245 4221 12.97 PRJNA158879 Bacteria Bacillus subtilis subsp. subtilis str. RO-NN-1 4011949 4128 3950 13.16 PRJNA49135 Bacteria Bacillus thuringiensis BMB171 5643051 5349 5587 15.88

Page 11: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA173374 Bacteria Bacillus thuringiensis HD-771 6438373 6569 6628 21.90 PRJNA173860 Bacteria Bacillus thuringiensis HD-789 6334630 6462 6510 23.04 PRJNA158151 Bacteria Bacillus thuringiensis serovar chinensis CT-43 6151150 6206 6317 19.97

PRJNA158875 Bacteria Bacillus thuringiensis serovar finitimus YBT-020 5682383 5782 5719 17.58

PRJNA58089 Bacteria Bacillus thuringiensis serovar konkukian str. 97-27 5314794 5197 5357 13.31

PRJNA58795 Bacteria Bacillus thuringiensis str. Al Hakam 5313030 4798 5359 12.49 PRJNA58315 Bacteria Bacillus weihenstephanensis KBAB4 5872743 5653 5811 16.48 PRJNA82341 Bacteria Bacteriovorax marinus SJ 3435933 3249 3280 26.34 PRJNA84217 Bacteria Bacteroides fragilis 638R 5373121 4326 4433 18.66 PRJNA57639 Bacteria Bacteroides fragilis NCTC 9343 5241700 4308 4352 20.47 PRJNA58195 Bacteria Bacteroides fragilis YCH46 5310990 4625 4416 22.33 PRJNA62135 Bacteria Bacteroides helcogenes P 36-108 3998906 3244 3359 17.90 PRJNA63269 Bacteria Bacteroides salanitronis DSM 18170 4308663 3641 3782 26.05 PRJNA62913 Bacteria Bacteroides thetaiotaomicron VPI-5482 6293399 4825 4943 17.52 PRJNA58253 Bacteria Bacteroides vulgatus ATCC 8482 5163189 4065 4218 18.92 PRJNA39177 Bacteria Bacteroides xylanisolvens XB1A 5976145 4407 4844 20.62 PRJNA58533 Bacteria Bartonella bacilliformis KC583 1445021 1283 1239 16.38 PRJNA62131 Bacteria Bartonella clarridgeiae 73 1522743 1386 1256 14.88 PRJNA59405 Bacteria Bartonella grahamii as4aup 2369520 1768 1994 20.12 PRJNA57745 Bacteria Bartonella henselae str. Houston-1 1931047 1612 1631 21.34 PRJNA174512 Bacteria Bartonella quintana RM-11 1587646 1203 1359 14.95 PRJNA57635 Bacteria Bartonella quintana str. Toulouse 1581384 1308 1313 16.98 PRJNA59129 Bacteria Bartonella tribocorum CIP 105476 2642404 2154 2310 25.99

PRJNA58111 Bacteria Baumannia cicadellinicola str. Hc (Homalodisca coagulata) 686194 595 607 0.83

PRJNA61595 Bacteria Bdellovibrio bacteriovorus HD100 3782950 3583 3565 26.87 PRJNA59057 Bacteria Beijerinckia indica subsp. indica ATCC 9039 4418616 3784 4026 17.13 PRJNA168182 Bacteria Belliella baltica DSM 15883 4196595 3680 3832 21.09 PRJNA59047 Bacteria Beutenbergia cavernae DSM 12333 4669183 4197 4217 11.79 PRJNA58559 Bacteria Bifidobacterium adolescentis ATCC 15703 2089645 1631 1683 12.91

Page 12: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA162513 Bacteria Bifidobacterium animalis subsp. animalis ATCC 25527 1932693 1538 1564 12.73

PRJNA58911 Bacteria Bifidobacterium animalis subsp. lactis AD011 1933695 1528 1589 13.25 PRJNA163691 Bacteria Bifidobacterium animalis subsp. lactis B420 1938595 1561 1570 13.61 PRJNA158871 Bacteria Bifidobacterium animalis subsp. lactis BB-12 1942198 1642 1579 16.73 PRJNA163693 Bacteria Bifidobacterium animalis subsp. lactis Bi-07 1938822 1597 1571 14.77 PRJNA59359 Bacteria Bifidobacterium animalis subsp. lactis Bl-04 1938709 1567 1572 13.57 PRJNA158867 Bacteria Bifidobacterium animalis subsp. lactis BLC1 1943990 1558 1577 13.17

PRJNA158869 Bacteria Bifidobacterium animalis subsp. lactis CNCM I-2494 1943113 1660 1576 17.18

PRJNA59357 Bacteria Bifidobacterium animalis subsp. lactis DSM 10140 1938483 1566 1575 13.69

PRJNA158865 Bacteria Bifidobacterium animalis subsp. lactis V9 1944050 1572 1573 13.58 PRJNA167988 Bacteria Bifidobacterium bifidum BGN4 2223664 1835 1798 18.77 PRJNA59883 Bacteria Bifidobacterium bifidum PRL2010 2214656 1707 1834 15.67 PRJNA59545 Bacteria Bifidobacterium bifidum S17 2186882 1784 1794 17.36 PRJNA158863 Bacteria Bifidobacterium breve ACS-071-V-Sch8b 2327492 1826 1943 13.61 PRJNA13487 Bacteria Bifidobacterium breve UCC2003 2422684 1854 2045 13.88 PRJNA43091 Bacteria Bifidobacterium dentium Bd1 2636367 2129 2155 15.67 PRJNA58833 Bacteria Bifidobacterium longum DJO10A 2389526 2003 1968 16.97 PRJNA57939 Bacteria Bifidobacterium longum NCC2705 2260266 1729 1826 13.87 PRJNA62693 Bacteria Bifidobacterium longum subsp. infantis 157F 2408831 1999 2000 18.60

PRJNA159865 Bacteria Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 2828958 2552 2565 25.05

PRJNA58677 Bacteria Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222 2832748 2416 2565 23.37

PRJNA60163 Bacteria Bifidobacterium longum subsp. longum BBMN68 2265943 1806 1823 13.45

PRJNA45963 Bacteria Bifidobacterium longum subsp. longum F8 2384987 1682 1952 17.03

PRJNA62695 Bacteria Bifidobacterium longum subsp. longum JCM 1217 2385164 1924 1949 17.25

PRJNA49131 Bacteria Bifidobacterium longum subsp. longum JDM301 2477838 1959 2027 14.10

PRJNA158861 Bacteria Bifidobacterium longum subsp. longum KACC 2395764 1985 1951 16.03

Page 13: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

91563

PRJNA89391 Bacteria Blastococcus saxobsidens DD2 4875340 4844 4649 18.42 PRJNA165873 Bacteria Blattabacterium sp. (Blaberus giganteus) 632588 578 588 3.52

PRJNA41533 Bacteria Blattabacterium sp. (Blattella germanica) str. Bge 640935 591 595 3.71

PRJNA81083 Bacteria Blattabacterium sp. (Cryptocercus punctulatus) str. Cpu 609561 548 560 4.51

PRJNA77127 Bacteria Blattabacterium sp. (Mastotermes darwiniensis) str. MADAR 590336 544 557 3.54

PRJNA41287 Bacteria Blattabacterium sp. (Periplaneta americana) str. BPLAN 640442 582 601 3.89

PRJNA61563 Bacteria Bordetella avium 197N 3732255 3423 3403 10.52 PRJNA57613 Bacteria Bordetella bronchiseptica RB50 5339179 5006 5005 8.06 PRJNA57615 Bacteria Bordetella parapertussis 12822 4773551 4402 4512 6.39 PRJNA158859 Bacteria Bordetella pertussis CS 4124236 3456 3927 9.96 PRJNA57617 Bacteria Bordetella pertussis Tohama I 4086189 3806 3887 9.74 PRJNA61631 Bacteria Bordetella petrii 5287950 5035 5076 10.02 PRJNA159867 Bacteria Borrelia afzelii PKo 1404288 1455 1388 24.09 PRJNA58653 Bacteria Borrelia afzelii PKo 1232503 1263 1160 24.85 PRJNA71231 Bacteria Borrelia bissettii DN127 1403443 1463 1422 24.11 PRJNA57581 Bacteria Borrelia burgdorferi B31 1519856 1381 1509 25.48 PRJNA161197 Bacteria Borrelia burgdorferi JD1 1531091 1458 1513 22.34 PRJNA161241 Bacteria Borrelia burgdorferi N40 1339539 1242 1306 20.84 PRJNA59429 Bacteria Borrelia burgdorferi ZS7 1345494 1239 1330 21.78 PRJNA162335 Bacteria Borrelia crocidurae str. Achema 1528893 1470 1486 26.88 PRJNA58791 Bacteria Borrelia duttonii Ly 1574881 1305 1554 24.66 PRJNA162165 Bacteria Borrelia garinii BgVir 993811 956 925 20.36 PRJNA58125 Bacteria Borrelia garinii PBi 986914 932 919 19.12 PRJNA59225 Bacteria Borrelia hermsii DAH 922307 819 833 14.89 PRJNA58793 Bacteria Borrelia recurrentis A1 1242163 990 1214 25.27 PRJNA58311 Bacteria Borrelia turicatae 91E135 917330 818 830 14.93 PRJNA58649 Bacteria Brachybacterium faecium DSM 4810 3614992 3068 3119 11.99

Page 14: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59291 Bacteria Brachyspira hyodysenteriae WA1 3036634 2644 2620 19.24 PRJNA158369 Bacteria Brachyspira intermedia PWS A 3308048 2872 2889 22.25 PRJNA48819 Bacteria Brachyspira murdochii DSM 12563 3241804 2809 2870 22.06 PRJNA50609 Bacteria Brachyspira pilosicoli 95 1000 2586443 2301 2282 18.92 PRJNA57599 Bacteria Bradyrhizobium japonicum USDA 110 9105828 8317 8513 17.92 PRJNA158851 Bacteria Bradyrhizobium japonicum USDA 6 9207384 8829 8741 20.22 PRJNA58505 Bacteria Bradyrhizobium sp. BTAi1 8493513 7622 7672 15.78 PRJNA58941 Bacteria Bradyrhizobium sp. ORS 278 7456587 6752 6680 16.63 PRJNA158167 Bacteria Bradyrhizobium sp. S23321 7231841 6898 6849 17.11 PRJNA59175 Bacteria Brevibacillus brevis NBRC 100599 6296436 5949 5924 17.38 PRJNA42117 Bacteria Brevundimonas subvibrioides ATCC 15264 3445263 3327 3336 17.78 PRJNA83615 Bacteria Brucella abortus A13334 3286032 3338 3134 13.72 PRJNA58019 Bacteria Brucella abortus bv. 1 str. 9-941 3286445 3085 3126 13.85 PRJNA58873 Bacteria Brucella abortus S19 3283936 3000 3132 10.83 PRJNA59009 Bacteria Brucella canis ATCC 23365 3312769 3251 3103 14.26 PRJNA83613 Bacteria Brucella canis HSK A52141 3277512 3280 3071 13.35 PRJNA59241 Bacteria Brucella melitensis ATCC 23457 3311219 3136 3155 13.72 PRJNA62937 Bacteria Brucella melitensis biovar Abortus 2308 3278307 3350 3122 13.64 PRJNA57735 Bacteria Brucella melitensis bv. 1 str. 16M 3294931 3198 3152 11.46 PRJNA158857 Bacteria Brucella melitensis M28 3311748 3363 3158 13.92 PRJNA158855 Bacteria Brucella melitensis M5-90 3312229 3360 3163 13.72 PRJNA158853 Bacteria Brucella melitensis NI 3294475 3229 3143 11.93 PRJNA59319 Bacteria Brucella microti CCM 4915 3337369 3287 3093 13.97 PRJNA58113 Bacteria Brucella ovis ATCC 25840 3275590 2892 3222 13.17 PRJNA71131 Bacteria Brucella pinnipedialis B2 94 3399268 3325 3216 15.00 PRJNA159871 Bacteria Brucella suis 1330 3315163 3266 3101 14.67 PRJNA57927 Bacteria Brucella suis 1330 3315175 3273 3103 14.77 PRJNA59015 Bacteria Brucella suis ATCC 23445 3324607 3241 3153 14.73 PRJNA83617 Bacteria Brucella suis VBI22 3316088 3270 3108 14.66 PRJNA68101 Bacteria Buchnera aphidicola (Cinara tujafilina) 444925 369 399 3.26 PRJNA58579 Bacteria Buchnera aphidicola BCc 422434 365 374 2.44

Page 15: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59285 Bacteria Buchnera aphidicola str. 5A (Acyrthosiphon pisum) 642122 555 580 1.41

PRJNA158533 Bacteria Buchnera aphidicola str. Ak (Acyrthosiphon kondoi) 653223 569 595 1.63

PRJNA57805 Bacteria Buchnera aphidicola str. APS (Acyrthosiphon pisum) 655725 574 595 1.88

PRJNA57827 Bacteria Buchnera aphidicola str. Bp (Baizongia pistaciae) 618379 507 526 1.45

PRJNA158845 Bacteria Buchnera aphidicola str. JF98 (Acyrthosiphon pisum) 641771 477 736 6.43

PRJNA158847 Bacteria Buchnera aphidicola str. JF99 (Acyrthosiphon pisum) 641716 590 593 4.31

PRJNA158843 Bacteria Buchnera aphidicola str. LL01 (Acyrthosiphon pisum) 641799 577 605 4.40

PRJNA57913 Bacteria Buchnera aphidicola str. Sg (Schizaphis graminum) 641454 545 599 1.14

PRJNA158849 Bacteria Buchnera aphidicola str. TLW03 (Acyrthosiphon pisum) 641770 573 610 4.99

PRJNA59283 Bacteria Buchnera aphidicola str. Tuc7 (Acyrthosiphon pisum) 641895 553 580 1.41

PRJNA158535 Bacteria Buchnera aphidicola str. Ua (Uroleucon ambrosiae) 627953 538 555 1.19

PRJNA58303 Bacteria Burkholderia ambifaria AMMD 7528567 6610 6615 11.07 PRJNA58701 Bacteria Burkholderia ambifaria MC40-6 7642536 6697 6745 11.98 PRJNA58371 Bacteria Burkholderia cenocepacia AU 1054 7279116 6477 6510 10.49 PRJNA58369 Bacteria Burkholderia cenocepacia HI2424 7702840 6919 6885 11.21 PRJNA57953 Bacteria Burkholderia cenocepacia J2315 8055782 7260 7311 12.09 PRJNA58769 Bacteria Burkholderia cenocepacia MC0-3 7971389 7008 7114 10.42 PRJNA173858 Bacteria Burkholderia cepacia GG4 6467321 5825 5852 10.40 PRJNA66301 Bacteria Burkholderia gladioli BSR3 9052299 7410 7745 13.42 PRJNA59397 Bacteria Burkholderia glumae BGR1 7284636 5773 6413 15.99 PRJNA57725 Bacteria Burkholderia mallei ATCC 23344 5835527 5025 4994 17.28 PRJNA58383 Bacteria Burkholderia mallei NCTC 10229 5742303 5157 4914 18.39 PRJNA58385 Bacteria Burkholderia mallei NCTC 10247 5848380 5413 5012 20.16

Page 16: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58387 Bacteria Burkholderia mallei SAVP1 5232401 5189 4490 19.65 PRJNA58697 Bacteria Burkholderia multivorans ATCC 17616 7008622 6259 6356 12.93 PRJNA58909 Bacteria Burkholderia multivorans ATCC 17616 7008810 6230 6368 12.68 PRJNA58699 Bacteria Burkholderia phymatum STM815 8676562 7496 7810 15.05 PRJNA58729 Bacteria Burkholderia phytofirmans PsJN 8214658 7241 7333 13.54 PRJNA162511 Bacteria Burkholderia pseudomallei 1026b 7231415 6070 5959 13.83 PRJNA58515 Bacteria Burkholderia pseudomallei 1106a 7089249 7181 5756 21.88 PRJNA58391 Bacteria Burkholderia pseudomallei 1710b 7308054 6347 5976 16.90 PRJNA58389 Bacteria Burkholderia pseudomallei 668 7040403 7117 5795 21.63 PRJNA174460 Bacteria Burkholderia pseudomallei BPC006 7155061 7159 5869 21.80 PRJNA57733 Bacteria Burkholderia pseudomallei K96243 7247547 5855 5939 11.83 PRJNA60487 Bacteria Burkholderia rhizoxinica HKI 454 3750138 3870 3323 26.72 PRJNA58073 Bacteria Burkholderia sp. 383 8676277 7717 7717 11.42 PRJNA42975 Bacteria Burkholderia sp. CCGE1001 6833751 5965 6018 12.93 PRJNA42523 Bacteria Burkholderia sp. CCGE1002 7884858 6889 7154 15.09 PRJNA46253 Bacteria Burkholderia sp. CCGE1003 7043595 5988 6075 11.27 PRJNA165871 Bacteria Burkholderia sp. KJ006 6629912 6024 5796 14.86 PRJNA81081 Bacteria Burkholderia sp. YI23 8896411 7804 8224 13.63 PRJNA58081 Bacteria Burkholderia thailandensis E264 6723972 5634 5661 11.86 PRJNA58075 Bacteria Burkholderia vietnamiensis G4 8391070 7617 7717 21.23 PRJNA57823 Bacteria Burkholderia xenovorans LB400 9731138 8702 8702 14.51 PRJNA39147 Bacteria Butyrivibrio fibrisolvens 16 4 3164379 2904 3332 22.59 PRJNA51489 Bacteria Butyrivibrio proteoclasticus B316 4404886 3811 3867 24.73 PRJNA59201 Bacteria Caldicellulosiruptor bescii DSM 6725 2931662 2666 2786 18.86 PRJNA60157 Bacteria Caldicellulosiruptor hydrothermalis 108 2770676 2546 2630 13.95 PRJNA60393 Bacteria Caldicellulosiruptor kristjanssonii 177R1B 2802443 2482 2672 17.15 PRJNA60491 Bacteria Caldicellulosiruptor kronotskyensis 2002 2843785 2466 2588 14.29 PRJNA60575 Bacteria Caldicellulosiruptor lactoaceticus 6A 2674809 2318 2563 16.39 PRJNA51501 Bacteria Caldicellulosiruptor obsidiansis OB47 2532343 2188 2376 13.41 PRJNA60165 Bacteria Caldicellulosiruptor owensensis OL 2428903 2147 2308 13.65 PRJNA58289 Bacteria Caldicellulosiruptor saccharolyticus DSM 8903 2970275 2682 2855 15.73

Page 17: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA158165 Bacteria Caldilinea aerophila DSM 14535 = NBRC 104270 5144873 4119 4163 13.28

PRJNA158173 Bacteria Caldisericum exile AZM16c01 1558103 1582 1513 12.89 PRJNA60821 Bacteria Calditerrivibrio nitroreducens DSM 19672 2216552 2100 2127 10.17 PRJNA58667 Bacteria Campylobacter concisus 13826 2099413 2080 2078 22.27 PRJNA58669 Bacteria Campylobacter curvus 525.92 1971264 1931 1941 16.12 PRJNA58545 Bacteria Campylobacter fetus subsp. fetus 82-40 1773615 1719 1741 12.98 PRJNA58981 Bacteria Campylobacter hominis ATCC BAA-381 1714951 1687 1637 22.44 PRJNA57899 Bacteria Campylobacter jejuni RM1221 1777831 1838 1878 19.16 PRJNA58671 Bacteria Campylobacter jejuni subsp. doylei 269.97 1845106 1731 1975 18.43 PRJNA58503 Bacteria Campylobacter jejuni subsp. jejuni 81-176 1699052 1758 1726 15.30 PRJNA58771 Bacteria Campylobacter jejuni subsp. jejuni 81116 1628115 1626 1618 11.22 PRJNA159531 Bacteria Campylobacter jejuni subsp. jejuni IA3902 1672219 1666 1699 13.34

PRJNA61249 Bacteria Campylobacter jejuni subsp. jejuni ICDCCJ07001 1708924 1531 1858 12.51

PRJNA159535 Bacteria Campylobacter jejuni subsp. jejuni M1 1616648 1622 1639 11.28

PRJNA57587 Bacteria Campylobacter jejuni subsp. jejuni NCTC 11168 = ATCC 700819 1641481 1643 1658 11.72

PRJEA174152 Bacteria Campylobacter jejuni subsp. jejuni NCTC 11168-BN148 1641481 1643 1658 11.84

PRJNA159533 Bacteria Campylobacter jejuni subsp. jejuni S3 1724586 1813 1813 16.05 PRJNA58115 Bacteria Campylobacter lari RM2100 1571661 1546 1578 12.68

PRJNA59207 Bacteria Candidatus Accumulibacter phosphatis clade IIA str. UW-1 5306133 4562 4711 15.76

PRJNA58963 Bacteria Candidatus Amoebophilus asiaticus 5a2 1884364 1335 1537 26.22 PRJNA71379 Bacteria Candidatus Arthromitus sp. SFB-mouse-Japan 1620005 1515 1538 17.26 PRJNA159517 Bacteria Candidatus Arthromitus sp. SFB-mouse-Yit 1586397 1420 1495 15.71 PRJNA73425 Bacteria Candidatus Arthromitus sp. SFB-rat-Yit 1515556 1346 1401 15.22

PRJNA59163 Bacteria Candidatus Azobacteroides pseudotrichonymphae genomovar. CFP2 1224919 852 1089 22.63

PRJNA57999 Bacteria Candidatus Blochmannia floridanus 705557 589 605 1.93

PRJNA58329 Bacteria Candidatus Blochmannia pennsylvanicus str. BPEN 791654 610 624 1.38

Page 18: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA62083 Bacteria Candidatus Blochmannia vafer str. BVAF 722593 587 604 1.51

PRJNA62959 Bacteria Candidatus Cloacamonas acidaminovorans str. Evry 2246820 1816 1748 19.11

PRJNA59067 Bacteria Candidatus Desulforudis audaxviator MP104C 2349476 2157 2292 13.89

PRJNA59289 Bacteria Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) 2169363 2155 2134 25.51

PRJNA58479 Bacteria Candidatus Koribacter versatilis Ellin345 5650368 4777 4995 20.69 PRJNA59227 Bacteria Candidatus Liberibacter asiaticus str. psy62 1227328 1109 1092 19.95

PRJNA61245 Bacteria Candidatus Liberibacter solanacearum CLso-ZC1 1258278 1192 1125 22.40

PRJNA68687 Bacteria Candidatus Midichloria mitochondrii IricVA 1183732 1210 1327 30.98 PRJNA68739 Bacteria Candidatus Moranella endobia PCIT 538294 406 447 2.81

PRJNA76933 Bacteria Candidatus Mycoplasma haemominutum 'Birmingham 1' 513880 547 546 53.16

PRJNA51175 Bacteria Candidatus Nitrospira defluvii 4317083 4271 4072 24.02 PRJNA66305 Bacteria Candidatus Pelagibacter sp. IMCC9063 1284727 1447 1422 12.20 PRJNA58401 Bacteria Candidatus Pelagibacter ubique HTCC1062 1308759 1354 1353 7.94 PRJNA61641 Bacteria Candidatus Phytoplasma australiense 879959 839 836 36.18 PRJNA59087 Bacteria Candidatus Phytoplasma mali 601943 497 503 28.40 PRJNA173859 Bacteria Candidatus Portiera aleyrodidarum BT-B-HRs 358242 256 263 5.01

PRJNA58079 Bacteria Candidatus Protochlamydia amoebophila UWE25 2414465 2031 1910 28.27

PRJNA47081 Bacteria Candidatus Puniceispirillum marinum IMCC1322 2753527 2546 2599 12.87

PRJNA156845 Bacteria Candidatus Rickettsia amblyommii str. GAT-30V 1480884 1390 1694 30.19

PRJNA46841 Bacteria Candidatus Riesia pediculicola USDA 582127 556 500 12.97

PRJNA58645 Bacteria Candidatus Ruthia magnifica str. Cm (Calyptogena magnifica) 1160782 976 1139 10.92

PRJNA58139 Bacteria Candidatus Solibacter usitatus Ellin6076 9965640 7826 8208 19.01 PRJNA59427 Bacteria Candidatus Vesicomyosocius okutanii HA 1022154 939 974 6.95 PRJNA70727 Bacteria Capnocytophaga canimorsus Cc5 2571406 2405 2256 24.74 PRJNA59197 Bacteria Capnocytophaga ochracea DSM 7271 2612925 2171 2185 20.45 PRJNA57821 Bacteria Carboxydothermus hydrogenoformans Z-2901 2401520 2620 2477 15.07

Page 19: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA65789 Bacteria Carnobacterium sp. 17-4 2685399 2474 2509 10.56 PRJNA59077 Bacteria Catenulispora acidiphila DSM 44928 10467782 8914 9025 19.22 PRJNA57891 Bacteria Caulobacter crescentus CB15 4016947 3737 3703 13.56 PRJNA59307 Bacteria Caulobacter crescentus NA1000 4042929 3877 3722 14.02 PRJNA41709 Bacteria Caulobacter segnis ATCC 21756 4655622 4139 4288 11.97 PRJNA58551 Bacteria Caulobacter sp. K31 5889399 5438 5487 14.87 PRJNA66779 Bacteria Cellulomonas fimi ATCC 484 4266344 3762 3821 13.68 PRJNA48821 Bacteria Cellulomonas flavigena DSM 20109 4123179 3678 3721 16.75 PRJNA62159 Bacteria Cellulophaga algicola DSM 14237 4888353 4163 4280 22.72 PRJNA63401 Bacteria Cellulophaga lytica DSM 7489 3765936 3284 3303 17.63 PRJNA68143 Bacteria Cellvibrio gilvus ATCC 13127 3526441 3164 3210 15.20 PRJNA59139 Bacteria Cellvibrio japonicus Ueda107 4576573 3754 3691 14.98 PRJNA58069 Bacteria Chelativorans sp. BNC1 4935185 4543 4705 12.88 PRJNA59113 Bacteria Chitinophaga pinensis DSM 2588 9127347 7192 7309 20.98 PRJNA57785 Bacteria Chlamydia muridarum Nigg 1080451 911 897 19.36 PRJEA71067 Bacteria Chlamydia trachomatis 1051525 8 913 20.09 PRJEA71069 Bacteria Chlamydia trachomatis 1051879 8 913 20.09 PRJEA71071 Bacteria Chlamydia trachomatis 1050078 8 907 20.33 PRJEA71073 Bacteria Chlamydia trachomatis 1049632 8 905 19.93 PRJEA71075 Bacteria Chlamydia trachomatis 1050053 8 904 19.85 PRJEA71077 Bacteria Chlamydia trachomatis 1050091 8 913 20.85 PRJEA71081 Bacteria Chlamydia trachomatis 1050127 8 907 20.33 PRJEA71083 Bacteria Chlamydia trachomatis 1056314 8 914 20.39 PRJEA71089 Bacteria Chlamydia trachomatis 1050282 8 907 20.22 PRJEA71091 Bacteria Chlamydia trachomatis 1049642 8 906 20.13 PRJEA71093 Bacteria Chlamydia trachomatis 1050130 8 905 19.82 PRJEA71095 Bacteria Chlamydia trachomatis 1050283 8 905 19.82 PRJEA71097 Bacteria Chlamydia trachomatis 1049940 8 904 19.74 PRJEA71101 Bacteria Chlamydia trachomatis 1046584 8 902 19.78 PRJEA71103 Bacteria Chlamydia trachomatis 1046319 8 904 19.85 PRJEA71107 Bacteria Chlamydia trachomatis 1046364 8 903 19.76

Page 20: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJEA71109 Bacteria Chlamydia trachomatis 1046337 8 902 19.78 PRJEA71111 Bacteria Chlamydia trachomatis 1046376 8 903 19.76 PRJEA71113 Bacteria Chlamydia trachomatis 1046381 8 903 19.65 PRJEA71115 Bacteria Chlamydia trachomatis 1046220 8 903 19.76 PRJEA71117 Bacteria Chlamydia trachomatis 1046379 8 903 19.76 PRJEA71119 Bacteria Chlamydia trachomatis 1046364 8 903 19.76 PRJEA71121 Bacteria Chlamydia trachomatis 1046218 8 903 19.76 PRJEA71123 Bacteria Chlamydia trachomatis 1046364 8 903 19.76 PRJEA71125 Bacteria Chlamydia trachomatis 1046348 8 902 19.67 PRJEA71127 Bacteria Chlamydia trachomatis 1046672 8 901 19.80 PRJEA71457 Bacteria Chlamydia trachomatis 1046041 8 911 20.02 PRJEA71459 Bacteria Chlamydia trachomatis 1046246 8 905 19.93 PRJEA71461 Bacteria Chlamydia trachomatis 1046928 8 903 19.76 PRJNA61633 Bacteria Chlamydia trachomatis 434 Bu 1038842 889 897 19.71 PRJEA71065 Bacteria Chlamydia trachomatis A 363 1044026 0* 905 20.00 PRJNA58333 Bacteria Chlamydia trachomatis A HAR-13 1051969 919 916 20.11 PRJNA159863 Bacteria Chlamydia trachomatis A2497 1044325 894 906 19.78 PRJNA159993 Bacteria Chlamydia trachomatis A2497 1051806 989 914 23.70 PRJNA59351 Bacteria Chlamydia trachomatis B Jali20 OT 1044352 893 907 20.00 PRJNA59349 Bacteria Chlamydia trachomatis B TZ1A828 OT 1044282 894 903 19.76 PRJNA57637 Bacteria Chlamydia trachomatis D UW-3 CX 1042519 894 897 19.77 PRJNA159881 Bacteria Chlamydia trachomatis D-EC 1050015 878 905 19.46 PRJNA159879 Bacteria Chlamydia trachomatis D-LC 1050013 878 904 19.42 PRJNA161369 Bacteria Chlamydia trachomatis E 11023 1043025 926 899 21.70 PRJNA161403 Bacteria Chlamydia trachomatis E 150 1042996 927 901 21.83 PRJEA167483 Bacteria Chlamydia trachomatis E SW3 1042903 889 898 19.70 PRJEA167484 Bacteria Chlamydia trachomatis F SW4 1042736 889 901 19.94 PRJEA167485 Bacteria Chlamydia trachomatis F SW5 1042743 889 900 19.90 PRJNA161409 Bacteria Chlamydia trachomatis G 11074 1042875 919 897 21.04 PRJNA161361 Bacteria Chlamydia trachomatis G 11222 1042354 927 899 21.52 PRJNA161377 Bacteria Chlamydia trachomatis G 9301 1042811 921 897 21.12

Page 21: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA161353 Bacteria Chlamydia trachomatis G 9768 1042810 920 898 21.12 PRJEA71099 Bacteria Chlamydia trachomatis L1 440 LN 1039261 0* 894 19.91 PRJEA71105 Bacteria Chlamydia trachomatis L2 25667R 1038839 0* 894 19.69 PRJNA61635 Bacteria Chlamydia trachomatis L2b UCH-1 proctitis 1038863 889 894 19.57 PRJNA68843 Bacteria Chlamydia trachomatis L2c 1038313 954 899 22.67 PRJEA161995 Bacteria Chlamydia trachomatis Sweden2 1042839 889 901 19.89 PRJNA57963 Bacteria Chlamydophila abortus S26 3 1144377 961 1008 20.87 PRJNA57783 Bacteria Chlamydophila caviae GPIC 1181356 1005 1002 21.77 PRJNA57971 Bacteria Chlamydophila felis Fe C-56 1173791 1013 994 21.33 PRJNA66295 Bacteria Chlamydophila pecorum E58 1106197 988 943 21.80 PRJNA57809 Bacteria Chlamydophila pneumoniae AR39 1234385 1116 1058 25.62 PRJNA57811 Bacteria Chlamydophila pneumoniae CWL029 1230230 1052 1060 23.48 PRJNA57829 Bacteria Chlamydophila pneumoniae J138 1226565 1069 1056 23.39 PRJNA159529 Bacteria Chlamydophila pneumoniae LPCoLN 1248550 1105 1035 24.25 PRJNA57997 Bacteria Chlamydophila pneumoniae TW-183 1225935 1113 1053 24.84 PRJNA159527 Bacteria Chlamydophila psittaci 01DC11 1172197 975 997 20.64 PRJNA159521 Bacteria Chlamydophila psittaci 02DC15 1172182 978 997 20.61 PRJNA159525 Bacteria Chlamydophila psittaci 08DC60 1172032 973 1004 20.74 PRJNA159845 Bacteria Chlamydophila psittaci 6BC 1179220 1010 1000 21.44 PRJNA63621 Bacteria Chlamydophila psittaci 6BC 1179213 975 995 20.20 PRJNA159523 Bacteria Chlamydophila psittaci C19 98 1169374 978 996 20.57 PRJEA162063 Bacteria Chlamydophila psittaci RD1 1171629 967 1008 20.15 PRJNA59185 Bacteria Chlorobaculum parvum NCIB 8327 2289249 2043 2091 11.03 PRJNA58375 Bacteria Chlorobium chlorochromatii CaD3 2572079 2002 2112 13.76 PRJNA58127 Bacteria Chlorobium limicola DSM 245 2763181 2434 2573 15.46 PRJNA58175 Bacteria Chlorobium luteolum DSM 273 2364842 2083 2196 11.38 PRJNA58131 Bacteria Chlorobium phaeobacteroides BS1 2736403 2469 2592 15.59 PRJNA58133 Bacteria Chlorobium phaeobacteroides DSM 266 3133902 2650 2890 20.43 PRJNA58129 Bacteria Chlorobium phaeovibrioides DSM 265 1966858 1753 1800 9.12 PRJNA57897 Bacteria Chlorobium tepidum TLS 2154946 2255 2007 16.80 PRJNA58621 Bacteria Chloroflexus aggregans DSM 9485 4684931 3730 3960 16.61

Page 22: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA57657 Bacteria Chloroflexus aurantiacus J-10-fl 5258541 3853 4341 17.07 PRJNA59085 Bacteria Chloroflexus sp. Y-400-fl 5268950 4159 4344 20.35 PRJNA59187 Bacteria Chloroherpeton thalassium ATCC 35110 3293456 2710 2804 15.49 PRJNA58001 Bacteria Chromobacterium violaceum ATCC 12472 4751080 4407 4298 13.30 PRJNA62921 Bacteria Chromohalobacter salexigens DSM 3043 3696649 3298 3321 7.12 PRJNA43089 Bacteria Citrobacter rodentium ICC168 5444283 5103 5055 11.46

PRJNA61625 Bacteria Clavibacter michiganensis subsp. michiganensis NCPPB 382 3395237 3108 3155 16.09

PRJNA61577 Bacteria Clavibacter michiganensis subsp. sepedonicus 3403786 3238 3244 19.93 PRJNA46219 Bacteria Clostridiales genomosp. BVAB3 str. UPII9-5 1809746 1567 1491 17.33 PRJNA39155 Bacteria Clostridiales sp SS3-4 3601020 2993 3333 21.42 PRJNA45957 Bacteria Clostridiales sp SSC-2 3114788 2771 3072 19.39 PRJNA57677 Bacteria Clostridium acetobutylicum ATCC 824 4132880 3848 3890 16.48 PRJNA68293 Bacteria Clostridium acetobutylicum DSM 1731 4145581 3922 3883 16.77 PRJNA159515 Bacteria Clostridium acetobutylicum EA 2018 4132226 3916 3885 16.81 PRJNA58137 Bacteria Clostridium beijerinckii NCIMB 8052 6000632 5020 5237 13.84 PRJNA58927 Bacteria Clostridium botulinum A str. ATCC 19397 3863450 3551 3510 14.39 PRJNA61579 Bacteria Clostridium botulinum A str. ATCC 3502 3903260 3671 3556 15.17 PRJNA58931 Bacteria Clostridium botulinum A str. Hall 3760560 3403 3387 13.36 PRJNA59229 Bacteria Clostridium botulinum A2 str. Kyoto 4155278 3878 3824 16.65 PRJNA59149 Bacteria Clostridium botulinum A3 str. Loch Maree 4259691 3983 3908 18.72 PRJNA59159 Bacteria Clostridium botulinum B str. Eklund 17B 3847969 3475 3487 18.47 PRJNA59147 Bacteria Clostridium botulinum B1 str. Okra 4107013 3846 3792 17.60 PRJNA59173 Bacteria Clostridium botulinum Ba4 str. 657 4257769 4003 4032 19.27 PRJNA66203 Bacteria Clostridium botulinum BKT015925 3207592 2997 2992 19.44 PRJNA59157 Bacteria Clostridium botulinum E3 str. Alaska E43 3659644 3257 3223 15.14 PRJNA159513 Bacteria Clostridium botulinum F str. 230613 4010614 3527 3880 16.39 PRJNA58929 Bacteria Clostridium botulinum F str. Langeland 4012918 3655 3656 15.07 PRJEA162091 Bacteria Clostridium botulinum H04402 065 3919740 3719 3654 15.30 PRJNA58709 Bacteria Clostridium cellulolyticum H10 4068724 3390 3510 18.12 PRJNA51503 Bacteria Clostridium cellulovorans 743B 5262222 4254 4365 17.31 PRJEA45855 Bacteria Clostridium cf. saccharolyticum K10 3769775 3073 3397 20.90

Page 23: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA82345 Bacteria Clostridium clariflavum DSM 19732 4897678 3892 4193 20.04 PRJNA158365 Bacteria Clostridium difficile 2007855 4179867 0* 3709 11.59 PRJNA57679 Bacteria Clostridium difficile 630 4298133 3908 3827 13.45 PRJNA158363 Bacteria Clostridium difficile BI1 4464700 0* 4001 15.20 PRJNA42473 Bacteria Clostridium difficile BI9 4178227 0* 3944 13.64 PRJNA41017 Bacteria Clostridium difficile CD196 4110554 3485 3628 10.38 PRJNA158359 Bacteria Clostridium difficile CF5 4159517 0* 3720 12.45 PRJNA158361 Bacteria Clostridium difficile M120 4047729 0* 3595 12.38 PRJNA42463 Bacteria Clostridium difficile M68 4308325 0* 3843 13.09 PRJNA40921 Bacteria Clostridium difficile R20291 4191339 3543 3701 10.55 PRJNA58885 Bacteria Clostridium kluyveri DSM 555 4023800 3913 3947 19.39 PRJNA59369 Bacteria Clostridium kluyveri NBRC 12016 3955303 3523 3841 15.92 PRJNA49117 Bacteria Clostridium lentocellum DSM 5427 4714237 4182 4245 19.46 PRJNA50583 Bacteria Clostridium ljungdahlii DSM 13528 4630065 4184 4251 13.43 PRJNA58643 Bacteria Clostridium novyi NT 2547720 2325 2319 12.32 PRJNA57901 Bacteria Clostridium perfringens ATCC 13124 3256683 2876 2878 15.24 PRJNA58117 Bacteria Clostridium perfringens SM101 2960088 2619 2654 14.74 PRJNA57681 Bacteria Clostridium perfringens str. 13 3085740 2723 2730 14.08 PRJNA58519 Bacteria Clostridium phytofermentans ISDg 4847594 3902 4087 14.28 PRJNA51419 Bacteria Clostridium saccharolyticum WM1 4662871 4154 4247 14.37 PRJNA84307 Bacteria Clostridium sp. BNL1100 4613747 3920 4000 17.63 PRJNA68705 Bacteria Clostridium sp. SY8519 2835737 2619 2544 14.99 PRJNA59585 Bacteria Clostridium sticklandii str DSM 519 2715461 2573 2553 11.82 PRJNA57683 Bacteria Clostridium tetani E88 2873333 2432 2791 13.48 PRJNA57917 Bacteria Clostridium thermocellum ATCC 27405 3843301 3173 3378 18.35 PRJNA161989 Bacteria Clostridium thermocellum DSM 1313 3561619 2911 3046 16.47 PRJNA70793 Bacteria Collimonas fungivorans Ter331 5186898 4432 4593 11.10 PRJNA57855 Bacteria Colwellia psychrerythraea 34H 5373180 4910 4508 16.91 PRJNA62961 Bacteria Comamonas testosteroni CNB-1 5464824 4895 4959 12.27 PRJNA43467 Bacteria Conexibacter woesei DSM 14684 6359369 5914 5931 13.98 PRJEA45861 Bacteria Coprococcus catus GD 7 3522704 2985 3308 20.26

Page 24: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59253 Bacteria Coprothermobacter proteolyticus DSM 5265 1424912 1482 1424 13.25 PRJNA47079 Bacteria Coraliomargarita akajimensis DSM 45221 3750771 3120 3145 20.72 PRJNA157997 Bacteria Corallococcus coralloides DSM 2259 10080619 8033 8134 23.95 PRJNA65787 Bacteria Coriobacterium glomerans PW2 2115681 1768 1806 11.95 PRJNA59409 Bacteria Corynebacterium aurimucosum ATCC 700975 2819226 2559 2662 18.04 PRJNA83607 Bacteria Corynebacterium diphtheriae 241 2426551 2245 2295 17.86 PRJNA84309 Bacteria Corynebacterium diphtheriae 31A 2535346 2381 2439 19.73 PRJNA84311 Bacteria Corynebacterium diphtheriae BH8 2485519 2361 2423 19.15 PRJNA84313 Bacteria Corynebacterium diphtheriae C7 (beta) 2499189 2337 2396 18.78 PRJNA84295 Bacteria Corynebacterium diphtheriae CDCE 8392 2433326 2249 2310 17.57 PRJNA84297 Bacteria Corynebacterium diphtheriae HC01 2427149 2248 2291 17.91 PRJNA84317 Bacteria Corynebacterium diphtheriae HC02 2468612 2230 2312 16.20 PRJNA84299 Bacteria Corynebacterium diphtheriae HC03 2478364 2262 2311 17.21 PRJNA84301 Bacteria Corynebacterium diphtheriae HC04 2484332 2276 2317 18.66 PRJNA83605 Bacteria Corynebacterium diphtheriae INCA 402 2449071 2215 2316 16.53 PRJNA57691 Bacteria Corynebacterium diphtheriae NCTC 13129 2488635 2320 2332 18.66 PRJNA84303 Bacteria Corynebacterium diphtheriae PW8 2530683 2322 2420 19.36 PRJNA84305 Bacteria Corynebacterium diphtheriae VA01 2395441 2191 2221 16.16 PRJNA62905 Bacteria Corynebacterium efficiens YS-314 3219505 2998 2878 17.17 PRJNA57905 Bacteria Corynebacterium glutamicum ATCC 13032 3309401 3099 3063 20.40 PRJNA61611 Bacteria Corynebacterium glutamicum ATCC 13032 3282708 3058 3027 19.54 PRJNA58897 Bacteria Corynebacterium glutamicum R 3363299 3080 3151 17.97 PRJNA58399 Bacteria Corynebacterium jeikeium K411 2476822 2120 2134 14.36 PRJNA59411 Bacteria Corynebacterium kroppenstedtii DSM 44385 2446804 2018 2123 16.98 PRJNA159665 Bacteria Corynebacterium pseudotuberculosis 1 06-A 2279118 1963 2160 15.79 PRJNA159677 Bacteria Corynebacterium pseudotuberculosis 1002 2335113 2090 2131 16.47 PRJNA167260 Bacteria Corynebacterium pseudotuberculosis 258 2314404 2088 2104 16.75 PRJNA162175 Bacteria Corynebacterium pseudotuberculosis 267 2337628 2148 2110 17.29 PRJNA83609 Bacteria Corynebacterium pseudotuberculosis 3 99-5 2337938 2142 2136 17.20 PRJNA162167 Bacteria Corynebacterium pseudotuberculosis 31 2297010 2063 2125 16.91 PRJNA89381 Bacteria Corynebacterium pseudotuberculosis 316 2310415 2106 2095 17.19

Page 25: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA159669 Bacteria Corynebacterium pseudotuberculosis 42 02-A 2337606 2051 2136 15.67 PRJNA159675 Bacteria Corynebacterium pseudotuberculosis C231 2328208 2091 2135 16.59 PRJNA159667 Bacteria Corynebacterium pseudotuberculosis CIP 52.97 2320595 2060 2161 17.06 PRJNA168258 Bacteria Corynebacterium pseudotuberculosis Cp162 2293464 2002 2109 15.98 PRJNA50585 Bacteria Corynebacterium pseudotuberculosis FRC41 2337913 2110 2134 15.15 PRJNA159673 Bacteria Corynebacterium pseudotuberculosis I19 2337730 2095 2139 16.56 PRJNA157909 Bacteria Corynebacterium pseudotuberculosis P54B96 2337657 2084 2142 16.73 PRJNA159671 Bacteria Corynebacterium pseudotuberculosis PAT10 2335323 2079 2145 16.57 PRJNA50555 Bacteria Corynebacterium resistens DSM 45100 2601311 2171 2268 16.56 PRJDA169879 Bacteria Corynebacterium ulcerans 0102 2579188 2354 2336 19.36 PRJNA159659 Bacteria Corynebacterium ulcerans 809 2502095 2182 2254 15.62 PRJNA68291 Bacteria Corynebacterium ulcerans BR-AD22 2606374 2338 2409 19.06 PRJNA61639 Bacteria Corynebacterium urealyticum DSM 7109 2369219 2024 2009 14.53 PRJNA62003 Bacteria Corynebacterium variabile DSM 44702 3433007 3039 3169 17.32 PRJNA58893 Bacteria Coxiella burnetii CbuG Q212 2008870 1871 2069 27.01 PRJNA58895 Bacteria Coxiella burnetii CbuK Q154 2102380 1947 2146 26.86 PRJNA58629 Bacteria Coxiella burnetii Dugway 5J108-111 2212937 2045 2105 25.81 PRJNA58637 Bacteria Coxiella burnetii RSA 331 2053744 1975 2117 29.91 PRJNA57631 Bacteria Coxiella burnetii RSA 493 2032674 1853 2055 26.15 PRJNA49661 Bacteria Croceibacter atlanticus HTCC2559 2952962 2702 2684 16.64 PRJNA40821 Bacteria Cronobacter turicensis z3032 4599092 4455 4204 11.19 PRJNA59041 Bacteria Cryptobacterium curtum DSM 15641 1617804 1357 1355 9.85 PRJNA57815 Bacteria Cupriavidus metallidurans CH34 6913352 6755 6361 16.63 PRJNA68689 Bacteria Cupriavidus necator N-1 8480857 7832 7814 12.51 PRJNA61615 Bacteria Cupriavidus taiwanensis LMG 19424 6476522 5975 5755 12.72 PRJNA43697 Bacteria Cyanobacterium sp UCYN-A 1443806 1200 1200 9.63 PRJNA59013 Bacteria Cyanothece sp. ATCC 51142 5460377 5304 5103 24.06 PRJNA59025 Bacteria Cyanothece sp. PCC 7424 6554169 5710 5921 24.76 PRJNA59435 Bacteria Cyanothece sp. PCC 7425 5786110 5327 5425 21.99 PRJNA52547 Bacteria Cyanothece sp. PCC 7822 7841948 6642 7017 28.03 PRJNA59027 Bacteria Cyanothece sp. PCC 8801 4787694 4367 4551 19.97

Page 26: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59143 Bacteria Cyanothece sp. PCC 8802 4803347 4444 4593 20.57 PRJNA71485 Bacteria Cyclobacterium marinum DSM 745 6221273 5000 5114 20.10 PRJNA57651 Bacteria Cytophaga hutchinsonii ATCC 33406 4433218 3785 3780 22.76 PRJNA58025 Bacteria Dechloromonas aromatica RCB 4501104 4171 4228 12.58 PRJNA81439 Bacteria Dechlorosoma suillum PS 3806980 3443 3439 11.41 PRJNA46653 Bacteria Deferribacter desulfuricans SSM1 2542933 2400 2466 15.02 PRJNA57763 Bacteria Dehalococcoides ethenogenes 195 1469720 1580 1543 21.61 PRJNA58477 Bacteria Dehalococcoides sp. BAV1 1341892 1371 1407 16.38 PRJNA58413 Bacteria Dehalococcoides sp. CBDB1 1395502 1458 1436 15.65 PRJNA42115 Bacteria Dehalococcoides sp. GT 1360154 1417 1426 18.75 PRJNA42393 Bacteria Dehalococcoides sp. VS 1413462 1438 1459 16.15

PRJNA48131 Bacteria Dehalogenimonas lykanthroporepellens BL-DC-9 1686510 1659 1718 17.65

PRJNA58615 Bacteria Deinococcus deserti VCD115 3855329 3459 3634 18.79 PRJNA58275 Bacteria Deinococcus geothermalis DSM 11300 3247018 3054 3154 17.59 PRJNA162509 Bacteria Deinococcus gobiensis I-0 4406036 4340 4248 25.76 PRJNA62225 Bacteria Deinococcus maricopensis DSM 21211 3498530 3264 3300 18.17 PRJNA63399 Bacteria Deinococcus proteolyticus MRP 2886836 2656 2742 20.60 PRJNA57665 Bacteria Deinococcus radiodurans R1 3284156 3102 3250 20.17 PRJNA58703 Bacteria Delftia acidovorans SPH-1 6767514 6040 6114 14.18 PRJNA67319 Bacteria Delftia sp. Cs1-4 6685842 5861 5908 12.24 PRJNA46657 Bacteria Denitrovibrio acetiphilus DSM 12809 3222077 2964 3021 14.15 PRJNA51371 Bacteria Desulfarculus baarsii DSM 2075 3655731 3277 3295 13.68 PRJNA58913 Bacteria Desulfatibacillum alkenivorans AK-01 6517073 5255 5289 17.06 PRJNA82553 Bacteria Desulfitobacterium dehalogenans ATCC 51507 4321753 4011 4142 16.45 PRJNA57749 Bacteria Desulfitobacterium hafniense DCB-2 5279134 4883 4927 13.70 PRJNA58605 Bacteria Desulfitobacterium hafniense Y51 5727534 5060 5425 14.66 PRJNA65785 Bacteria Desulfobacca acetoxidans DSM 11109 3282536 2866 2971 17.73 PRJNA59061 Bacteria Desulfobacterium autotrophicum HRM2 5657782 4947 4925 16.87 PRJNA58777 Bacteria Desulfococcus oleovorans Hxd3 3944167 3265 3316 13.93 PRJNA59183 Bacteria Desulfohalobium retbaense DSM 5692 2909567 2526 2552 13.61 PRJNA59217 Bacteria Desulfomicrobium baculatum DSM 4028 3942657 3436 3454 12.31

Page 27: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA168320 Bacteria Desulfomonile tiedjei DSM 6799 6527027 5494 5637 23.92 PRJNA156759 Bacteria Desulfosporosinus acidiphilus SJ4 4991181 4505 4636 19.10 PRJNA75097 Bacteria Desulfosporosinus meridiei DSM 13257 4873567 4352 4483 16.25 PRJNA82939 Bacteria Desulfosporosinus orientis DSM 765 5863081 5242 5436 14.53 PRJNA58153 Bacteria Desulfotalea psychrophila LSv54 3659634 3236 3243 18.88 PRJNA59109 Bacteria Desulfotomaculum acetoxidans DSM 771 4545624 4068 4291 21.53 PRJNA67317 Bacteria Desulfotomaculum carboxydivorans CO-1-SRB 2892255 2657 2815 13.69 PRJNA67357 Bacteria Desulfotomaculum kuznetsovii DSM 6115 3601386 3398 3543 17.92 PRJNA58277 Bacteria Desulfotomaculum reducens MI-1 3608104 3276 3520 15.97 PRJNA67507 Bacteria Desulfotomaculum ruminis DSM 2154 3969014 3796 3920 17.06 PRJNA42613 Bacteria Desulfovibrio aespoeensis Aspo-2 3629109 3304 3342 14.91 PRJNA66847 Bacteria Desulfovibrio africanus str. Walvis Bay 4200534 3725 3782 15.96 PRJNA57941 Bacteria Desulfovibrio alaskensis G20 3730232 3258 3348 14.61 PRJNA63159 Bacteria Desulfovibrio desulfuricans ND132 3858580 3454 3476 13.98

PRJNA59213 Bacteria Desulfovibrio desulfuricans subsp. desulfuricans str. ATCC 27774 2873437 2356 2398 15.04

PRJNA59309 Bacteria Desulfovibrio magneticus RS-1 5315620 4704 4560 21.30 PRJNA59223 Bacteria Desulfovibrio salexigens DSM 2638 4289847 3807 3846 14.37 PRJNA58679 Bacteria Desulfovibrio vulgaris DP4 3661391 3091 3141 15.10 PRJNA161961 Bacteria Desulfovibrio vulgaris RCH1 3734357 3221 3247 16.79 PRJNA59089 Bacteria Desulfovibrio vulgaris str. 'Miyazaki F' 4040304 3180 3240 13.71 PRJNA57645 Bacteria Desulfovibrio vulgaris str. Hildenborough 3773159 3531 3313 21.04 PRJNA45897 Bacteria Desulfurispirillum indicum S5 2928377 2571 2652 12.87 PRJNA49487 Bacteria Desulfurivibrio alkaliphilus AHT2 3097763 2620 2697 11.89

PRJNA63405 Bacteria Desulfurobacterium thermolithotrophum DSM 11699 1541968 1509 1553 8.33

PRJNA57643 Bacteria Dichelobacter nodosus VCS1703A 1389350 1280 1286 9.63 PRJNA52537 Bacteria Dickeya dadantii 3937 4922802 4571 4291 10.74 PRJNA42519 Bacteria Dickeya dadantii Ech586 4818394 4144 4201 7.87 PRJNA59363 Bacteria Dickeya dadantii Ech703 4679450 3970 4009 6.60 PRJNA59297 Bacteria Dickeya zeae Ech1591 4813854 4163 4229 8.38 PRJNA59439 Bacteria Dictyoglomus thermophilum H-6-12 1959987 1912 1889 10.42

Page 28: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59177 Bacteria Dictyoglomus turgidum DSM 6724 1855560 1744 1817 8.87 PRJNA58707 Bacteria Dinoroseobacter shibae DFL 12 4417868 4194 4281 12.38 PRJNA59049 Bacteria Dyadobacter fermentans DSM 18053 6967790 5719 5866 18.51 PRJNA159657 Bacteria Edwardsiella tarda FL6-60 3728801 3256 3244 7.00 PRJNA59079 Bacteria Eggerthella lenta DSM 2243 3632260 3070 3107 15.35 PRJNA68707 Bacteria Eggerthella sp. YY7918 3123671 2680 2638 15.01 PRJNA58071 Bacteria Ehrlichia canis str. Jake 1315030 925 981 20.46 PRJNA57933 Bacteria Ehrlichia chaffeensis str. Arkansas 1176248 1105 945 25.37 PRJNA58245 Bacteria Ehrlichia ruminantium str. Gardel 1499920 950 970 20.68 PRJNA58013 Bacteria Ehrlichia ruminantium str. Welgevonden 1516355 920 974 19.01 PRJNA58243 Bacteria Ehrlichia ruminantium str. Welgevonden 1512977 958 979 20.50 PRJNA58949 Bacteria Elusimicrobium minutum Pei191 1643562 1529 1544 19.75 PRJNA60431 Bacteria Emticicia oligotrophica DSM 17448 5220361 4267 4319 15.65 PRJNA68103 Bacteria Enterobacter aerogenes KCTC 2190 5280350 4912 4868 8.25 PRJNA72793 Bacteria Enterobacter asburiae LF7a 5012132 4612 4664 7.45 PRJNA80739 Bacteria Enterobacter cloacae EcWSU1 4798091 4619 4498 8.45 PRJNA59969 Bacteria Enterobacter cloacae SCF1 4814049 4399 4390 5.58

PRJNA48363 Bacteria Enterobacter cloacae subsp. cloacae ATCC 13047 5598796 5518 5452 12.78

PRJNA172463 Bacteria Enterobacter cloacae subsp. cloacae ENHKU01 4726582 4338 4402 5.95

PRJNA45967 Bacteria Enterobacter cloacae subsp. cloacae NCTC 9394 4908759 3725 4580 8.51

PRJNA168997 Bacteria Enterobacter cloacae subsp. dissolvens SDM 4968248 4542 4603 6.76 PRJNA58727 Bacteria Enterobacter sp. 638 4676461 4240 4412 7.65 PRJNA159663 Bacteria Enterococcus faecalis 62 3130818 3075 3014 16.11 PRJNA171261 Bacteria Enterococcus faecalis D32 3062505 2965 2972 17.25 PRJNA54927 Bacteria Enterococcus faecalis OG1RF 2739625 2579 2507 11.74 PRJNA57669 Bacteria Enterococcus faecalis V583 3359974 3265 3273 19.65 PRJNA55353 Bacteria Enterococcus faecium DO 3052572 3114 2951 17.39 PRJNA70619 Bacteria Enterococcus hirae ATCC 9790 2856440 2755 2709 22.02 PRJNA39181 Bacteria Enterococcus sp. 7L76 3096657 2295 2983 16.37

Page 29: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA46943 Bacteria Erwinia amylovora ATCC 49946 3905604 3612 3496 13.08 PRJNA46839 Bacteria Erwinia amylovora CFBP1430 3833832 3735 3420 15.25 PRJNA50547 Bacteria Erwinia billingiae Eb661 5372268 4931 4890 9.07 PRJNA159693 Bacteria Erwinia pyrifoliae DSM 12163 4072827 4038 3746 17.60 PRJNA40659 Bacteria Erwinia pyrifoliae Ep1 96 4072846 3755 3733 14.09 PRJNA159955 Bacteria Erwinia sp. Ejp617 3957675 3672 3607 14.08 PRJNA59029 Bacteria Erwinia tasmaniensis Et1 99 4067864 3696 3723 11.74 PRJNA68021 Bacteria Erysipelothrix rhusiopathiae str. Fujisawa 1787941 1704 1695 15.98 PRJNA58299 Bacteria Erythrobacter litoralis HTCC2594 3052398 3011 2969 16.00 PRJNA165043 Bacteria Escherichia blattae DSM 4481 4158725 3904 3818 8.08 PRJNA59245 Bacteria Escherichia coli 'BL21-Gold(DE3)pLysS AG' 4570938 4228 4298 6.50 PRJEA161985 Bacteria Escherichia coli 042 5355323 5038 5053 9.57 PRJNA58531 Bacteria Escherichia coli 536 4938920 4685 4547 7.63 PRJNA59383 Bacteria Escherichia coli 55989 5154862 4919 4846 8.86 PRJNA161975 Bacteria Escherichia coli ABU 83972 5132961 4796 4740 8.10 PRJNA58623 Bacteria Escherichia coli APEC O1 5497653 4890 5260 10.90 PRJNA58783 Bacteria Escherichia coli ATCC 8739 4746218 4200 4388 5.33 PRJNA58803 Bacteria Escherichia coli B str. REL606 4629812 4209 4313 6.04 PRJEA161949 Bacteria Escherichia coli BL21(DE3) 4558947 4319 4267 6.18 PRJNA161947 Bacteria Escherichia coli BL21(DE3) 4558953 4159 4267 5.65 PRJNA59391 Bacteria Escherichia coli BW2952 4578159 4084 4273 5.16 PRJNA57915 Bacteria Escherichia coli CFT073 5231428 5379 4900 12.62 PRJDA162051 Bacteria Escherichia coli DH1 4621430 4260 4300 5.82 PRJNA161951 Bacteria Escherichia coli DH1 4630707 4160 4305 5.78 PRJNA58395 Bacteria Escherichia coli E24377A 5249288 4997 5002 11.94 PRJNA59379 Bacteria Escherichia coli ED1a 5209548 5123 4989 9.69 PRJEA161993 Bacteria Escherichia coli ETEC H10407 5325888 4974 5130 9.77 PRJNA58393 Bacteria Escherichia coli HS 4643538 4384 4322 8.42 PRJNA59377 Bacteria Escherichia coli IAI1 4700560 4443 4326 7.04 PRJNA59381 Bacteria Escherichia coli IAI39 5132068 4892 4861 8.75 PRJNA162007 Bacteria Escherichia coli IHE3034 5108383 4757 4818 8.80

Page 30: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA162099 Bacteria Escherichia coli KO11FL 5027172 4709 4677 10.04 PRJNA52593 Bacteria Escherichia coli KO11FL 5029323 4653 4718 7.59 PRJNA161965 Bacteria Escherichia coli LF82 4773108 4376 4411 5.53 PRJNA162139 Bacteria Escherichia coli NA114 4935241 4873 4704 9.54 PRJNA41013 Bacteria Escherichia coli O103:H2 str. 12009 5524860 5354 5329 9.74 PRJNA41023 Bacteria Escherichia coli O111:H- str. 11128 5766081 5732 5676 13.17 PRJNA59343 Bacteria Escherichia coli O127:H6 str. E2348 69 5069678 4824 4879 9.41 PRJNA59091 Bacteria Escherichia coli O157:H7 str. EC4115 5704171 5477 5540 12.76 PRJNA57831 Bacteria Escherichia coli O157:H7 str. EDL933 5620522 5449 5408 10.65 PRJNA57781 Bacteria Escherichia coli O157:H7 str. Sakai 5594477 5447 5382 10.63 PRJNA59235 Bacteria Escherichia coli O157:H7 str. TW14359 5622737 5373 5436 10.54 PRJNA41021 Bacteria Escherichia coli O26:H11 str. 11368 5855531 5795 5760 10.56 PRJNA46655 Bacteria Escherichia coli O55:H7 str. CB9615 5452353 5121 5149 9.02 PRJNA162153 Bacteria Escherichia coli O55:H7 str. RM12579 5448306 5128 5136 9.81 PRJNA162115 Bacteria Escherichia coli O7:K1 str. CE10 5378729 5080 5143 9.61 PRJNA161987 Bacteria Escherichia coli O83:H1 str. NRG 857C 4894879 4582 4549 6.75 PRJNA162061 Bacteria Escherichia coli P12b 4935294 4393 4822 8.69 PRJNA62979 Bacteria Escherichia coli S88 5166121 4991 4886 8.24 PRJNA59425 Bacteria Escherichia coli SE11 5155626 5002 4888 9.96 PRJDA161939 Bacteria Escherichia coli SE15 4839683 4488 4449 5.82 PRJNA58919 Bacteria Escherichia coli SMS-3-5 5215377 4913 4820 9.22 PRJNA162049 Bacteria Escherichia coli str. 'clone D i14' 5038386 4919 4609 10.33 PRJNA162047 Bacteria Escherichia coli str. 'clone D i2' 5038386 4919 4609 10.33 PRJNA58979 Bacteria Escherichia coli str. K-12 substr. DH10B 4686137 4200 4398 8.06 PRJNA57779 Bacteria Escherichia coli str. K-12 substr. MG1655 4639675 4320 4306 5.92 PRJNA161931 Bacteria Escherichia coli str. K-12 substr. W3110 4646332 4337 4323 5.69 PRJNA162043 Bacteria Escherichia coli UM146 5107563 4783 4802 7.98 PRJNA62981 Bacteria Escherichia coli UMN026 5358200 5107 5018 8.74 PRJNA161991 Bacteria Escherichia coli UMNK88 5666764 5608 5579 14.61 PRJNA58541 Bacteria Escherichia coli UTI89 5179971 5211 4838 11.19 PRJNA162011 Bacteria Escherichia coli W 5008864 4700 4702 7.60

Page 31: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA162101 Bacteria Escherichia coli W 5005347 4731 4691 7.75 PRJNA163995 Bacteria Escherichia coli Xuzhou21 5516736 5183 5276 9.75 PRJNA59375 Bacteria Escherichia fergusonii ATCC 35469 4643861 4377 4318 7.33 PRJNA46255 Bacteria Ethanoligenens harbinense YUAN-3 3008576 2701 2798 16.00 PRJNA59171 Bacteria Eubacterium eligens ATCC 27750 2831389 2765 2651 19.05 PRJNA59777 Bacteria Eubacterium limosum KIST612 4316707 4579 4090 22.69 PRJNA59169 Bacteria Eubacterium rectale ATCC 33656 3449685 3626 3273 24.45 PRJNA39159 Bacteria Eubacterium rectale DSM 17629 3344951 2898 3172 22.59 PRJNA39161 Bacteria Eubacterium rectale M104 1 3698419 3212 3543 27.58 PRJNA39157 Bacteria Eubacterium siraeum 70 3 2943413 2347 2655 23.61 PRJNA45919 Bacteria Eubacterium siraeum V10Sc8a 2836123 2211 2496 21.24 PRJNA58053 Bacteria Exiguobacterium sibiricum 255-15 3040786 3015 3075 14.73 PRJNA59093 Bacteria Exiguobacterium sp. AT1b 2999895 3020 3050 14.96 PRJNA45961 Bacteria Faecalibacterium prausnitzii L2-6 3321367 2756 3183 21.75 PRJNA39151 Bacteria Faecalibacterium prausnitzii SL3 3 3214418 2746 3061 22.34 PRJNA53371 Bacteria Ferrimonas balearica DSM 9799 4279159 3782 3800 10.46 PRJNA58625 Bacteria Fervidobacterium nodosum Rt17-B1 1948941 1750 1824 12.03 PRJNA78143 Bacteria Fervidobacterium pennivorans DSM 9078 2166381 1947 1998 12.98

PRJNA161919 Bacteria Fibrobacter succinogenes subsp. succinogenes S85 3843004 2871 3140 23.79

PRJNA41169 Bacteria Fibrobacter succinogenes subsp. succinogenes S85 3842635 3085 3117 23.83

PRJNA46625 Bacteria Filifactor alocis ATCC 35896 1931012 1641 1692 18.66 PRJNA58867 Bacteria Finegoldia magna ATCC 29328 1986740 1813 1845 14.93 PRJNA59413 Bacteria Flavobacteriaceae bacterium 3519-10 2768102 2534 2513 19.60 PRJNA73421 Bacteria Flavobacterium branchiophilum FL-15 3563292 2872 3115 23.05 PRJNA80731 Bacteria Flavobacterium columnare ATCC 49512 3162432 2642 2734 19.61 PRJNA157999 Bacteria Flavobacterium indicum GPTSA100-9 2993089 2671 2738 19.52 PRJNA58493 Bacteria Flavobacterium johnsoniae UW101 6096872 5017 5199 20.78 PRJNA61627 Bacteria Flavobacterium psychrophilum JIP02 86 2861988 2432 2516 18.88 PRJNA168257 Bacteria Flexibacter litoralis DSM 6794 4919337 3878 4060 30.10 PRJNA68147 Bacteria Flexistipes sinusarabici DSM 4947 2526590 2346 2376 12.22

Page 32: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA65271 Bacteria Fluviicola taffensis DSM 16823 4633577 4033 4095 27.23 PRJNA162107 Bacteria Francisella cf. novicida 3523 1945310 1854 1838 14.25 PRJNA162105 Bacteria Francisella cf. novicida Fx1 1913619 1818 1793 13.24

PRJNA164779 Bacteria Francisella noatunensis subsp. orientalis str. Toba 04 1847202 1595 2213 19.54

PRJNA58499 Bacteria Francisella novicida U112 1910031 1719 1799 12.51

PRJNA59105 Bacteria Francisella philomiragia subsp. philomiragia ATCC 25017 2049711 1915 1972 13.89

PRJNA68321 Bacteria Francisella sp. TX077308 2035931 1976 1960 14.43

PRJNA58999 Bacteria Francisella tularensis subsp. holarctica FTNF002-00 1890909 1580 2089 18.94

PRJNA58595 Bacteria Francisella tularensis subsp. holarctica LVS 1895994 1967 2093 18.52 PRJNA32025 Bacteria Francisella tularensis subsp. holarctica OSU18 1895727 0* 2109 20.39 PRJNA58687 Bacteria Francisella tularensis subsp. holarctica OSU18 1895727 1555 2109 19.16

PRJNA58939 Bacteria Francisella tularensis subsp. mediasiatica FSC147 1893886 1406 2102 17.96

PRJNA58693 Bacteria Francisella tularensis subsp. tularensis FSC198 1892616 1804 2030 18.62

PRJNA161973 Bacteria Francisella tularensis subsp. tularensis NE061598 1892681 1836 2029 21.50

PRJNA57589 Bacteria Francisella tularensis subsp. tularensis SCHU S4 1892775 1804 2029 18.63 PRJNA89373 Bacteria Francisella tularensis subsp. tularensis TI0902 1892744 1544 2028 18.11 PRJNA89379 Bacteria Francisella tularensis subsp. tularensis TIGB03 1968651 1624 2124 21.93

PRJNA58811 Bacteria Francisella tularensis subsp. tularensis WY96-3418 1898476 1634 2026 18.77

PRJNA58695 Bacteria Frankia alni ACN14a 7497934 6723 6075 22.03 PRJNA58397 Bacteria Frankia sp. CcI3 5433628 4499 4761 20.97 PRJNA58367 Bacteria Frankia sp. EAN1pec 8982042 7191 7577 19.45 PRJNA42615 Bacteria Frankia sp. EuI1c 8815781 7083 7197 15.22 PRJNA46257 Bacteria Frankia symbiont of Datisca glomerata 5340989 4215 4689 22.61 PRJNA81775 Bacteria Frateuria aurantia DSM 6220 3603458 3101 3190 12.62

PRJNA57885 Bacteria Fusobacterium nucleatum subsp. nucleatum ATCC 25586 2174500 2067 2012 15.54

PRJNA66567 Bacteria Gallibacterium anatis UMN179 2694139 2500 2505 10.75

Page 33: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA51505 Bacteria Gallionella capsiferriformans ES-2 3162471 2894 2935 15.71 PRJNA51635 Bacteria gamma proteobacterium HdN1 4587455 3808 3861 14.54 PRJNA43211 Bacteria Gardnerella vaginalis 409-05 1617545 1258 1267 17.43 PRJNA55487 Bacteria Gardnerella vaginalis ATCC 14019 1667350 1365 1297 15.85 PRJNA162045 Bacteria Gardnerella vaginalis HMP9231 1726519 1317 1322 14.21 PRJNA58813 Bacteria Gemmatimonas aurantiaca T-27 4636964 3935 3998 17.50 PRJNA58227 Bacteria Geobacillus kaustophilus HTA426 3592666 3540 3576 14.26 PRJNA49467 Bacteria Geobacillus sp. C56-T3 3650813 3315 3600 12.81 PRJNA59045 Bacteria Geobacillus sp. WCH70 3508804 3168 3429 13.60 PRJNA55779 Bacteria Geobacillus sp. Y4.1MC1 3911947 3669 3823 13.52 PRJNA55381 Bacteria Geobacillus sp. Y412MC52 3673940 3459 3603 13.20 PRJNA41171 Bacteria Geobacillus sp. Y412MC61 3667901 3446 3601 13.18 PRJNA58829 Bacteria Geobacillus thermodenitrificans NG80-2 3608012 3445 3564 13.81 PRJNA48129 Bacteria Geobacillus thermoglucosidasius C56-YS93 3993793 3759 3951 14.82 PRJNA82949 Bacteria Geobacillus thermoleovorans CCB US3 UF5 3596620 3887 3559 18.18 PRJNA58749 Bacteria Geobacter bemidjiensis Bem 4615150 4057 4033 14.36 PRJNA58543 Bacteria Geobacter daltonii FRC-32 4304501 3798 3833 13.96 PRJNA58713 Bacteria Geobacter lovleyi SZ 3994874 3685 3714 15.25 PRJNA57731 Bacteria Geobacter metallireducens GS-15 4011182 3532 3592 12.20 PRJNA55771 Bacteria Geobacter sp. M18 5277406 4434 4484 16.37 PRJNA59037 Bacteria Geobacter sp. M21 4745806 4080 4100 14.67 PRJNA161977 Bacteria Geobacter sulfurreducens KN400 3714272 3328 3340 11.38 PRJNA57743 Bacteria Geobacter sulfurreducens PCA 3814139 3447 3417 12.97 PRJNA58475 Bacteria Geobacter uraniireducens Rf4 5136364 4357 4540 14.23 PRJNA43725 Bacteria Geodermatophilus obscurus DSM 43160 5322497 4811 5084 17.54 PRJNA73759 Bacteria Glaciecola nitratireducens FR1064 4134229 3654 3498 13.45 PRJNA66595 Bacteria Glaciecola sp. 4H-3-7+YE-5 5393591 4547 4609 13.62 PRJNA58011 Bacteria Gloeobacter violaceus PCC 7421 4659019 4430 4512 19.24 PRJNA59075 Bacteria Gluconacetobacter diazotrophicus PAl 5 3914947 3501 3625 14.59 PRJNA61587 Bacteria Gluconacetobacter diazotrophicus PAl 5 3999591 3938 3757 16.52 PRJNA46523 Bacteria Gluconacetobacter xylinus NBRC 3288 3513191 3195 3342 18.60

Page 34: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58239 Bacteria Gluconobacter oxydans 621H 2922384 2664 2836 17.75 PRJNA41403 Bacteria Gordonia bronchialis DSM 43247 5290012 4696 4996 16.94 PRJNA86651 Bacteria Gordonia polyisoprenivorans VH2 5844299 5110 5239 13.98 PRJNA39175 Bacteria Gordonibacter pamelaeae 7-10-1-b 3608022 2027 3220 23.35 PRJNA58881 Bacteria Gramella forsetii KT0803 3798465 3584 3449 19.21 PRJNA58661 Bacteria Granulibacter bethesdensis CGDNIH1 2708355 2437 2446 12.47 PRJNA49957 Bacteria Granulicella mallensis MP5ACTX8 6237577 4815 4903 19.13 PRJNA50551 Bacteria Granulicella tundricola MP5ACTX9 5503984 4542 4698 21.61 PRJNA57625 Bacteria Haemophilus ducreyi 35000HP 1698955 1717 1689 15.50 PRJNA86647 Bacteria Haemophilus influenzae 10810 1981535 1914 1901 9.72 PRJNA58093 Bacteria Haemophilus influenzae 86-028NP 1914490 1793 1838 8.76 PRJNA62123 Bacteria Haemophilus influenzae F3031 1985832 1816 1949 8.95 PRJNA62097 Bacteria Haemophilus influenzae F3047 2007018 1820 1976 10.06 PRJNA58591 Bacteria Haemophilus influenzae PittEE 1813033 1619 1774 7.16 PRJNA58593 Bacteria Haemophilus influenzae PittGG 1887192 1667 1963 9.94 PRJNA161921 Bacteria Haemophilus influenzae R2846 1819370 1636 1699 5.22 PRJNA161923 Bacteria Haemophilus influenzae R2866 1932306 1795 1832 6.89 PRJNA57771 Bacteria Haemophilus influenzae Rd KW20 1830138 1709 1745 5.50 PRJNA72801 Bacteria Haemophilus parainfluenzae T3T1 2086875 1993 1971 8.70 PRJNA59273 Bacteria Haemophilus parasuis SH0165 2269156 2031 2206 11.99 PRJNA57929 Bacteria Haemophilus somnus 129PT 2012878 1798 1822 9.72 PRJNA57979 Bacteria Haemophilus somnus 2336 2263857 1980 2058 11.19 PRJNA58483 Bacteria Hahella chejuensis KCTC 2396 7215267 6782 6396 23.29 PRJNA60191 Bacteria Halanaerobium hydrogeniformans 2613117 2295 2422 8.78 PRJNA161959 Bacteria Halanaerobium praevalens DSM 2228 2309262 2068 2118 8.46 PRJNA41425 Bacteria Haliangium ochraceum DSM 14365 9446314 6719 6870 24.45 PRJNA66777 Bacteria Haliscomenobacter hydrossis DSM 1100 8771651 6752 6936 22.36 PRJEA162033 Bacteria Halobacillus halophilus DSM 2266 4170008 4135 4079 18.32 PRJNA52781 Bacteria Halomonas elongata DSM 2581 4061296 3474 3751 8.62 PRJNA58473 Bacteria Halorhodospira halophila SL1 2678452 2407 2462 8.30 PRJNA58585 Bacteria Halothermothrix orenii H 168 2578146 2342 2412 11.23

Page 35: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA41317 Bacteria Halothiobacillus neapolitanus c2 2582886 2357 2408 10.77 PRJNA58685 Bacteria Helicobacter acinonychis str. Sheeba 1557588 1618 1558 21.38 PRJNA68141 Bacteria Helicobacter bizzozeronii CIII-1 1807534 1971 1875 24.83 PRJNA162217 Bacteria Helicobacter cetorum MIT 00-7128 1960111 1731 1783 22.11 PRJNA162215 Bacteria Helicobacter cetorum MIT 99-5656 1847790 1689 1741 22.04 PRJDA162219 Bacteria Helicobacter cinaedi PAGU611 2101402 2125 2136 24.48 PRJNA61409 Bacteria Helicobacter felis ATCC 49179 1672681 1668 1666 21.66 PRJNA57737 Bacteria Helicobacter hepaticus ATCC 51449 1799146 1875 1803 21.67 PRJNA46647 Bacteria Helicobacter mustelae 12198 1578097 1430 1428 13.82 PRJNA161151 Bacteria Helicobacter pylori 2017 1548238 1593 1501 19.72 PRJNA161159 Bacteria Helicobacter pylori 2018 1562832 1603 1508 19.32 PRJNA57787 Bacteria Helicobacter pylori 26695 1667867 1566 1578 19.12 PRJNA49903 Bacteria Helicobacter pylori 35A 1566655 1470 1499 17.82 PRJNA161925 Bacteria Helicobacter pylori 51 1589954 1415 1514 16.01 PRJNA159983 Bacteria Helicobacter pylori 52 1568826 1405 1514 15.90 PRJNA161153 Bacteria Helicobacter pylori 83 1617426 1609 1556 19.46 PRJNA159985 Bacteria Helicobacter pylori 908 1549666 1595 1511 19.41 PRJNA59415 Bacteria Helicobacter pylori B38 1576758 1528 1501 16.21 PRJNA49873 Bacteria Helicobacter pylori B8 1680029 1720 1582 21.59 PRJNA159987 Bacteria Helicobacter pylori Cuz20 1635449 1564 1534 18.24 PRJNA158157 Bacteria Helicobacter pylori ELS37 1669876 1576 1566 18.52 PRJNA161145 Bacteria Helicobacter pylori F16 1575399 1500 1507 16.46 PRJNA159991 Bacteria Helicobacter pylori F30 1579693 1484 1501 15.54 PRJNA161139 Bacteria Helicobacter pylori F32 1581461 1491 1509 16.77 PRJNA161143 Bacteria Helicobacter pylori F57 1609006 1520 1535 17.25 PRJNA59305 Bacteria Helicobacter pylori G27 1663013 1504 1574 17.58 PRJNA159493 Bacteria Helicobacter pylori Gambia94 24 1712468 1605 1589 20.73 PRJNA58517 Bacteria Helicobacter pylori HPAG1 1605736 1544 1504 16.77 PRJNA162213 Bacteria Helicobacter pylori HUP-B14 1607584 1501 1501 17.39 PRJNA161149 Bacteria Helicobacter pylori India7 1675918 1600 1572 20.74 PRJNA57789 Bacteria Helicobacter pylori J99 1643831 1491 1504 16.09

Page 36: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA159491 Bacteria Helicobacter pylori Lithuania75 1640673 1565 1561 20.44 PRJNA59327 Bacteria Helicobacter pylori P12 1684038 1579 1589 19.61 PRJNA162211 Bacteria Helicobacter pylori PeCan18 1660685 1495 1546 17.86 PRJNA53539 Bacteria Helicobacter pylori PeCan4 1638269 1563 1534 18.82 PRJNA159611 Bacteria Helicobacter pylori Puno120 1637762 1540 1534 18.67 PRJNA161157 Bacteria Helicobacter pylori Puno135 1646139 1573 1538 18.39 PRJNA159467 Bacteria Helicobacter pylori Sat464 1567570 1508 1465 16.85 PRJNA162207 Bacteria Helicobacter pylori Shi112 1663456 1564 1566 18.85 PRJNA162209 Bacteria Helicobacter pylori Shi169 1616909 1525 1518 17.91 PRJNA162205 Bacteria Helicobacter pylori Shi417 1665719 1548 1542 18.35 PRJNA59165 Bacteria Helicobacter pylori Shi470 1608548 1569 1518 19.40 PRJNA53541 Bacteria Helicobacter pylori SJM180 1658051 1581 1525 19.12 PRJNA159615 Bacteria Helicobacter pylori SNT49 1610830 1519 1518 18.57 PRJNA159989 Bacteria Helicobacter pylori SouthAfrica7 1679829 1572 1570 19.10 PRJNA159639 Bacteria Helicobacter pylori v225d 1595604 1550 1511 18.78 PRJNA165869 Bacteria Helicobacter pylori XZ274 1656544 1438 1701 21.54 PRJNA58279 Bacteria Heliobacterium modesticaldum Ice1 3075407 3000 2750 19.97 PRJNA50427 Bacteria Herbaspirillum seropedicae SmR1 5513887 4737 4761 10.58 PRJNA58291 Bacteria Herminiimonas arsenicoxydans 3424307 3294 3260 13.98 PRJNA58599 Bacteria Herpetosiphon aurantiacus DSM 785 6785430 5278 5616 21.44 PRJNA65267 Bacteria Hippea maritima DSM 10411 1694430 1677 1734 9.73 PRJNA59365 Bacteria Hirschia baltica ATCC 49814 3540114 3187 3209 13.43 PRJNA159875 Bacteria Hydrogenobacter thermophilus TK-6 1742932 1869 1901 14.77 PRJNA45927 Bacteria Hydrogenobacter thermophilus TK-6 1743135 1894 1904 15.11 PRJNA67353 Bacteria Hydrogenobaculum sp. 3684 1552775 1578 1619 13.45 PRJNA58857 Bacteria Hydrogenobaculum sp. Y04AAS1 1559514 1629 1632 14.14 PRJNA50325 Bacteria Hyphomicrobium denitrificans ATCC 51888 3638969 3512 3556 22.11 PRJNA68453 Bacteria Hyphomicrobium sp. MC1 4757528 4908 4637 23.61 PRJNA58433 Bacteria Hyphomonas neptunium ATCC 15444 3705021 3505 3498 14.01 PRJNA58087 Bacteria Idiomarina loihiensis L2TR 2839318 2628 2658 8.97 PRJNA162097 Bacteria Ignavibacterium album JCM 16511 3658997 3195 3149 15.45

Page 37: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59769 Bacteria Ilyobacter polytropus DSM 2926 3132314 2880 2940 14.48 PRJNA61729 Bacteria Intrasporangium calvum DSM 43043 4024382 3563 3653 13.75 PRJNA67501 Bacteria Isoptericola variabilis 225 3307740 2881 3004 12.47 PRJNA62207 Bacteria Isosphaera pallida ATCC 43644 5529304 3722 3818 23.17 PRJNA58147 Bacteria Jannaschia sp. CCS1 4404049 4283 4351 13.82 PRJNA58603 Bacteria Janthinobacterium sp. Marseille 4110251 3697 3841 11.95 PRJNA59053 Bacteria Jonesia denitrificans DSM 20603 2749646 2511 2554 17.47 PRJNA59209 Bacteria Kangiella koreensis DSM 16069 2852073 2632 2637 13.99 PRJNA161161 Bacteria Ketogulonicigenium vulgare WSH-001 3277101 3054 3160 11.39 PRJNA59581 Bacteria Ketogulonicigenium vulgare Y25 3288404 3213 3298 11.64 PRJNA58067 Bacteria Kineococcus radiotolerans SRS30216 4956672 4681 4699 18.49 PRJNA77027 Bacteria Kitasatospora setae KM-6054 8783278 7569 7606 19.54 PRJNA170256 Bacteria Klebsiella oxytoca E718 6450897 5909 6074 10.86 PRJNA83159 Bacteria Klebsiella oxytoca KCTC 1686 5974109 5488 5438 6.77 PRJNA162147 Bacteria Klebsiella pneumoniae KCTC 2242 5462423 5152 5076 7.34 PRJNA174151 Bacteria Klebsiella pneumoniae subsp. pneumoniae 1084 5386705 4962 4872 7.05

PRJNA84387 Bacteria Klebsiella pneumoniae subsp. pneumoniae HS11286 5682322 5779 5454 12.74

PRJNA57619 Bacteria Klebsiella pneumoniae subsp. pneumoniae MGH 78578 5694894 5185 5298 8.22

PRJNA59073 Bacteria Klebsiella pneumoniae subsp. pneumoniae NTUH-K2044 5472672 5287 5027 7.85

PRJNA42113 Bacteria Klebsiella variicola At-22 5458505 5057 5051 5.85 PRJNA59099 Bacteria Kocuria rhizophila DC2201 2697540 2356 2313 12.10 PRJNA59205 Bacteria Kosmotoga olearia TBF 19.5.1 2302126 2118 2169 13.18 PRJNA43465 Bacteria Kribbella flavida DSM 17836 7579488 6943 7065 15.35 PRJNA66593 Bacteria Krokinobacter sp. 4H-3-7-5 3389993 2978 2993 16.81 PRJNA48361 Bacteria Kyrpidia tusciae DSM 2912 3384766 3150 3343 13.83 PRJNA59071 Bacteria Kytococcus sedentarius DSM 20547 2785024 2554 2619 14.75 PRJNA68067 Bacteria Lacinutrix sp. 5H-3-7-4 3296168 2967 2979 18.43 PRJNA63605 Bacteria Lactobacillus acidophilus 30SC 2097766 2059 2096 16.73 PRJNA57685 Bacteria Lactobacillus acidophilus NCFM 1993560 1862 1880 13.04

Page 38: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA61179 Bacteria Lactobacillus amylovorus GRL 1112 2126674 2121 2164 19.74 PRJNA160233 Bacteria Lactobacillus amylovorus GRL1118 1977087 1920 1957 16.17 PRJNA57989 Bacteria Lactobacillus brevis ATCC 367 2340228 2218 2280 15.58 PRJNA66205 Bacteria Lactobacillus buchneri NRRL B-30929 2588309 2392 2508 17.18 PRJNA57985 Bacteria Lactobacillus casei ATCC 334 2924325 2771 2857 18.23 PRJNA162119 Bacteria Lactobacillus casei BD-II 3127288 3204 3083 20.30 PRJNA59237 Bacteria Lactobacillus casei BL23 3079196 3044 3013 18.08 PRJNA162121 Bacteria Lactobacillus casei LC2W 3077434 3164 3006 19.90 PRJNA50673 Bacteria Lactobacillus casei str. Zhang 2898335 2848 2719 18.21 PRJNA48359 Bacteria Lactobacillus crispatus ST1 2043161 2024 1988 14.46 PRJNA161929 Bacteria Lactobacillus delbrueckii subsp. bulgaricus 2038 1872918 1792 1933 16.70

PRJNA58647 Bacteria Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842 1864998 2096 1912 20.99

PRJNA57987 Bacteria Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365 1856951 1721 1926 20.73

PRJNA60621 Bacteria Lactobacillus delbrueckii subsp. bulgaricus ND02 2131976 2018 2060 18.37

PRJNA162003 Bacteria Lactobacillus fermentum CECT 5716 2100449 1051 2328 11.31 PRJNA58865 Bacteria Lactobacillus fermentum IFO 3956 2098685 1843 2088 11.50 PRJNA57687 Bacteria Lactobacillus gasseri ATCC 33323 1894360 1755 1807 15.36 PRJNA58761 Bacteria Lactobacillus helveticus DPC 4571 2080931 1610 2203 14.11 PRJNA162017 Bacteria Lactobacillus helveticus H10 2172383 1978 2276 16.46 PRJNA174439 Bacteria Lactobacillus helveticus R0052 2129206 2011 2288 17.93 PRJNA162057 Bacteria Lactobacillus johnsonii DPC 6026 1966342 1772 1868 13.02 PRJNA41735 Bacteria Lactobacillus johnsonii FI9785 1785116 1737 1745 12.87 PRJNA58029 Bacteria Lactobacillus johnsonii NCC533 1992676 1821 1878 12.44 PRJNA67985 Bacteria Lactobacillus kefiranofaciens ZW3 2354088 2162 2426 21.19 PRJNA59361 Bacteria Lactobacillus plantarum JDM1 3197759 2948 2986 13.65

PRJNA53537 Bacteria Lactobacillus plantarum subsp. plantarum ST-III 3307936 3038 3086 15.63

PRJNA62911 Bacteria Lactobacillus plantarum WCFS1 3348624 3108 3132 14.73 PRJNA58471 Bacteria Lactobacillus reuteri DSM 20016 1999618 1900 1993 13.90

Page 39: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58875 Bacteria Lactobacillus reuteri JCM 1112 2039414 1820 2028 12.76 PRJNA55357 Bacteria Lactobacillus reuteri SD2112 2316838 2300 2297 17.45 PRJNA162169 Bacteria Lactobacillus rhamnosus ATCC 8530 2960339 2887 2751 18.59 PRJDA161983 Bacteria Lactobacillus rhamnosus GG 3005051 2834 2828 18.09 PRJNA59313 Bacteria Lactobacillus rhamnosus GG 3010111 2944 2839 19.25 PRJNA59315 Bacteria Lactobacillus rhamnosus Lc 705 3033106 2992 2892 18.75 PRJNA73417 Bacteria Lactobacillus ruminis ATCC 27782 2066652 1862 1975 15.85 PRJNA58281 Bacteria Lactobacillus sakei subsp. sakei 23K 1884661 1885 1848 12.89 PRJNA162005 Bacteria Lactobacillus salivarius CECT 5713 2136138 1552 2177 13.65 PRJNA58233 Bacteria Lactobacillus salivarius UCC118 2133977 2014 2107 17.52 PRJNA72937 Bacteria Lactobacillus sanfranciscensis TMW 1.1304 1375770 1284 1402 13.22 PRJNA73413 Bacteria Lactococcus garvieae ATCC 49156 1950135 1947 1930 15.53 PRJDA161935 Bacteria Lactococcus garvieae Lg2 1963964 1968 1952 15.61 PRJNA160937 Bacteria Lactococcus lactis subsp. cremoris A76 2577104 2769 2706 18.19 PRJNA58837 Bacteria Lactococcus lactis subsp. cremoris MG1363 2529478 2516 2574 16.64 PRJNA167481 Bacteria Lactococcus lactis subsp. cremoris NZ9000 2530294 2510 2573 17.19 PRJNA57983 Bacteria Lactococcus lactis subsp. cremoris SK11 2598348 2504 2708 17.57 PRJNA160253 Bacteria Lactococcus lactis subsp. lactis CV56 2518737 2408 2496 17.43 PRJNA57671 Bacteria Lactococcus lactis subsp. lactis Il1403 2365589 2266 2378 14.69 PRJNA42831 Bacteria Lactococcus lactis subsp. lactis KF147 2635654 2575 2554 16.16 PRJNA59265 Bacteria Laribacter hongkongensis HLHK9 3169329 3238 3005 16.51 PRJNA61575 Bacteria Lawsonia intracellularis PHE MN1-00 1719014 1344 1439 13.01 PRJNA60161 Bacteria Leadbetterella byssophila DSM 17132 4059653 3465 3655 17.13 PRJNA46099 Bacteria Legionella longbeachae NSW150 4149158 3669 3556 25.27 PRJEA51615 Bacteria Legionella pneumophila 130b 3489800 3288 3246 22.36 PRJNA48801 Bacteria Legionella pneumophila 2300 99 Alcoy 3516334 3191 3128 21.66 PRJNA58733 Bacteria Legionella pneumophila str. Corby 3576470 3204 3187 22.20 PRJNA58209 Bacteria Legionella pneumophila str. Lens 3405519 3004 3012 19.56 PRJNA58211 Bacteria Legionella pneumophila str. Paris 3635495 3224 3244 21.15 PRJNA170534 Bacteria Legionella pneumophila subsp. pneumophila 3492535 3132 3112 20.48 PRJNA170535 Bacteria Legionella pneumophila subsp. pneumophila 3617686 3274 3294 22.06

Page 40: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA86885 Bacteria Legionella pneumophila subsp. pneumophila ATCC 43290 3359001 2942 2988 19.56

PRJNA72895 Bacteria Legionella pneumophila subsp. pneumophila str. Hextuple 2q 2682626 0* 2415 17.35

PRJNA72897 Bacteria Legionella pneumophila subsp. pneumophila str. Hextuple 3a 2682626 0* 2415 17.39

PRJNA57609 Bacteria Legionella pneumophila subsp. pneumophila str. Philadelphia 1 3397754 2942 3021 19.52

PRJNA57759 Bacteria Leifsonia xyli subsp. xyli str. CTCB07 2584158 2030 2755 22.01

PRJNA58511 Bacteria Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)' 3956089 3600 3695 21.74

PRJNA58993 Bacteria Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)' 3951448 3726 3686 22.94

PRJNA58509 Bacteria Leptospira borgpetersenii serovar Hardjo-bovis str. JB197 3876235 2880 3558 25.01

PRJNA58507 Bacteria Leptospira borgpetersenii serovar Hardjo-bovis str. L550 3931782 2945 3615 25.15

PRJNA58065 Bacteria Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130 4627366 3658 3686 27.12

PRJNA57881 Bacteria Leptospira interrogans serovar Lai str. 56601 4698134 3706 3789 27.20 PRJNA161957 Bacteria Leptospira interrogans serovar Lai str. IPAV 4708530 3714 3792 27.16 PRJNA158171 Bacteria Leptospirillum ferrooxidans C2-3 2559538 2421 2449 19.82 PRJNA58971 Bacteria Leptothrix cholodnii SP-6 4909403 4363 4366 10.16 PRJNA59211 Bacteria Leptotrichia buccalis C-1013-b 2465610 2220 2300 20.29 PRJNA58481 Bacteria Leuconostoc citreum KM20 1896614 1823 1855 12.86 PRJNA50385 Bacteria Leuconostoc gasicomitatum LMG 18811 1954080 1913 1908 14.73 PRJNA48589 Bacteria Leuconostoc kimchii IMSNU 11154 2101787 2130 2093 14.71

PRJNA57919 Bacteria Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 2075763 2005 2026 13.69

PRJNA84337 Bacteria Leuconostoc mesenteroides subsp. mesenteroides J18 2016426 1937 1950 11.76

PRJNA68743 Bacteria Leuconostoc sp. C2 1877273 1855 1847 12.43 PRJNA61567 Bacteria Listeria innocua Clip11262 3093113 3061 3128 13.10 PRJNA73473 Bacteria Listeria ivanovii subsp. ivanovii PAM 55 2928879 2805 2838 9.60

Page 41: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA162185 Bacteria Listeria monocytogenes 07PF0776 2901562 2797 2817 10.51 PRJNA43671 Bacteria Listeria monocytogenes 08-5578 3109342 3088 3080 12.76 PRJNA43727 Bacteria Listeria monocytogenes 08-5923 2999054 2966 2960 11.58 PRJNA54461 Bacteria Listeria monocytogenes 10403S 2903106 2814 2828 9.61 PRJNA61583 Bacteria Listeria monocytogenes EGD-e 2944528 2855 2880 9.73 PRJNA54443 Bacteria Listeria monocytogenes Finland 1998 2874431 2762 2785 8.82 PRJNA54441 Bacteria Listeria monocytogenes FSL R2-561 2973801 2910 2944 10.86 PRJNA59203 Bacteria Listeria monocytogenes HCC23 2976212 2974 2966 13.15 PRJNA54459 Bacteria Listeria monocytogenes J0161 3000464 2955 2948 11.03 PRJEA161953 Bacteria Listeria monocytogenes L99 2979198 2925 2962 12.45 PRJNA162131 Bacteria Listeria monocytogenes M7 2976163 2977 2966 13.12

PRJNA59317 Bacteria Listeria monocytogenes serotype 4b str. CLIP 80459 2912690 2790 2804 9.28

PRJNA57689 Bacteria Listeria monocytogenes serotype 4b str. F2365 2905187 2821 2812 10.83 PRJNA46215 Bacteria Listeria seeligeri serovar 1 2b str. SLCC3954 2797636 2722 2697 10.74 PRJNA61605 Bacteria Listeria welshimeri serovar 6b str. SLCC5334 2814130 2780 2781 12.34 PRJNA58945 Bacteria Lysinibacillus sphaericus C3-41 4817463 4771 4811 19.28 PRJNA59003 Bacteria Macrococcus caseolyticus JCSC5402 2219737 2061 2344 12.53 PRJNA57833 Bacteria Magnetococcus marinus MC-1 4719581 3716 3944 21.58 PRJNA58527 Bacteria Magnetospirillum magneticum AMB-1 4967148 4559 4626 20.23 PRJNA66917 Bacteria Mahella australiensis 50-1 BON 3135972 2870 2959 14.98 PRJNA58197 Bacteria Mannheimia succiniciproducens MBEL55E 2314078 2384 2117 11.49 PRJNA51877 Bacteria Maribacter sp. HTCC2170 3868304 3411 3406 15.55 PRJNA58689 Bacteria Maricaulis maris MCS10 3368780 3063 3131 14.27 PRJNA65783 Bacteria Marinithermus hydrothermalis DSM 14884 2269167 2205 2263 12.98 PRJNA81629 Bacteria Marinitoga piezophila KA3 2244793 2046 2073 14.57 PRJNA162009 Bacteria Marinobacter adhaerens HP15 4651725 4410 4355 14.68 PRJNA59419 Bacteria Marinobacter aquaeolei VT8 4779762 4272 4460 14.91

PRJEA162203 Bacteria Marinobacter hydrocarbonoclasticus ATCC 49840 3989480 3804 3657 12.61

PRJNA171995 Bacteria Marinobacter sp. BSs20148 4063864 3887 3752 12.46 PRJNA64753 Bacteria Marinomonas mediterranea MMB-1 4684316 4121 4227 12.90

Page 42: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA67323 Bacteria Marinomonas posidonica IVIA-Po-181 3899940 3491 3521 8.39 PRJNA58715 Bacteria Marinomonas sp. MWYL1 5100344 4439 4554 8.73 PRJNA60837 Bacteria Marivirga tractuosa DSM 4126 4516490 3757 3810 20.75 PRJNA39163 Bacteria Megamonas hypermegale ART12 1 2209938 2118 2800 16.94 PRJNA46661 Bacteria Meiothermus ruber DSM 1279 3097457 3014 3076 15.89 PRJNA49485 Bacteria Meiothermus silvanus DSM 9946 3721669 3505 3703 20.38 PRJNA170941 Bacteria Melioribacter roseus P3M 3300414 2840 2906 12.10 PRJNA66803 Bacteria Melissococcus plutonius ATCC 35311 2068732 1923 1912 14.63 PRJNA89371 Bacteria Melissococcus plutonius DAT561 2045253 1722 1733 11.64 PRJNA58055 Bacteria Mesoplasma florum L1 793224 683 686 18.99

PRJNA62101 Bacteria Mesorhizobium ciceri biovar biserrulae WSM1271 6690028 6264 6418 13.81

PRJNA57601 Bacteria Mesorhizobium loti MAFF303099 7596297 7281 7375 16.44 PRJNA40861 Bacteria Mesorhizobium opportunistum WSM2075 6884444 6508 6641 14.09 PRJNA52599 Bacteria Mesotoga prima MesG1.Ag.4.2 2975953 2574 2673 18.24 PRJNA59161 Bacteria Methylacidiphilum infernorum V4 2287145 2472 2109 27.57 PRJNA58085 Bacteria Methylibium petroleiphilum PM1 4643639 4449 4504 17.82 PRJNA58049 Bacteria Methylobacillus flagellatus KT 2971517 2753 2835 15.86 PRJNA58933 Bacteria Methylobacterium chloromethanicum CM4 6180732 5516 5696 22.51 PRJNA57605 Bacteria Methylobacterium extorquens AM1 6879778 6234 6488 28.56 PRJNA61617 Bacteria Methylobacterium extorquens DM4 6123851 5947 5771 23.33 PRJNA58821 Bacteria Methylobacterium extorquens PA1 5471154 4829 4990 18.93 PRJNA59023 Bacteria Methylobacterium nodulans ORS 2060 8839022 8309 8612 27.37 PRJNA58937 Bacteria Methylobacterium populi BJ001 5848997 5365 5441 21.03 PRJNA58845 Bacteria Methylobacterium radiotolerans JCM 2831 6899110 6431 6496 21.44 PRJNA58843 Bacteria Methylobacterium sp. 4-46 7737025 6692 6989 20.54 PRJNA59433 Bacteria Methylocella silvestris BL2 4305430 3818 3920 16.79 PRJNA57607 Bacteria Methylococcus capsulatus str. Bath 3304561 2960 3051 14.94 PRJEA174072 Bacteria Methylocystis sp. SC2 3773444 3666 3626 19.73 PRJNA77119 Bacteria Methylomicrobium alcaliphilum 4668296 4007 4110 17.95 PRJNA67363 Bacteria Methylomonas methanica MC09 5051681 4494 4609 18.27 PRJNA162947 Bacteria Methylophaga sp. JAM1 3137192 3022 2990 13.37

Page 43: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA162949 Bacteria Methylophaga sp. JAM7 2745290 2691 2624 14.04 PRJNA59373 Bacteria Methylotenera mobilis JLW8 2547570 2338 2343 8.93 PRJNA49469 Bacteria Methylotenera versatilis 301 3059871 2764 2799 11.11 PRJNA59367 Bacteria Methylovorus glucosetrophus SIP3-4 3082007 2910 2913 14.36 PRJNA60723 Bacteria Methylovorus sp. MP688 2862391 2712 2681 11.76 PRJNA73585 Bacteria Micavibrio aeruginosavorus ARL-13 2481983 2432 2343 27.29 PRJNA62789 Bacteria Microbacterium testaceum StLB037 3982034 3676 3639 15.71 PRJNA59033 Bacteria Micrococcus luteus NCTC 2665 2501097 2236 2285 13.21 PRJNA59101 Bacteria Microcystis aeruginosa NIES-843 5842795 6311 5637 30.31 PRJNA68055 Bacteria Microlunatus phosphovorus NM-1 5683123 5359 5206 17.29 PRJNA42501 Bacteria Micromonospora aurantiaca ATCC 27029 7025559 6222 6298 18.03 PRJNA45895 Bacteria Micromonospora sp. L5 6962533 6150 6244 17.30 PRJNA49695 Bacteria Mobiluncus curtisii ATCC 43063 2146480 1909 1830 16.64 PRJEA167487 Bacteria Modestobacter marinus 5575517 5492 5313 17.22 PRJNA58051 Bacteria Moorella thermoacetica ATCC 39073 2628784 2463 2637 10.22 PRJNA48809 Bacteria Moraxella catarrhalis RH4 1863286 1886 1678 17.79 PRJNA72479 Bacteria Muricauda ruestringensis DSM 13258 3842422 3478 3478 17.64 PRJNA61613 Bacteria Mycobacterium abscessus 5090491 4942 4994 17.16 PRJNA68839 Bacteria Mycobacterium africanum GM041182 4389314 3933 4033 13.06 PRJNA57693 Bacteria Mycobacterium avium 104 5475491 5120 5194 15.65

PRJNA57699 Bacteria Mycobacterium avium subsp. paratuberculosis K-10 4829781 4350 4531 12.44

PRJNA57695 Bacteria Mycobacterium bovis AF2122 97 4345492 3953 4008 13.42 PRJNA86889 Bacteria Mycobacterium bovis BCG str. Mexico 4350386 3981 3991 14.28 PRJNA58781 Bacteria Mycobacterium bovis BCG str. Pasteur 1173P2 4374522 3988 4016 14.42 PRJNA59281 Bacteria Mycobacterium bovis BCG str. Tokyo 172 4371711 3984 4031 14.35 PRJNA70731 Bacteria Mycobacterium canettii CIPT 140010059 4482059 3935 4062 12.94 PRJNA168322 Bacteria Mycobacterium chubuense NBB4 6342624 5843 5993 16.09 PRJNA59421 Bacteria Mycobacterium gilvum PYR-GCK 5982829 5579 5723 15.65 PRJNA61403 Bacteria Mycobacterium gilvum Spyr1 5783292 5349 5525 15.39 PRJNA167994 Bacteria Mycobacterium intracellulare ATCC 13950 5402402 5144 5033 13.65 PRJNA89387 Bacteria Mycobacterium intracellulare MOTT-02 5409696 5149 5055 13.26

Page 44: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA89385 Bacteria Mycobacterium intracellulare MOTT-64 5501090 5249 5156 13.41 PRJNA59293 Bacteria Mycobacterium leprae Br4923 3268071 2720 3943 46.60 PRJNA57697 Bacteria Mycobacterium leprae TN 3268203 2720 3960 46.80 PRJNA59423 Bacteria Mycobacterium marinum M 6660144 5452 5610 15.02 PRJNA170732 Bacteria Mycobacterium massiliense str. GO 06 5068807 2626 5014 12.21 PRJNA75107 Bacteria Mycobacterium rhodesiae NBB3 6415739 6147 6273 15.35 PRJNA171958 Bacteria Mycobacterium smegmatis str. MC2 155 6988208 6693 6671 12.89 PRJNA57701 Bacteria Mycobacterium smegmatis str. MC2 155 6988209 6716 6690 13.99 PRJNA67369 Bacteria Mycobacterium sp. JDM601 4643668 4346 4355 14.76 PRJNA58489 Bacteria Mycobacterium sp. JLS 6048425 5739 5835 13.33 PRJNA58491 Bacteria Mycobacterium sp. KMS 6256079 5975 6071 16.40 PRJNA58465 Bacteria Mycobacterium sp. MCS 5920523 5615 5718 14.83 PRJNA164001 Bacteria Mycobacterium sp. MOTT36Y 5613626 5128 5174 12.90 PRJNA161943 Bacteria Mycobacterium tuberculosis CCDC5079 4398812 3647 4193 13.02 PRJNA161941 Bacteria Mycobacterium tuberculosis CCDC5180 4405981 3591 4115 13.00 PRJNA57775 Bacteria Mycobacterium tuberculosis CDC1551 4403837 4189 4081 16.98 PRJNA161997 Bacteria Mycobacterium tuberculosis CTRI-2 4398525 3946 4082 13.60 PRJNA58417 Bacteria Mycobacterium tuberculosis F11 4424435 3950 4092 13.42 PRJNA58853 Bacteria Mycobacterium tuberculosis H37Ra 4419977 4034 4101 14.00 PRJNA170532 Bacteria Mycobacterium tuberculosis H37Rv 4411708 4113 4080 14.63 PRJNA57777 Bacteria Mycobacterium tuberculosis H37Rv 4411532 3999 4085 13.59 PRJNA59069 Bacteria Mycobacterium tuberculosis KZN 1435 4398250 4060 4080 14.19 PRJNA83619 Bacteria Mycobacterium tuberculosis KZN 4207 4394985 3996 4072 13.75 PRJNA54947 Bacteria Mycobacterium tuberculosis KZN 605 4399120 4002 4087 14.07 PRJNA157907 Bacteria Mycobacterium tuberculosis RGTB327 4380119 3691 4731 19.88 PRJNA162179 Bacteria Mycobacterium tuberculosis RGTB423 4406587 3622 4804 19.80 PRJEA162183 Bacteria Mycobacterium tuberculosis UT205 4418088 3804 4034 13.79 PRJNA62939 Bacteria Mycobacterium ulcerans Agy99 5805761 4241 5558 14.91 PRJNA58463 Bacteria Mycobacterium vanbaalenii PYR-1 6491865 5979 6170 15.47 PRJNA46679 Bacteria Mycoplasma agalactiae 1006702 825 831 29.59 PRJNA61619 Bacteria Mycoplasma agalactiae PG2 877438 759 772 28.02

Page 45: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58005 Bacteria Mycoplasma arthritidis 158L3-1 820453 631 633 27.45 PRJNA168665 Bacteria Mycoplasma bovis HB0801 991702 762 828 24.72 PRJNA68691 Bacteria Mycoplasma bovis Hubei-1 948121 801 805 25.72 PRJNA60859 Bacteria Mycoplasma bovis PG45 1003404 765 837 25.97

PRJNA58525 Bacteria Mycoplasma capricolum subsp. capricolum ATCC 27343 1010023 812 838 21.45

PRJNA59325 Bacteria Mycoplasma conjunctivae 846214 696 674 29.49 PRJNA47087 Bacteria Mycoplasma crocodyli MP145 934379 689 755 23.55 PRJNA53543 Bacteria Mycoplasma fermentans JER 977524 797 836 25.72 PRJNA62099 Bacteria Mycoplasma fermentans M64 1118751 1050 986 36.84 PRJDA28473 Bacteria Mycoplasma fermentans PG18 1004014 893 893 29.51 PRJNA172630 Bacteria Mycoplasma gallisepticum CA06 2006.052-5-2P 976412 763 770 24.01 PRJNA172629 Bacteria Mycoplasma gallisepticum NC06 2006.080-5-2P 938869 744 742 23.82 PRJNA172631 Bacteria Mycoplasma gallisepticum NC08 2008.031-4-3P 926650 739 743 23.95 PRJNA172625 Bacteria Mycoplasma gallisepticum NC95 13295-2-2P 953989 754 759 24.98 PRJNA172626 Bacteria Mycoplasma gallisepticum NC96 1596-4-2P 986257 771 773 24.42 PRJNA172627 Bacteria Mycoplasma gallisepticum NY01 2001.047-5-1P 965525 760 766 25.03 PRJNA162001 Bacteria Mycoplasma gallisepticum str. F 977612 756 783 26.32 PRJNA161999 Bacteria Mycoplasma gallisepticum str. R(high) 1012027 766 809 25.27 PRJNA57993 Bacteria Mycoplasma gallisepticum str. R(low) 1012800 763 808 24.82 PRJNA172624 Bacteria Mycoplasma gallisepticum VA94 7994-1-7P 964110 767 765 25.59

PRJNA172628 Bacteria Mycoplasma gallisepticum WI01 2001.043-13-2P 939844 746 753 24.48

PRJNA57707 Bacteria Mycoplasma genitalium G37 580076 476 527 14.56 PRJNA173372 Bacteria Mycoplasma genitalium M2288 579558 506 536 18.33 PRJNA173373 Bacteria Mycoplasma genitalium M2321 579977 499 550 17.92 PRJNA173371 Bacteria Mycoplasma genitalium M6282 579504 484 574 19.19 PRJNA173370 Bacteria Mycoplasma genitalium M6320 579796 509 553 18.64 PRJNA82367 Bacteria Mycoplasma haemocanis str. Illinois 919992 1175 1182 74.71 PRJNA162029 Bacteria Mycoplasma haemofelis Ohio2 1155937 1527 1615 80.49 PRJNA62461 Bacteria Mycoplasma haemofelis str. Langford 1 1147259 1545 1569 80.22 PRJNA41875 Bacteria Mycoplasma hominis ATCC 23114 665445 536 561 23.70

Page 46: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA162053 Bacteria Mycoplasma hyopneumoniae 168 925576 695 705 31.00 PRJNA58039 Bacteria Mycoplasma hyopneumoniae 7448 920079 698 694 30.39 PRJNA58059 Bacteria Mycoplasma hyopneumoniae J 897405 674 693 28.68 PRJNA87003 Bacteria Mycoplasma hyorhinis GDL-1 837480 647 756 24.88 PRJNA162087 Bacteria Mycoplasma hyorhinis MCLD 829709 778 757 28.86 PRJEA162031 Bacteria Mycoplasma leachii 99 014 6 1017232 905 901 25.08 PRJNA60849 Bacteria Mycoplasma leachii PG50 1008951 882 887 24.19 PRJNA58077 Bacteria Mycoplasma mobile 163K 777079 635 659 19.63

PRJNA39245 Bacteria Mycoplasma mycoides subsp capri GM12 deltatypeIIIres 1084586 830 874 21.54

PRJNA19245 Bacteria Mycoplasma mycoides subsp capri GM12 tetM-lacZ 1089202 832 879 21.86

PRJNA66189 Bacteria Mycoplasma mycoides subsp. capri LC str. 95010 1155838 926 949 24.00

PRJNA27713 Bacteria Mycoplasma mycoides subsp. mycoides SC str. Gladysdale 1193808 1095 1098 26.13

PRJNA58031 Bacteria Mycoplasma mycoides subsp. mycoides SC str. PG1 1211703 1016 1148 29.16

PRJNA57729 Bacteria Mycoplasma penetrans HF-2 1358633 1037 1038 25.88 PRJNA85495 Bacteria Mycoplasma pneumoniae 309 817176 707 760 20.38 PRJNA57709 Bacteria Mycoplasma pneumoniae M129 816394 688 764 20.45 PRJNA61569 Bacteria Mycoplasma pulmonis UAB CTIP 963879 782 758 24.94 PRJNA72481 Bacteria Mycoplasma putrefaciens KS1 832603 650 701 20.50 PRJNA58061 Bacteria Mycoplasma synoviae 53 799476 681 711 28.74 PRJNA170731 Bacteria Mycoplasma wenyonii str. Massachusetts 650228 652 718 53.58 PRJNA68443 Bacteria Myxococcus fulvus HW-1 9003593 7284 7301 24.77 PRJNA58003 Bacteria Myxococcus xanthus DK 1622 9139763 7331 7328 23.88 PRJNA59221 Bacteria Nakamurella multipartita DSM 44233 6060298 5240 5467 17.47 PRJNA59001 Bacteria Natranaerobius thermophilus JW NM-WN-LF 3191453 2906 2973 15.22 PRJNA59345 Bacteria Nautilia profundicola AmH 1676444 1730 1737 12.95 PRJNA57611 Bacteria Neisseria gonorrhoeae FA 1090 2153922 2002 2143 20.63 PRJNA60851 Bacteria Neisseria lactamica 020-06 2220606 2018 2035 17.30 PRJNA58587 Bacteria Neisseria meningitidis 053442 2153416 2020 2005 17.52

Page 47: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJEA161967 Bacteria Neisseria meningitidis 8013 2277550 2126 2130 18.49 PRJNA61649 Bacteria Neisseria meningitidis alpha14 2145295 1924 1979 15.96 PRJNA161971 Bacteria Neisseria meningitidis alpha710 2242947 2063 2080 17.84 PRJNA57825 Bacteria Neisseria meningitidis FAM18 2194961 1975 2057 16.84 PRJNA162085 Bacteria Neisseria meningitidis G2136 2184862 1971 2070 17.55 PRJNA162083 Bacteria Neisseria meningitidis H44 76 2240883 2025 2092 17.66 PRJNA162079 Bacteria Neisseria meningitidis M01-240149 2223518 1985 2053 16.79 PRJNA162075 Bacteria Neisseria meningitidis M01-240355 2287777 2023 2098 17.25 PRJNA162081 Bacteria Neisseria meningitidis M04-240196 2250449 2022 2087 17.30 PRJNA57817 Bacteria Neisseria meningitidis MC58 2272360 2063 2121 20.55 PRJNA162077 Bacteria Neisseria meningitidis NZ-05 33 2248966 2014 2091 17.69 PRJEA162093 Bacteria Neisseria meningitidis WUE 2594 2227255 2070 2132 17.44 PRJNA57819 Bacteria Neisseria meningitidis Z2491 2184406 1993 2053 16.46 PRJNA57965 Bacteria Neorickettsia sennetsu str. Miyayama 859006 932 789 25.33 PRJNA83125 Bacteria Niastella koreensis GR20-10 9033684 7174 7307 21.30 PRJNA62183 Bacteria Nitratifractor salsuginis DSM 16511 2101285 2088 2120 17.21 PRJNA58861 Bacteria Nitratiruptor sp. SB155-2 1877931 1857 1936 12.02 PRJNA58293 Bacteria Nitrobacter hamburgensis X14 5011522 4326 4892 23.78 PRJNA58295 Bacteria Nitrobacter winogradskyi Nb-255 3402093 3122 3265 19.37 PRJNA46803 Bacteria Nitrosococcus halophilus Nc4 4145260 3817 3988 17.91 PRJNA58403 Bacteria Nitrosococcus oceani ATCC 19707 3522111 3019 3310 15.99 PRJNA50331 Bacteria Nitrosococcus watsonii C-113 3373286 2908 3152 16.09 PRJNA57647 Bacteria Nitrosomonas europaea ATCC 19718 2812094 2575 2664 11.99 PRJNA58363 Bacteria Nitrosomonas eutropha C91 2781824 2551 2706 13.81 PRJNA55727 Bacteria Nitrosomonas sp. AL212 3337023 2983 3200 19.59 PRJNA68745 Bacteria Nitrosomonas sp. Is79A3 3783444 3372 3555 21.35 PRJNA58361 Bacteria Nitrosospira multiformis ATCC 25196 3234309 2805 2980 16.18 PRJNA89395 Bacteria Nocardia cyriacigeorgica GUH-2 6194645 5491 5582 17.68 PRJNA58203 Bacteria Nocardia farcinica IFM 10152 6292344 5946 5979 19.90 PRJNA58149 Bacteria Nocardioides sp. JS614 5293685 4909 5088 15.72 PRJNA174334 Bacteria Nocardiopsis alba ATCC BAA-2165 5848211 5539 5152 21.12

Page 48: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA49483 Bacteria Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 6543312 5497 5557 15.83

PRJNA49725 Bacteria Nostoc azollae' 0708 5486145 3651 6621 37.81 PRJNA57767 Bacteria Nostoc punctiforme PCC 73102 9059191 6690 7771 25.42 PRJNA57803 Bacteria Nostoc sp. PCC 7120 7211789 6132 6156 23.36 PRJNA57747 Bacteria Novosphingobium aromaticivorans DSM 12444 4233314 3937 4002 13.98 PRJNA67383 Bacteria Novosphingobium sp. PP1Y 5313905 4683 4876 12.52 PRJNA81627 Bacteria Oceanimonas sp. GK1 3527244 3221 3231 6.23 PRJNA60855 Bacteria Oceanithermus profundus DSM 14977 2439291 2373 2387 16.37 PRJNA57867 Bacteria Oceanobacillus iheyensis HTE831 3630528 3496 3555 11.15 PRJNA58921 Bacteria Ochrobactrum anthropi ATCC 49188 5205777 4799 5022 13.35 PRJNA63397 Bacteria Odoribacter splanchnicus DSM 20712 4392288 3497 3709 19.94 PRJNA59417 Bacteria Oenococcus oeni PSU-1 1780517 1691 1833 13.99 PRJNA162135 Bacteria Oligotropha carboxidovorans OM4 3839771 3574 3675 15.04 PRJNA59155 Bacteria Oligotropha carboxidovorans OM5 3745629 3722 3581 17.90 PRJNA72795 Bacteria Oligotropha carboxidovorans OM5 3896074 3629 3716 14.92 PRJNA51367 Bacteria Olsenella uli DSM 7084 2051896 1739 1766 12.24 PRJNA58015 Bacteria Onion yellows phytoplasma OY-M 853092 752 951 34.76 PRJNA58965 Bacteria Opitutus terrae PB90-1 5957605 4612 4672 18.83 PRJNA61621 Bacteria Orientia tsutsugamushi str. Boryong 2127051 2179 2402 39.33 PRJNA58869 Bacteria Orientia tsutsugamushi str. Ikeda 2008987 1967 2198 30.37 PRJNA168256 Bacteria Ornithobacterium rhinotracheale DSM 15997 2399175 2267 2335 27.64 PRJNA73895 Bacteria Oscillibacter valericigenes Sjm18-20 4470622 4723 4393 25.21 PRJNA82951 Bacteria Owenweeksia hongkongensis DSM 17368 4000057 3485 3512 20.98 PRJNA162117 Bacteria Paenibacillus mucilaginosus K02 8770140 7253 7365 16.65 PRJNA68311 Bacteria Paenibacillus mucilaginosus KNP414 8663821 7804 7197 19.44 PRJNA53477 Bacteria Paenibacillus polymyxa E681 5394884 4805 4856 15.25 PRJEA162159 Bacteria Paenibacillus polymyxa M1 6231122 5364 5626 19.33 PRJNA59583 Bacteria Paenibacillus polymyxa SC2 6241931 6033 5754 25.05 PRJNA59021 Bacteria Paenibacillus sp. JDR-2 7184930 6213 6295 12.26 PRJNA41127 Bacteria Paenibacillus sp. Y412MC10 7121665 6238 6319 12.73 PRJNA82371 Bacteria Paenibacillus terrae HPL-003 6083395 5525 5455 16.09

Page 49: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA60725 Bacteria Paludibacter propionicigenes WB4 3685504 3020 3072 17.53 PRJDA162073 Bacteria Pantoea ananatis AJ13355 4877280 4067 4476 7.40 PRJNA46807 Bacteria Pantoea ananatis LMG 20103 4703373 4241 4270 8.15 PRJNA86861 Bacteria Pantoea ananatis LMG 5342 4908144 4664 4527 12.39 PRJNA162181 Bacteria Pantoea ananatis PA13 4867131 4372 4523 9.20 PRJNA55845 Bacteria Pantoea sp. At-9b 6312783 5770 5903 9.43 PRJNA49871 Bacteria Pantoea vagans C9-1 4888338 4591 4501 10.68 PRJNA58301 Bacteria Parabacteroides distasonis ATCC 8503 4811379 3850 4002 16.19 PRJNA58187 Bacteria Paracoccus denitrificans PD1222 5236194 5077 5112 12.94 PRJNA58739 Bacteria Parvibaculum lavamentivorans DS-1 3914745 3636 3701 10.21 PRJNA51641 Bacteria Parvularcula bermudensis HTCC2503 2902643 2687 2677 14.22 PRJNA86887 Bacteria Pasteurella multocida 36950 2349518 2098 2154 8.28 PRJNA161955 Bacteria Pasteurella multocida subsp. multocida str. 3480 2378127 2229 2214 10.35

PRJNA156881 Bacteria Pasteurella multocida subsp. multocida str. HN06 2407578 2265 2253 10.67

PRJNA57627 Bacteria Pasteurella multocida subsp. multocida str. Pm70 2257487 2014 2030 6.53

PRJNA57957 Bacteria Pectobacterium atrosepticum SCRI1043 5064019 4492 4467 9.84

PRJNA59295 Bacteria Pectobacterium carotovorum subsp. carotovorum PC1 4862913 4246 4284 7.95

PRJNA174335 Bacteria Pectobacterium carotovorum subsp. carotovorum PCC21 4842771 4263 4265 9.77

PRJNA41297 Bacteria Pectobacterium wasabiae WPP163 5063892 4437 4525 11.70 PRJNA81103 Bacteria Pediococcus claussenii ATCC BAA-344 1978793 1881 1891 12.62 PRJNA57981 Bacteria Pediococcus pentosaceus ATCC 25745 1832387 1755 1755 11.85 PRJNA59111 Bacteria Pedobacter heparinus DSM 2366 5167383 4252 4271 15.26 PRJNA61349 Bacteria Pedobacter saltans DSM 12145 4635236 3792 3848 16.64 PRJNA74393 Bacteria Pelagibacterium halotolerans B2 3948887 3881 3859 12.35 PRJNA58241 Bacteria Pelobacter carbinolicus DSM 2380 3665893 3353 3239 14.20 PRJNA58255 Bacteria Pelobacter propionicus DSM 2379 4241119 3804 3814 16.28 PRJNA58173 Bacteria Pelodictyon phaeoclathratiforme BU-1 3018238 2707 2923 17.02 PRJNA58877 Bacteria Pelotomaculum thermopropionicum SI 3025375 2920 2990 14.86

Page 50: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58119 Bacteria Persephonella marina EX-H1 1983966 2051 2051 16.31 PRJNA58747 Bacteria Petrotoga mobilis SJ95 2169548 1898 1980 11.78 PRJNA54715 Bacteria Phaeobacter gallaeciensis 2.10 4160918 3723 3883 9.89 PRJNA54717 Bacteria Phaeobacter gallaeciensis DSM 17395 4227134 3875 3951 11.55 PRJNA58959 Bacteria Phenylobacterium zucineum HLK1 4379231 3854 4274 11.38 PRJNA62923 Bacteria Photobacterium profundum SS9 6403280 5480 5818 16.41 PRJNA59243 Bacteria Photorhabdus asymbiotica 5094138 4422 4416 18.78

PRJNA61593 Bacteria Photorhabdus luminescens subsp. laumondii TTO1 5688987 4905 4883 18.64

PRJNA157331 Bacteria Phycisphaera mikurensis NBRC 102666 3884382 3284 3124 20.96 PRJNA43209 Bacteria Pirellula staleyi DSM 6068 6196199 4717 4716 25.72 PRJNA60583 Bacteria Planctomyces brasiliensis DSM 5305 6006602 4750 4861 25.31 PRJNA48643 Bacteria Planctomyces limnophilus DSM 3776 5460085 4258 4283 28.92 PRJNA58273 Bacteria Polaromonas naphthalenivorans CJ2 5366143 4929 5008 15.79 PRJNA58207 Bacteria Polaromonas sp. JS666 5898676 5453 5644 14.06 PRJNA65447 Bacteria Polymorphum gilvum SL003B-26A1 4718963 4393 4426 12.10

PRJNA58611 Bacteria Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 2159490 2077 2135 9.83

PRJNA58967 Bacteria Polynucleobacter necessarius subsp. necessarius STIR1 1560469 1508 1931 15.09

PRJNA66603 Bacteria Porphyromonas asaccharolytica DSM 20707 2186370 1699 1731 16.06 PRJNA58879 Bacteria Porphyromonas gingivalis ATCC 33277 2354886 2090 1980 19.93 PRJNA67407 Bacteria Porphyromonas gingivalis TDC60 2339898 2220 1976 23.28 PRJNA57641 Bacteria Porphyromonas gingivalis W83 2343476 1909 2013 19.22 PRJNA65091 Bacteria Prevotella denticola F0289 2937589 2386 2355 22.70 PRJNA163151 Bacteria Prevotella intermedia 17 2699437 2266 2332 25.03 PRJNA51377 Bacteria Prevotella melaninogenica ATCC 25845 3168282 2296 2464 20.97 PRJNA47507 Bacteria Prevotella ruminicola 23 3619559 2763 3046 18.11 PRJNA58307 Bacteria Prochlorococcus marinus str. AS9601 1669886 1921 1891 22.69 PRJNA58309 Bacteria Prochlorococcus marinus str. MIT 9211 1688963 1855 1853 21.95 PRJNA58819 Bacteria Prochlorococcus marinus str. MIT 9215 1738790 1983 1986 24.34 PRJNA58437 Bacteria Prochlorococcus marinus str. MIT 9301 1641879 1907 1875 22.50

Page 51: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58305 Bacteria Prochlorococcus marinus str. MIT 9303 2682675 2997 2742 31.68 PRJNA58357 Bacteria Prochlorococcus marinus str. MIT 9312 1709204 1962 1922 23.58 PRJNA57773 Bacteria Prochlorococcus marinus str. MIT 9313 2410873 2915 2565 33.23 PRJNA58313 Bacteria Prochlorococcus marinus str. MIT 9515 1704176 1906 1901 23.01 PRJNA58423 Bacteria Prochlorococcus marinus str. NATL1A 1864731 2193 2155 30.61 PRJNA58359 Bacteria Prochlorococcus marinus str. NATL2A 1842899 2163 2116 30.45

PRJNA57995 Bacteria Prochlorococcus marinus subsp. marinus str. CCMP1375 1751080 1882 1908 22.96

PRJNA57761 Bacteria Prochlorococcus marinus subsp. pastoris str. CCMP1986 1657990 1960 1921 24.04

PRJNA162059 Bacteria Propionibacterium acnes 266 2494578 2348 2315 17.39 PRJNA162137 Bacteria Propionibacterium acnes 6609 2560282 2348 2361 16.12 PRJNA162177 Bacteria Propionibacterium acnes ATCC 11828 2488626 2259 2312 16.50 PRJNA58101 Bacteria Propionibacterium acnes KPA171202 2560265 2297 2366 15.72 PRJNA48071 Bacteria Propionibacterium acnes SK137 2495334 2352 2316 16.88 PRJNA80735 Bacteria Propionibacterium acnes TypeIA2 P.acn17 2522885 2263 2321 15.10 PRJNA80733 Bacteria Propionibacterium acnes TypeIA2 P.acn31 2498766 2244 2295 15.05 PRJNA80745 Bacteria Propionibacterium acnes TypeIA2 P.acn33 2489623 2233 2289 14.88

PRJNA49535 Bacteria Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1 2616384 2439 2312 16.27

PRJNA170533 Bacteria Propionibacterium propionicum F0230a 3449360 2938 3068 17.90 PRJNA58151 Bacteria Prosthecochloris aestuarii DSM 271 2579695 2327 2382 14.19 PRJNA61599 Bacteria Proteus mirabilis HI4320 4099895 3740 3690 11.59 PRJNA162193 Bacteria Providencia stuartii MRSN 2154 4402109 4099 4150 13.53 PRJNA58283 Bacteria Pseudoalteromonas atlantica T6c 5187005 4281 4409 11.55 PRJNA58431 Bacteria Pseudoalteromonas haloplanktis TAC125 3850272 3487 3398 12.49 PRJNA61247 Bacteria Pseudoalteromonas sp. SM9913 4037671 3712 3604 13.38 PRJNA73423 Bacteria Pseudogulbenkiania sp. NH8B 4332995 4015 3946 12.21 PRJNA168996 Bacteria Pseudomonas aeruginosa DK2 6402658 5884 5870 11.16 PRJNA59275 Bacteria Pseudomonas aeruginosa LESB58 6601757 5965 6058 12.04 PRJNA162089 Bacteria Pseudomonas aeruginosa M18 6327754 5684 5775 10.13 PRJDA162173 Bacteria Pseudomonas aeruginosa NCGM2.S1 6764661 6269 6225 13.14

Page 52: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58627 Bacteria Pseudomonas aeruginosa PA7 6588339 6286 6035 13.56 PRJNA57945 Bacteria Pseudomonas aeruginosa PAO1 6264404 5571 5680 9.18 PRJNA57977 Bacteria Pseudomonas aeruginosa UCBPP-PA14 6537648 5892 5905 10.59

PRJNA66303 Bacteria Pseudomonas brassicacearum subsp. brassicacearum NFM421 6843248 6095 6057 12.00

PRJNA58639 Bacteria Pseudomonas entomophila L48 5888780 5168 5070 11.46 PRJNA165185 Bacteria Pseudomonas fluorescens A506 5962570 5267 5374 11.80 PRJNA87037 Bacteria Pseudomonas fluorescens F113 6845832 5862 6048 9.97 PRJNA57591 Bacteria Pseudomonas fluorescens Pf0-1 6438405 5722 5735 11.18 PRJNA158693 Bacteria Pseudomonas fluorescens SBW25 6722539 6009 6027 11.71 PRJNA67351 Bacteria Pseudomonas fulva 12-X 4920769 4461 4460 9.32 PRJNA66299 Bacteria Pseudomonas mendocina NK-01 5434353 4958 5012 10.17 PRJNA58723 Bacteria Pseudomonas mendocina ymp 5072807 4594 4632 9.55 PRJNA57937 Bacteria Pseudomonas protegens Pf-5 7074893 6108 6275 11.49 PRJNA162055 Bacteria Pseudomonas putida BIRD-1 5731541 4960 5152 10.80 PRJNA171260 Bacteria Pseudomonas putida DOT-T1E 6260702 5721 5756 13.89 PRJNA58355 Bacteria Pseudomonas putida F1 5959964 5252 5298 10.76 PRJNA58735 Bacteria Pseudomonas putida GB-1 6078430 5409 5451 11.69 PRJNA57843 Bacteria Pseudomonas putida KT2440 6181863 5350 5584 12.11 PRJNA167583 Bacteria Pseudomonas putida ND6 6202452 6289 5590 17.72 PRJNA68747 Bacteria Pseudomonas putida S16 5984790 5218 5454 11.38 PRJNA58651 Bacteria Pseudomonas putida W619 5774330 5182 5239 11.14 PRJNA58641 Bacteria Pseudomonas stutzeri A1501 4567418 4128 4178 9.20

PRJNA68749 Bacteria Pseudomonas stutzeri ATCC 17588 = LMG 11199 4547930 4217 4196 10.38

PRJNA168379 Bacteria Pseudomonas stutzeri CCUG 29243 4709064 4300 4352 9.92 PRJNA170940 Bacteria Pseudomonas stutzeri DSM 10701 4174118 3815 3833 8.76 PRJNA162113 Bacteria Pseudomonas stutzeri DSM 4166 4689946 4303 4338 9.06 PRJNA58099 Bacteria Pseudomonas syringae pv. phaseolicola 1448A 6112448 5172 5577 13.54 PRJNA57931 Bacteria Pseudomonas syringae pv. syringae B728a 6093698 5136 5204 11.71 PRJNA57967 Bacteria Pseudomonas syringae pv. tomato str. DC3000 6538260 5621 5869 17.00 PRJNA65087 Bacteria Pseudonocardia dioxanivorans CB1190 7440794 6797 6987 15.80

Page 53: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA82373 Bacteria Pseudovibrio sp. FO-BEG1 5916782 5468 5342 14.93 PRJNA75113 Bacteria Pseudoxanthomonas spadix BD-a59 3452554 3149 3117 11.67 PRJNA62105 Bacteria Pseudoxanthomonas suwonensis 11-1 3419049 3070 3102 13.66 PRJNA58021 Bacteria Psychrobacter arcticus 273-4 2650701 2147 2198 12.13 PRJNA58373 Bacteria Psychrobacter cryohalolentis K5 3101097 2511 2558 10.24 PRJNA58459 Bacteria Psychrobacter sp. PRwf-1 2995049 2385 2442 12.72 PRJNA58521 Bacteria Psychromonas ingrahamii 37 4559598 3545 3831 12.31 PRJNA66391 Bacteria Pusillimonas sp. T7-7 3924810 3773 3682 10.60 PRJNA86855 Bacteria Rahnella aquatilis CIP 78.65 = ATCC 33071 5448900 4866 4891 8.47 PRJNA158049 Bacteria Rahnella aquatilis HX2 5656799 5082 5197 9.06 PRJNA62715 Bacteria Rahnella sp. Y9602 5614252 5111 5151 9.54 PRJNA62925 Bacteria Ralstonia eutropha H16 7416678 6626 6753 11.17 PRJNA58047 Bacteria Ralstonia eutropha JMP134 7255290 6446 6631 11.03 PRJNA58859 Bacteria Ralstonia pickettii 12D 5685358 5361 5419 19.35 PRJNA58737 Bacteria Ralstonia pickettii 12J 5325729 4952 4991 14.95 PRJNA50545 Bacteria Ralstonia solanacearum CFBP2957 3417386 3303 3162 14.00 PRJNA57593 Bacteria Ralstonia solanacearum GMI1000 5810922 5120 5058 15.40 PRJNA162133 Bacteria Ralstonia solanacearum Po82 5430263 5018 4741 18.34 PRJNA50539 Bacteria Ralstonia solanacearum PSI07 5605618 5100 4806 15.62 PRJNA68279 Bacteria Ramlibacter tataouinensis TTB310 4070193 3881 3918 10.67 PRJNA58899 Bacteria Renibacterium salmoninarum ATCC 33209 3155250 3507 3515 16.96 PRJNA58377 Bacteria Rhizobium etli CFN 42 6530228 6016 6261 13.46 PRJNA59115 Bacteria Rhizobium etli CIAT 652 6448048 6109 6085 15.19

PRJNA58991 Bacteria Rhizobium leguminosarum bv. trifolii WSM1325 7418122 7001 7118 14.43

PRJNA58997 Bacteria Rhizobium leguminosarum bv. trifolii WSM2304 6872702 6415 6526 13.35

PRJNA57955 Bacteria Rhizobium leguminosarum bv. viciae 3841 7751309 7263 7369 13.78 PRJNA47509 Bacteria Rhodobacter capsulatus SB 1003 3871920 3642 3630 14.15 PRJNA57653 Bacteria Rhodobacter sphaeroides 2.4.1 4603060 4242 4357 13.30 PRJNA58451 Bacteria Rhodobacter sphaeroides ATCC 17025 4557127 4333 4505 15.41 PRJNA58449 Bacteria Rhodobacter sphaeroides ATCC 17029 4489380 4132 4217 12.26

Page 54: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59277 Bacteria Rhodobacter sphaeroides KD131 4711139 4569 4419 14.46 PRJNA60171 Bacteria Rhodococcus equi 103S 5043170 4525 4657 12.43 PRJNA59019 Bacteria Rhodococcus erythropolis PR4 6895538 6437 6545 15.74 PRJNA58325 Bacteria Rhodococcus jostii RHA1 9702737 9145 9049 19.48 PRJNA13791 Bacteria Rhodococcus opacus B4 8834939 8203 8192 16.19 PRJNA58353 Bacteria Rhodoferax ferrireducens T118 4969784 4418 4582 12.44 PRJNA43247 Bacteria Rhodomicrobium vannielii ATCC 17100 4014469 3565 3679 17.35 PRJNA61589 Bacteria Rhodopirellula baltica SH 1 7145576 7325 5518 35.86 PRJNA58445 Bacteria Rhodopseudomonas palustris BisA53 5505494 4878 4950 16.46 PRJNA58443 Bacteria Rhodopseudomonas palustris BisB18 5513844 4886 4985 15.26 PRJNA58441 Bacteria Rhodopseudomonas palustris BisB5 4892717 4397 4429 14.09 PRJNA62901 Bacteria Rhodopseudomonas palustris CGA009 5467640 4840 4963 12.45 PRJNA43327 Bacteria Rhodopseudomonas palustris DX-1 5404117 4917 4980 14.44 PRJNA58439 Bacteria Rhodopseudomonas palustris HaA2 5331656 4683 4739 12.64 PRJNA58995 Bacteria Rhodopseudomonas palustris TIE-1 5744041 5246 5303 16.10 PRJNA58805 Bacteria Rhodospirillum centenum SW 4355543 4038 3945 16.66 PRJNA159003 Bacteria Rhodospirillum photometricum DSM 122 3876289 3286 3510 15.67 PRJNA57655 Bacteria Rhodospirillum rubrum ATCC 11170 4406557 3841 3889 10.75 PRJNA162149 Bacteria Rhodospirillum rubrum F11 4352825 3878 3841 11.47 PRJNA41729 Bacteria Rhodothermus marinus DSM 4252 3386737 2863 2922 13.05 PRJNA72767 Bacteria Rhodothermus marinus SG0.5JP17-172 3334122 2838 2945 13.49 PRJNA58799 Bacteria Rickettsia africae ESF-5 1290917 1041 1448 23.42 PRJNA58161 Bacteria Rickettsia akari str. Hartford 1231060 1259 1283 28.09 PRJNA158039 Bacteria Rickettsia australis str. Cutlack 1323280 1261 1471 28.51 PRJNA58681 Bacteria Rickettsia bellii OSU 85-389 1528980 1476 1573 24.40 PRJNA58405 Bacteria Rickettsia bellii RML369-C 1522076 1429 1515 21.16 PRJNA88063 Bacteria Rickettsia canadensis str. CA410 1150228 1016 1036 21.54 PRJNA58159 Bacteria Rickettsia canadensis str. McKiel 1159772 1093 1054 24.41 PRJNA57633 Bacteria Rickettsia conorii str. Malish 7 1268755 1374 1418 28.62 PRJNA58331 Bacteria Rickettsia felis URRWXCal2 1587240 1512 1644 22.97 PRJNA70839 Bacteria Rickettsia heilongjiangensis 054 1278471 1297 1421 27.85

Page 55: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA73963 Bacteria Rickettsia japonica YH 1283087 971 1430 23.41 PRJNA58801 Bacteria Rickettsia massiliae MTU5 1376184 980 1573 23.35 PRJNA86751 Bacteria Rickettsia massiliae str. AZT80 1278719 1207 1451 27.35 PRJNA158043 Bacteria Rickettsia montanensis str. OSU 85-930 1279798 1217 1398 24.09 PRJNA158045 Bacteria Rickettsia parkeri str. Portsmouth 1300386 1318 1450 28.47 PRJNA59301 Bacteria Rickettsia peacockii str. Rustic 1314898 947 1502 22.09 PRJNA89383 Bacteria Rickettsia philipii str. 364D 1287740 1344 1423 28.48 PRJNA161945 Bacteria Rickettsia prowazekii Rp22 1111612 952 888 14.78 PRJNA158063 Bacteria Rickettsia prowazekii str. BuV67-CWPP 1111445 843 886 9.20 PRJNA158053 Bacteria Rickettsia prowazekii str. Chernikova 1109804 845 885 9.19 PRJNA158057 Bacteria Rickettsia prowazekii str. Dachau 1109051 839 893 9.53 PRJNA158051 Bacteria Rickettsia prowazekii str. GvV257 1111969 829 890 9.02 PRJNA158055 Bacteria Rickettsia prowazekii str. Katsinyian 1111454 844 889 9.17 PRJNA61565 Bacteria Rickettsia prowazekii str. Madrid E 1111523 834 900 9.40 PRJNA158065 Bacteria Rickettsia prowazekii str. RpGvF24 1112101 834 885 8.96 PRJNA156977 Bacteria Rickettsia rhipicephali str. 3-7-female6-CWPP 1305467 1266 1501 28.62 PRJNA58027 Bacteria Rickettsia rickettsii str. 'Sheila Smith' 1257710 1345 1399 30.61 PRJNA86655 Bacteria Rickettsia rickettsii str. Arizona 1267197 1343 1420 29.53 PRJNA88069 Bacteria Rickettsia rickettsii str. Brazil 1255681 1332 1402 29.19 PRJNA86653 Bacteria Rickettsia rickettsii str. Colombia 1270083 1350 1418 29.84 PRJNA86659 Bacteria Rickettsia rickettsii str. Hauke 1269774 1340 1421 29.48 PRJNA86657 Bacteria Rickettsia rickettsii str. Hino 1269837 1336 1419 29.36 PRJNA88067 Bacteria Rickettsia rickettsii str. Hlp#2 1270751 1308 1407 28.84 PRJNA58961 Bacteria Rickettsia rickettsii str. Iowa 1268188 1384 1422 31.54 PRJNA82369 Bacteria Rickettsia slovaca 13-B 1275089 1114 1446 26.72 PRJNA158159 Bacteria Rickettsia slovaca str. D-CWPP 1275720 1347 1438 28.87 PRJNA158357 Bacteria Rickettsia typhi str. B9991CWPP 1112957 839 876 10.09 PRJNA158161 Bacteria Rickettsia typhi str. TH1527 1112372 838 875 9.98 PRJNA58063 Bacteria Rickettsia typhi str. Wilmington 1111496 838 876 10.39

PRJNA159857 Bacteria Riemerella anatipestifer ATCC 11845 = DSM 15868 2164087 1941 2025 20.05

PRJNA60727 Bacteria Riemerella anatipestifer ATCC 11845 = DSM 2155121 1972 2008 19.45

Page 56: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

15868

PRJNA162013 Bacteria Riemerella anatipestifer RA-GD 2166384 1985 2083 20.72 PRJNA58285 Bacteria Robiginitalea biformata HTCC2501 3530383 3211 3116 18.40 PRJNA73419 Bacteria Roseburia hominis A2-183 3592125 3362 3258 17.58 PRJNA39165 Bacteria Roseburia intestinalis M50 1 4143550 3478 3870 22.06 PRJNA45953 Bacteria Roseburia intestinalis XB6B4 4286292 3630 3981 23.32 PRJNA58287 Bacteria Roseiflexus castenholzii DSM 13941 5723298 4330 4904 19.23 PRJNA58523 Bacteria Roseiflexus sp. RS-1 5801598 4517 4979 18.22 PRJNA58597 Bacteria Roseobacter denitrificans OCh 114 4331234 4144 4106 13.30 PRJNA54719 Bacteria Roseobacter litoralis Och 149 4745450 4537 4585 14.03 PRJNA49331 Bacteria Rothia dentocariosa ATCC 17931 2506025 2217 2144 24.33 PRJNA43093 Bacteria Rothia mucilaginosa DY-18 2264603 1992 1729 24.75 PRJNA158163 Bacteria Rubrivivax gelatinosus IL144 5043253 4706 4584 11.10 PRJNA58057 Bacteria Rubrobacter xylanophilus DSM 9941 3225748 3140 3237 11.96 PRJNA57863 Bacteria Ruegeria pomeroyi DSS-3 4601053 4252 4393 9.90 PRJNA58193 Bacteria Ruegeria sp. TM1040 4153699 3864 3917 12.76 PRJNA51721 Bacteria Ruminococcus albus 7 4482087 3872 4053 28.81 PRJNA39153 Bacteria Ruminococcus bromii L2-63 2249085 1811 2124 18.50 PRJNA39179 Bacteria Ruminococcus champanellensis 18P13 2573208 2114 2354 20.61 PRJNA39167 Bacteria Ruminococcus obeum A2-162 3757491 3155 3493 18.32 PRJNA39149 Bacteria Ruminococcus sp. SR1 5 3545606 3260 3697 23.31 PRJNA39169 Bacteria Ruminococcus torques L2-14 3341681 2798 3071 16.56 PRJNA68317 Bacteria Runella slithyformis DSM 19594 6919729 5974 6001 21.84 PRJNA59055 Bacteria Saccharomonospora viridis DSM 43017 4308349 3828 3930 15.20 PRJNA57921 Bacteria Saccharophagus degradans 2-40 5057531 4007 4141 15.62 PRJNA62947 Bacteria Saccharopolyspora erythraea NRRL 2338 8212805 7198 7282 15.75 PRJNA58513 Bacteria Salinibacter ruber DSM 13855 3587328 2833 3036 17.77 PRJNA47323 Bacteria Salinibacter ruber M8 3832918 3257 3232 20.76 PRJNA58659 Bacteria Salinispora arenicola CNS-205 5786361 4917 5108 19.03 PRJNA58565 Bacteria Salinispora tropica CNB-440 5183331 4536 4687 16.63 PRJNA70155 Bacteria Salmonella bongori NCTC 12419 4460105 3948 4055 6.04

Page 57: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58191 Bacteria Salmonella enterica subsp. arizonae serovar 62:z4,z23:- 4600800 4510 4272 12.34

PRJNA59431 Bacteria Salmonella enterica subsp. enterica serovar Agona str. SL483 4836638 4614 4514 10.89

PRJNA58917 Bacteria Salmonella enterica subsp. enterica serovar Dublin str. CT 02021853 4917459 4617 4692 11.08

PRJNA59247 Bacteria Salmonella enterica subsp. enterica serovar Enteritidis str. P125109 4685848 4318 4370 6.58

PRJNA87035 Bacteria Salmonella enterica subsp. enterica serovar Gallinarum pullorum str. RKS5078 4637962 4325 4478 10.79

PRJNA59249 Bacteria Salmonella enterica subsp. enterica serovar Gallinarum str. 287 91 4658697 4274 4469 7.00

PRJNA162195 Bacteria Salmonella enterica subsp. enterica serovar Heidelberg str. B182 4788046 4334 4492 6.84

PRJNA58973 Bacteria Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 4983515 4779 4684 11.35

PRJNA58831 Bacteria Salmonella enterica subsp. enterica serovar Newport str. SL254 5007719 4805 4702 11.59

PRJNA59269 Bacteria Salmonella enterica subsp. enterica serovar Paratyphi A str. AKU 12601 4581797 4284 4352 7.85

PRJNA58201 Bacteria Salmonella enterica subsp. enterica serovar Paratyphi A str. ATCC 9150 4585229 4093 4356 7.36

PRJNA59097 Bacteria Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7 4858887 5601 4557 18.14

PRJNA59063 Bacteria Salmonella enterica subsp. enterica serovar Paratyphi C strain RKS4594 4888494 4640 4698 10.73

PRJNA58915 Bacteria Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 4823887 4627 4547 11.02

PRJNA57793 Bacteria Salmonella enterica subsp. enterica serovar Typhi str. CT18 5133713 4980 5062 13.42

PRJNA87001 Bacteria Salmonella enterica subsp. enterica serovar Typhi str. P-stx-12 4949783 4924 4819 14.02

PRJNA57973 Bacteria Salmonella enterica subsp. enterica serovar Typhi str. Ty2 4791961 4323 4638 9.21

PRJNA86059 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S 4964097 5474 4656 15.13

Page 58: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA158047 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. 798 4970096 4475 4698 6.41

PRJNA86061 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. D23580 4879400 4521 4566 7.26

PRJNA57799 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 4951371 4554 4637 6.88

PRJNA86645 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344 5067450 4744 4772 8.06

PRJNA84393 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. ST4 74 5067451 4842 4773 9.34

PRJNA84397 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. T000240 5069994 4871 4795 8.29

PRJNA87049 Bacteria Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1 4911145 4555 4587 7.22

PRJNA40845 Bacteria Sanguibacter keddieii DSM 10542 4253413 3710 3713 14.02 PRJNA89375 Bacteria Saprospira grandis str. Lewin 4400185 4251 3488 37.60 PRJNA41865 Bacteria Sebaldella termitidis ATCC 33386 4486650 4150 4194 25.14

PRJNA172737 Bacteria secondary endosymbiont of Ctenarytaina eucalypti 1441139 918 922 26.36

PRJNA172738 Bacteria Secondary endosymbiont of Heteropsylla cubana 1121596 576 581 3.72

PRJNA49049 Bacteria Segniliparus rotundus DSM 44985 3157527 3006 3069 24.76

PRJNA157247 Bacteria Selenomonas ruminantium subsp. lactilytica TAM6421 3631933 3512 3458 22.31

PRJNA55329 Bacteria Selenomonas sputigena ATCC 35185 2568361 2255 2286 12.84 PRJNA67313 Bacteria Serratia plymuthica AS9 5442880 4952 4972 7.90 PRJNA58725 Bacteria Serratia proteamaculans 568 5495657 4942 5068 6.59 PRJNA67315 Bacteria Serratia sp. AS12 5443009 4952 4971 7.90 PRJNA162065 Bacteria Serratia sp. AS13 5442549 4951 4972 7.93 PRJNA82363 Bacteria Serratia symbiotica str. 'Cinara cedri' 1762765 730 741 6.25 PRJNA58257 Bacteria Shewanella amazonensis SB2B 4306142 3645 3722 9.14 PRJNA52601 Bacteria Shewanella baltica BA175 5199401 4344 4429 13.51 PRJNA162025 Bacteria Shewanella baltica OS117 5526018 4668 4906 16.84 PRJNA58259 Bacteria Shewanella baltica OS155 5342896 4489 4690 13.71

Page 59: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58743 Bacteria Shewanella baltica OS185 5312910 4394 4563 14.15 PRJNA58261 Bacteria Shewanella baltica OS195 5547544 4688 4803 15.57 PRJNA58775 Bacteria Shewanella baltica OS223 5358884 4443 4545 14.79 PRJNA50553 Bacteria Shewanella baltica OS678 5368767 4536 4608 14.82 PRJNA58263 Bacteria Shewanella denitrificans OS217 4545906 3754 3901 13.78 PRJNA58265 Bacteria Shewanella frigidimarina NCIMB 400 4845257 4029 4128 11.18 PRJNA59007 Bacteria Shewanella halifaxensis HAW-EB4 5226917 4278 4409 11.93 PRJNA58349 Bacteria Shewanella loihica PV-4 4602594 3859 3901 10.62 PRJNA57949 Bacteria Shewanella oneidensis MR-1 5131416 4779 4618 20.04 PRJNA58705 Bacteria Shewanella pealeana ATCC 700345 5174581 4241 4359 12.21 PRJNA58745 Bacteria Shewanella piezotolerans WP3 5396476 4933 4590 18.19 PRJNA161927 Bacteria Shewanella putrefaciens 200 4840251 4179 4333 15.14 PRJNA58267 Bacteria Shewanella putrefaciens CN-32 4659220 3972 4048 11.46 PRJNA58835 Bacteria Shewanella sediminis HAW-EB3 5517674 4497 4652 11.36 PRJNA58347 Bacteria Shewanella sp. ANA-3 5251146 4360 4526 13.18 PRJNA58345 Bacteria Shewanella sp. MR-4 4706287 3924 4018 10.87 PRJNA58343 Bacteria Shewanella sp. MR-7 4799109 4014 4133 12.15 PRJNA58341 Bacteria Shewanella sp. W3-18-1 4708380 4044 4161 12.94 PRJNA47085 Bacteria Shewanella violacea DSS12 4962103 4346 4087 17.69 PRJNA58721 Bacteria Shewanella woodyi ATCC 51908 5935403 4880 4991 14.40 PRJNA58415 Bacteria Shigella boydii CDC 3083-94 4874659 4557 5113 15.55 PRJNA58215 Bacteria Shigella boydii Sb227 4646520 4290 4818 12.65 PRJNA58213 Bacteria Shigella dysenteriae Sd197 4560911 4508 5213 17.70 PRJNA159233 Bacteria Shigella flexneri 2002017 4894492 4706 5084 13.31 PRJNA57991 Bacteria Shigella flexneri 2a str. 2457T 4599354 4073 4719 10.13 PRJNA62907 Bacteria Shigella flexneri 2a str. 301 4828820 4705 4995 10.80 PRJNA58583 Bacteria Shigella flexneri 5 str. 8401 4574284 4116 4681 9.79 PRJNA84383 Bacteria Shigella sonnei 5220473 0* 5265 15.06 PRJNA58217 Bacteria Shigella sonnei Ss046 5055316 4476 5097 12.92 PRJNA46801 Bacteria Sideroxydans lithotrophicus ES-1 3003656 2980 2969 15.09 PRJNA68451 Bacteria Simkania negevensis Z 2628375 2518 2363 35.71

Page 60: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA86865 Bacteria Sinorhizobium fredii HH103 7221188 6787 6823 16.54 PRJNA59081 Bacteria Sinorhizobium fredii NGR234 6891900 6366 6427 13.67 PRJNA168059 Bacteria Sinorhizobium fredii USDA 257 7032323 6743 6712 16.83 PRJNA58549 Bacteria Sinorhizobium medicae WSM419 6817576 6213 6500 14.35 PRJNA57603 Bacteria Sinorhizobium meliloti 1021 6691694 6234 6299 12.02 PRJNA52607 Bacteria Sinorhizobium meliloti AK83 7140471 6510 6925 15.87 PRJNA52605 Bacteria Sinorhizobium meliloti BL225C 6978785 6354 6590 13.54 PRJNA159685 Bacteria Sinorhizobium meliloti SM11 7173736 7093 6924 17.21 PRJNA59051 Bacteria Slackia heliotrinireducens DSM 20476 3165038 2765 2770 16.64 PRJNA58553 Bacteria Sodalis glossinidius str. 'morsitans' 4292502 2516 5747 23.22 PRJDA168516 Bacteria Solibacillus silvestris StLB046 3984229 3823 3894 15.89 PRJNA81783 Bacteria Solitalea canadensis DSM 3403 5202069 4310 4438 20.21 PRJNA61629 Bacteria Sorangium cellulosum 'So ce 56' 13033779 9384 9662 26.45 PRJNA41997 Bacteria Sphaerobacter thermophilus DSM 20745 3993764 3485 3529 14.44 PRJNA66331 Bacteria Sphaerochaeta coccoides DSM 17374 2227296 1822 1867 12.58 PRJNA63633 Bacteria Sphaerochaeta globus str. Buddy 3316466 3017 3070 14.47 PRJNA82365 Bacteria Sphaerochaeta pleomorpha str. Grapes 3590853 3159 3204 12.76 PRJNA64755 Bacteria Sphingobacterium sp. 21 6226409 5169 5283 16.99 PRJNA52597 Bacteria Sphingobium chlorophenolicum L-1 4573221 4072 4171 12.56 PRJNA47077 Bacteria Sphingobium japonicum UT26S 4424862 4394 4142 17.72 PRJNA73353 Bacteria Sphingobium sp. SYK-6 4348133 4063 4048 14.70 PRJNA58691 Bacteria Sphingomonas wittichii RW1 5915246 5345 5462 11.75 PRJNA58351 Bacteria Sphingopyxis alaskensis RB2256 3373713 3195 3236 12.14 PRJNA81779 Bacteria Spirochaeta africana DSM 8902 3285855 2782 2819 17.85 PRJNA68753 Bacteria Spirochaeta caldaria DSM 7334 3239340 2789 2877 15.18 PRJNA51369 Bacteria Spirochaeta smaragdinae DSM 11293 4653970 4219 4291 13.80 PRJNA53037 Bacteria Spirochaeta thermophila DSM 6192 2472645 2203 2226 13.91 PRJNA162041 Bacteria Spirochaeta thermophila DSM 6578 2560222 2264 2309 12.31 PRJNA43413 Bacteria Spirosoma linguale DSM 74 8491258 6938 7127 21.27 PRJNA46663 Bacteria Stackebrandtia nassauensis DSM 44728 6841557 6379 6438 18.37 PRJNA161969 Bacteria Staphylococcus aureus 04-02981 2821452 2650 2576 11.42

Page 61: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA57661 Bacteria Staphylococcus aureus RF122 2742531 2589 2591 12.08 PRJNA89393 Bacteria Staphylococcus aureus subsp. aureus 2787638 2533 2553 10.72 PRJNA159981 Bacteria Staphylococcus aureus subsp. aureus 11819-97 2868863 2656 2642 11.82 PRJNA162141 Bacteria Staphylococcus aureus subsp. aureus 71193 2715000 2623 2463 12.84 PRJNA57797 Bacteria Staphylococcus aureus subsp. aureus COL 2813862 2676 2570 13.32 PRJNA159389 Bacteria Staphylococcus aureus subsp. aureus ECT-R 2 2759052 2520 2495 10.49 PRJNA159689 Bacteria Staphylococcus aureus subsp. aureus ED133 2832478 2653 2642 13.03 PRJNA41455 Bacteria Staphylococcus aureus subsp. aureus ED98 2847542 2689 2631 13.12

PRJEA162163 Bacteria Staphylococcus aureus subsp. aureus HO 5096 0412 2832299 2602 2611 11.91

PRJNA58457 Bacteria Staphylococcus aureus subsp. aureus JH1 2936936 2780 2732 12.61 PRJNA58455 Bacteria Staphylococcus aureus subsp. aureus JH9 2937129 2726 2732 11.73 PRJNA159691 Bacteria Staphylococcus aureus subsp. aureus JKD6159 2832165 2577 2600 11.45 PRJNA159391 Bacteria Staphylococcus aureus subsp. aureus LGA251 2753827 2475 2501 10.11 PRJNA88065 Bacteria Staphylococcus aureus subsp. aureus M013 2788636 2591 2578 12.46 PRJNA57839 Bacteria Staphylococcus aureus subsp. aureus MRSA252 2902619 2744 2688 12.56 PRJNA57841 Bacteria Staphylococcus aureus subsp. aureus MSSA476 2820454 2643 2565 11.73 PRJNA58817 Bacteria Staphylococcus aureus subsp. aureus Mu3 2880168 2699 2653 11.94 PRJNA57835 Bacteria Staphylococcus aureus subsp. aureus Mu50 2903636 2733 2677 12.29 PRJNA57903 Bacteria Staphylococcus aureus subsp. aureus MW2 2820462 2632 2563 11.84 PRJNA57837 Bacteria Staphylococcus aureus subsp. aureus N315 2839469 2624 2580 10.88

PRJNA57795 Bacteria Staphylococcus aureus subsp. aureus NCTC 8325 2821361 2892 2627 15.56

PRJNA159247 Bacteria Staphylococcus aureus subsp. aureus ST398 2885367 2710 2680 13.08

PRJNA159855 Bacteria Staphylococcus aureus subsp. aureus str. JKD6008 2924344 2680 2714 12.88

PRJNA58839 Bacteria Staphylococcus aureus subsp. aureus str. Newman 2878897 2624 2691 11.40

PRJNA159861 Bacteria Staphylococcus aureus subsp. aureus T0131 2913900 2711 2699 13.51 PRJNA159859 Bacteria Staphylococcus aureus subsp. aureus TCH60 2827166 2700 2607 12.78 PRJNA159241 Bacteria Staphylococcus aureus subsp. aureus TW20 3075806 2869 2883 14.95

PRJNA58555 Bacteria Staphylococcus aureus subsp. aureus USA300 FPR3757 2917469 2604 2691 10.86

Page 62: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58925 Bacteria Staphylococcus aureus subsp. aureus USA300 TCH1516 2903081 2689 2684 13.98

PRJNA88071 Bacteria Staphylococcus aureus subsp. aureus VC40 2692570 2468 2433 9.96

PRJNA59401 Bacteria Staphylococcus carnosus subsp. carnosus TM300 2566424 2462 2491 11.21

PRJNA57861 Bacteria Staphylococcus epidermidis ATCC 12228 2564615 2485 2355 14.42 PRJNA57663 Bacteria Staphylococcus epidermidis RP62A 2643840 2526 2438 16.66 PRJNA62919 Bacteria Staphylococcus haemolyticus JCSC1435 2697861 2694 2597 14.57 PRJNA46233 Bacteria Staphylococcus lugdunensis HKU09-01 2658366 2490 2473 12.17 PRJEA162143 Bacteria Staphylococcus lugdunensis N920143 2595888 2452 2423 11.84 PRJNA162109 Bacteria Staphylococcus pseudintermedius ED99 2572216 2356 2350 11.79 PRJNA62125 Bacteria Staphylococcus pseudintermedius HKU10-03 2617381 2450 2450 12.80

PRJNA58411 Bacteria Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 2577899 2514 2491 11.07

PRJNA48815 Bacteria Starkeya novella DSM 506 4765023 4431 4486 10.91 PRJEA162199 Bacteria Stenotrophomonas maltophilia D457 4769156 4130 4321 14.77 PRJNA72473 Bacteria Stenotrophomonas maltophilia JV3 4544477 4063 4083 15.11 PRJNA61647 Bacteria Stenotrophomonas maltophilia K279a 4851126 4430 4422 17.35 PRJNA58657 Bacteria Stenotrophomonas maltophilia R551-3 4573969 4039 4058 13.60 PRJNA158509 Bacteria Stigmatella aurantiaca DW4 3-1 10260756 8352 8288 24.22 PRJNA41863 Bacteria Streptobacillus moniliformis DSM 12112 1673280 1442 1537 21.35 PRJNA57943 Bacteria Streptococcus agalactiae 2603V R 2160267 2124 2110 12.85 PRJNA57935 Bacteria Streptococcus agalactiae A909 2127839 1996 2073 9.83 PRJNA61585 Bacteria Streptococcus agalactiae NEM316 2211485 2134 2140 14.72

PRJNA161979 Bacteria Streptococcus dysgalactiae subsp. equisimilis ATCC 12394 2159491 2056 2152 12.67

PRJNA59103 Bacteria Streptococcus dysgalactiae subsp. equisimilis GGS 124 2106340 2100 2097 11.79

PRJNA59259 Bacteria Streptococcus equi subsp. equi 4047 2253793 2137 2167 16.89 PRJNA59261 Bacteria Streptococcus equi subsp. zooepidemicus 2149868 1960 1958 13.96

PRJNA162155 Bacteria Streptococcus equi subsp. zooepidemicus ATCC 35246 2167264 2087 2006 17.42

PRJNA59263 Bacteria Streptococcus equi subsp. zooepidemicus 2024171 1893 1860 13.86

Page 63: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

MGCS10565

PRJDA162103 Bacteria Streptococcus gallolyticus subsp. gallolyticus ATCC 43143 2362241 2295 2284 12.71

PRJNA63617 Bacteria Streptococcus gallolyticus subsp. gallolyticus ATCC BAA-2069 2377209 2330 2296 14.05

PRJNA46061 Bacteria Streptococcus gallolyticus UCN34 2350911 2261 2256 11.64 PRJNA57667 Bacteria Streptococcus gordonii str. Challis substr. CH1 2196662 2051 2052 13.38

PRJNA87033 Bacteria Streptococcus infantarius subsp. infantarius CJ18 2008249 1902 2088 14.09

PRJDA168614 Bacteria Streptococcus intermedius JTH08 1933610 1705 1842 10.85 PRJNA81631 Bacteria Streptococcus macedonicus 2142762 2209 2143 13.92 PRJNA46097 Bacteria Streptococcus mitis B6 2146611 2018 2043 13.25 PRJNA169223 Bacteria Streptococcus mutans GS-5 2027088 1878 1896 9.54 PRJDA162197 Bacteria Streptococcus mutans LJ23 2015626 1921 1868 9.79 PRJNA46353 Bacteria Streptococcus mutans NN2025 2013587 1895 1869 8.98 PRJNA57947 Bacteria Streptococcus mutans UA159 2032925 1962 1896 10.68 PRJNA65449 Bacteria Streptococcus oralis Uo5 1958690 1909 1865 14.18 PRJNA49313 Bacteria Streptococcus parasanguinis ATCC 15912 2153652 2022 1980 12.82 PRJNA163997 Bacteria Streptococcus parasanguinis FW213 2171609 2019 2004 12.90 PRJNA67355 Bacteria Streptococcus parauberis KCTC 11537 2143887 1868 2350 11.31 PRJNA68019 Bacteria Streptococcus pasteurianus ATCC 43144 2100077 2026 2008 11.06 PRJNA52533 Bacteria Streptococcus pneumoniae 670-6B 2240045 2366 2281 16.72 PRJNA59125 Bacteria Streptococcus pneumoniae 70585 2184682 2202 2195 15.26 PRJNA52453 Bacteria Streptococcus pneumoniae AP200 2130580 2216 2120 13.31 PRJNA59287 Bacteria Streptococcus pneumoniae ATCC 700669 2221315 2135 2183 12.92 PRJNA59181 Bacteria Streptococcus pneumoniae CGSP14 2209198 2206 2161 13.28 PRJNA58581 Bacteria Streptococcus pneumoniae D39 2046115 1914 2000 11.57 PRJNA59167 Bacteria Streptococcus pneumoniae G54 2078953 2115 2069 13.31 PRJNA59117 Bacteria Streptococcus pneumoniae Hungary19A-6 2245615 2155 2243 15.39 PRJNA162039 Bacteria Streptococcus pneumoniae INV104 2142122 1939 2133 11.81 PRJNA162035 Bacteria Streptococcus pneumoniae INV200 2093317 2049 2044 12.09 PRJNA59121 Bacteria Streptococcus pneumoniae JJA 2120234 2123 2097 14.12

Page 64: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA162037 Bacteria Streptococcus pneumoniae OXC141 2036867 1973 2040 12.06 PRJNA59123 Bacteria Streptococcus pneumoniae P1031 2111882 2073 2104 14.68 PRJNA57859 Bacteria Streptococcus pneumoniae R6 2038615 2043 1995 11.59 PRJNA50809 Bacteria Streptococcus pneumoniae SPN032672 2131190 0* 2149 13.59 PRJNA50811 Bacteria Streptococcus pneumoniae SPN033038 2133496 0* 2160 13.56 PRJNA50793 Bacteria Streptococcus pneumoniae SPN034156 2024476 0* 2017 12.00 PRJNA50795 Bacteria Streptococcus pneumoniae SPN034183 2037254 0* 2046 12.56 PRJNA50799 Bacteria Streptococcus pneumoniae SPN994038 2027108 0* 2030 12.32 PRJNA50801 Bacteria Streptococcus pneumoniae SPN994039 2025562 0* 2030 12.36 PRJNA162191 Bacteria Streptococcus pneumoniae ST556 2145902 2148 2126 13.92 PRJNA59119 Bacteria Streptococcus pneumoniae Taiwan19F-14 2112148 2044 2084 13.61 PRJNA49735 Bacteria Streptococcus pneumoniae TCH8431 19A 2088772 2275 2061 15.34 PRJNA57857 Bacteria Streptococcus pneumoniae TIGR4 2160842 2125 2116 14.62 PRJNA71153 Bacteria Streptococcus pseudopneumoniae IS7493 2195458 2236 2163 16.87 PRJNA162171 Bacteria Streptococcus pyogenes Alab49 1827308 1773 1758 12.32 PRJNA57845 Bacteria Streptococcus pyogenes M1 GAS 1852441 1696 1774 10.89 PRJNA58571 Bacteria Streptococcus pyogenes MGAS10270 1928252 1987 1886 14.87 PRJNA58105 Bacteria Streptococcus pyogenes MGAS10394 1899877 1886 1833 13.96 PRJNA58575 Bacteria Streptococcus pyogenes MGAS10750 1937111 1979 1889 14.22 PRJNA158037 Bacteria Streptococcus pyogenes MGAS15252 1750832 1662 1628 10.88 PRJNA66081 Bacteria Streptococcus pyogenes MGAS1882 1781029 1691 1672 11.21 PRJNA58573 Bacteria Streptococcus pyogenes MGAS2096 1860355 1898 1809 13.11 PRJNA57911 Bacteria Streptococcus pyogenes MGAS315 1900521 1865 1883 14.14 PRJNA58337 Bacteria Streptococcus pyogenes MGAS5005 1838554 1865 1781 13.85 PRJNA58335 Bacteria Streptococcus pyogenes MGAS6180 1897573 1894 1827 14.38 PRJNA57871 Bacteria Streptococcus pyogenes MGAS8232 1895017 1845 1888 13.29 PRJNA58569 Bacteria Streptococcus pyogenes MGAS9429 1836467 1877 1753 13.99 PRJNA59035 Bacteria Streptococcus pyogenes NZ131 1815785 1703 1732 10.95 PRJNA57895 Bacteria Streptococcus pyogenes SSI-1 1894275 1861 1882 14.85 PRJNA57847 Bacteria Streptococcus pyogenes str. Manfredo 1841271 1819 1805 12.94 PRJNA162151 Bacteria Streptococcus salivarius 57 I 2179563 1996 1999 12.22

Page 65: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA70481 Bacteria Streptococcus salivarius CCHSS3 2217184 2032 2004 12.78 PRJEA162145 Bacteria Streptococcus salivarius JIM8777 2210574 1979 1964 12.99 PRJNA58381 Bacteria Streptococcus sanguinis SK36 2388435 2270 2271 13.48 PRJNA58663 Bacteria Streptococcus suis 05ZYH33 2096309 2186 2114 14.14 PRJNA58665 Bacteria Streptococcus suis 98HAH33 2095698 2189 2118 14.30 PRJNA162111 Bacteria Streptococcus suis A7 2038409 1974 1933 12.52 PRJNA59321 Bacteria Streptococcus suis BM407 2170808 2058 2064 12.21 PRJNA162127 Bacteria Streptococcus suis D12 2183059 2078 2112 14.99 PRJNA162125 Bacteria Streptococcus suis D9 2177656 2074 2106 14.83 PRJNA161937 Bacteria Streptococcus suis GZ1 2038034 1978 1940 12.12 PRJNA162095 Bacteria Streptococcus suis JS14 2137435 2066 2045 13.38 PRJEA32235 Bacteria Streptococcus suis P1 7 2007491 1908 1893 11.31 PRJNA174333 Bacteria Streptococcus suis S735 1980887 1882 1853 10.47 PRJNA59323 Bacteria Streptococcus suis SC84 2095898 1985 1979 11.73 PRJNA162123 Bacteria Streptococcus suis SS12 2096866 2079 2011 14.52 PRJNA167482 Bacteria Streptococcus suis ST1 2034321 1987 1979 14.12 PRJNA66327 Bacteria Streptococcus suis ST3 2028815 1952 1942 13.25 PRJNA58221 Bacteria Streptococcus thermophilus CNRZ1066 1796226 1915 1895 13.96 PRJEA162157 Bacteria Streptococcus thermophilus JIM 8232 1929905 2145 1990 16.66 PRJNA58327 Bacteria Streptococcus thermophilus LMD-9 1864178 1716 1956 16.42 PRJNA58219 Bacteria Streptococcus thermophilus LMG 18311 1796846 1889 1897 13.87 PRJNA166827 Bacteria Streptococcus thermophilus MN-ZLW-002 1848520 1910 1941 15.66 PRJNA162015 Bacteria Streptococcus thermophilus ND03 1831949 1919 1939 15.99 PRJNA57959 Bacteria Streptococcus uberis 0140J 1852352 1825 1813 9.37 PRJNA57739 Bacteria Streptomyces avermitilis MA-4680 9119895 7676 7913 18.62 PRJNA82931 Bacteria Streptomyces bingchenggensis BCW-1 11936683 10022 9964 17.35

PRJNA162187 Bacteria Streptomyces cattleya NRRL 8057 = DSM 46488 8095515 7569 7159 22.02

PRJNA77117 Bacteria Streptomyces cattleya NRRL 8057 = DSM 46488 8092553 7483 7144 21.06

PRJNA57801 Bacteria Streptomyces coelicolor A3(2) 9054847 8215 8159 19.38 PRJNA40839 Bacteria Streptomyces flavogriseus ATCC 33331 7656104 6572 6739 17.41

Page 66: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58983 Bacteria Streptomyces griseus subsp. griseus NBRC 13350 8545929 7138 7118 17.16

PRJNA89409 Bacteria Streptomyces hygroscopicus subsp. jinggangensis 5008 10383684 9108 9220 19.87

PRJNA46531 Bacteria Streptomyces scabiei 87.22 10148695 8809 8790 20.08 PRJNA72627 Bacteria Streptomyces sp. SirexAA-E 7414440 6357 6499 15.92 PRJNA62209 Bacteria Streptomyces venezuelae ATCC 10712 8226158 7453 7351 18.25 PRJNA52609 Bacteria Streptomyces violaceusniger Tu 4113 11138313 8985 9432 18.73 PRJNA42521 Bacteria Streptosporangium roseum DSM 43021 10369518 8975 9229 16.96 PRJNA88061 Bacteria Sulfobacillus acidophilus DSM 10332 3557831 3471 3715 19.62 PRJNA68841 Bacteria Sulfobacillus acidophilus TPY 3551206 3754 3717 21.50 PRJNA60789 Bacteria Sulfuricurvum kujiense DSM 16994 2819357 2798 2827 17.83 PRJNA58121 Bacteria Sulfurihydrogenibium azorense Az-Fu1 1640877 1725 1717 12.38 PRJNA58855 Bacteria Sulfurihydrogenibium sp. YO3AOP1 1838442 1721 1815 12.19 PRJNA53043 Bacteria Sulfurimonas autotrophica DSM 16294 2153198 2158 2168 15.56 PRJNA58185 Bacteria Sulfurimonas denitrificans DSM 1251 2201561 2096 2168 14.00 PRJNA168117 Bacteria Sulfurospirillum barnesii SES-3 2510109 2491 2521 14.03 PRJNA41861 Bacteria Sulfurospirillum deleyianum DSM 6946 2306351 2265 2286 14.37 PRJNA58863 Bacteria Sulfurovum sp. NBC37-1 2562277 2466 2552 16.30 PRJNA58165 Bacteria Symbiobacterium thermophilum IAM 14863 3566135 3337 3296 15.05 PRJNA58235 Bacteria Synechococcus elongatus PCC 6301 2696255 2525 2682 14.73 PRJNA58045 Bacteria Synechococcus elongatus PCC 7942 2742269 2661 2723 15.88 PRJNA58123 Bacteria Synechococcus sp. CC9311 2606748 2892 2879 28.56 PRJNA58319 Bacteria Synechococcus sp. CC9605 2510659 2638 2893 27.79 PRJNA58323 Bacteria Synechococcus sp. CC9902 2234828 2304 2460 21.37 PRJNA58537 Bacteria Synechococcus sp. JA-2-3B'a(2-13) 3046682 2862 2845 18.21 PRJNA58535 Bacteria Synechococcus sp. JA-3-3Ab 2932766 2760 2748 16.90 PRJNA59137 Bacteria Synechococcus sp. PCC 7002 3409935 3186 3233 17.87 PRJNA61609 Bacteria Synechococcus sp. RCC307 2224914 2535 2541 23.98 PRJNA61607 Bacteria Synechococcus sp. WH 7803 2366980 2533 2558 20.78 PRJNA61581 Bacteria Synechococcus sp. WH 8102 2434428 2526 2720 23.01 PRJNA159873 Bacteria Synechocystis sp. PCC 6803 3703945 3334 3451 14.96

Page 67: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA57659 Bacteria Synechocystis sp. PCC 6803 3947019 3564 3709 17.04 PRJNA158059 Bacteria Synechocystis sp. PCC 6803 substr. PCC-N 3570103 3169 3299 14.46 PRJNA159835 Bacteria Synechocystis sp. PCC 6803 substr. PCC-P 3570114 3169 3297 14.43 PRJNA45959 Bacteria Synergistetes sp SGP1 2728333 1435 2352 21.31 PRJNA58177 Bacteria Syntrophobacter fumaroxidans MPOB 4990251 4064 4143 18.09 PRJNA63343 Bacteria Syntrophobotulus glycolicus DSM 8271 3406739 3251 3340 17.49

PRJNA58179 Bacteria Syntrophomonas wolfei subsp. wolfei str. Goettingen 2936195 2504 2652 13.52

PRJNA49527 Bacteria Syntrophothermus lipocalidus DSM 12680 2405559 2313 2385 13.50 PRJNA58539 Bacteria Syntrophus aciditrophicus SB 3179300 3168 2933 19.69 PRJNA83157 Bacteria Tannerella forsythia ATCC 43037 3405521 3001 2861 24.92 PRJNA73771 Bacteria Taylorella asinigenitalis MCE3 1638559 1524 1529 12.25 PRJNA170255 Bacteria Taylorella equigenitalis ATCC 35865 1732123 1542 1595 12.21 PRJNA62103 Bacteria Taylorella equigenitalis MCE9 1695860 1556 1546 11.32 PRJNA66873 Bacteria Tepidanaerobacter acetatoxydans Re1 2759867 2524 2602 10.98 PRJNA59267 Bacteria Teredinibacter turnerae T7901 5193164 4255 4217 18.18 PRJNA168183 Bacteria Terriglobus roseus DSM 18391 5227858 4245 4346 23.62 PRJNA53251 Bacteria Terriglobus saanensis SP1PR4 5095226 4180 4268 16.76 PRJNA74441 Bacteria Tetragenococcus halophilus NBRC 12172 2562720 2555 2520 12.41 PRJNA58987 Bacteria Thauera sp. MZ1T 4574586 3978 4052 11.17 PRJNA61727 Bacteria Thermaerobacter marianensis DSM 12885 2844696 2327 2381 11.64

PRJNA41925 Bacteria Thermanaerovibrio acidaminovorans DSM 6589 1848474 1738 1762 8.54

PRJNA48823 Bacteria Thermincola potens JR 3157416 2949 3006 13.82

PRJNA55639 Bacteria Thermoanaerobacter brockii subsp. finnii Ako-1 2344824 2209 2310 10.02

PRJNA46241 Bacteria Thermoanaerobacter italicus Ab9 2451061 2270 2377 11.17

PRJNA58339 Bacteria Thermoanaerobacter pseudethanolicus ATCC 33223 2362816 2243 2321 10.06

PRJNA53065 Bacteria Thermoanaerobacter sp. X513 2456520 2331 2423 9.97 PRJNA58589 Bacteria Thermoanaerobacter sp. X514 2457259 2349 2430 9.83 PRJNA57813 Bacteria Thermoanaerobacter tengcongensis MB4 2689445 2588 2668 12.65

Page 68: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA52581 Bacteria Thermoanaerobacter wiegelii Rt8.B1 2785056 2522 2813 10.16

PRJNA167781 Bacteria Thermoanaerobacterium saccharolyticum JW SL-YS485 2879349 2840 2889 15.40

PRJNA51639 Bacteria Thermoanaerobacterium thermosaccharolyticum DSM 571 2785752 2601 2741 12.80

PRJNA63163 Bacteria Thermoanaerobacterium xylanolyticum LX-11 2534358 2345 2413 10.07 PRJNA42011 Bacteria Thermobaculum terrenum ATCC BAA-798 3101581 2832 2871 13.36 PRJNA57703 Bacteria Thermobifida fusca YX 3642249 3110 3146 15.17 PRJNA48999 Bacteria Thermobispora bispora DSM 43833 4189976 3546 3598 14.31 PRJNA46231 Bacteria Thermocrinis albus DSM 14484 1500577 1593 1621 11.67 PRJNA68285 Bacteria Thermodesulfatator indicus DSM 15286 2322224 2195 2245 10.86 PRJNA68283 Bacteria Thermodesulfobacterium sp. OPB45 1634377 1596 1639 10.08 PRJNA66601 Bacteria Thermodesulfobium narugense DSM 14796 1898865 1807 1854 10.65 PRJNA59257 Bacteria Thermodesulfovibrio yellowstonii DSM 11347 2003803 2033 2009 11.36 PRJNA59341 Bacteria Thermomicrobium roseum DSM 5159 2920744 2859 2649 18.54 PRJNA41885 Bacteria Thermomonospora curvata DSM 43183 5639016 4890 4970 17.70 PRJNA51421 Bacteria Thermosediminibacter oceani DSM 16646 2280035 2197 2291 11.81 PRJNA59095 Bacteria Thermosipho africanus TCF52B 2016657 1956 2001 14.96 PRJNA58683 Bacteria Thermosipho melanesiensis BI429 1915238 1879 1922 14.81 PRJNA57907 Bacteria Thermosynechococcus elongatus BP-1 2593857 2475 2518 14.14 PRJNA58419 Bacteria Thermotoga lettingae TMO 2135342 2040 2079 8.30 PRJNA57723 Bacteria Thermotoga maritima MSB8 1860725 1846 1874 9.84 PRJNA42777 Bacteria Thermotoga naphthophila RKU-10 1809823 1768 1825 9.24 PRJNA59065 Bacteria Thermotoga neapolitana DSM 4359 1884562 1937 1914 10.83 PRJNA58655 Bacteria Thermotoga petrophila RKU-1 1823511 1785 1814 9.14 PRJNA58935 Bacteria Thermotoga sp. RQ2 1877693 1819 1864 8.63 PRJNA68449 Bacteria Thermotoga thermarum DSM 5069 2039943 1945 2044 11.53 PRJNA62095 Bacteria Thermovibrio ammonificans HB-1 1759526 1813 1831 17.48 PRJNA77129 Bacteria Thermovirga lienii DSM 17291 1999646 1875 1938 9.07 PRJNA62273 Bacteria Thermus scotoductus SA-01 2355186 2461 2457 15.19 PRJNA81197 Bacteria Thermus sp. CCB US3 UF1 2263488 2279 2266 12.52 PRJNA58033 Bacteria Thermus thermophilus HB27 2127482 2210 2224 11.95

Page 69: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA58223 Bacteria Thermus thermophilus HB8 2116056 2238 2223 12.24 PRJNA162129 Bacteria Thermus thermophilus JL-18 2311212 2402 2459 14.22 PRJNA159537 Bacteria Thermus thermophilus SG0.5JP17-16 2303227 2339 2451 13.63 PRJNA67391 Bacteria Thioalkalimicrobium cyclicum ALM1 1932455 1665 1683 8.60 PRJNA46181 Bacteria Thioalkalivibrio sp. K90mix 2985056 2855 2899 15.17 PRJNA59179 Bacteria Thioalkalivibrio sulfidophilus HL-EbGr7 3464554 3283 3318 11.83 PRJNA58189 Bacteria Thiobacillus denitrificans ATCC 25259 2909809 2827 2814 11.68 PRJNA74025 Bacteria Thiocystis violascens DSM 198 5017071 4330 4574 19.22 PRJNA58183 Bacteria Thiomicrospira crunogena XCL-2 2427734 2196 2270 11.02 PRJNA48825 Bacteria Thiomonas intermedia K12 3462095 3172 3254 10.83 PRJEA50687 Bacteria Thiomonas sp. 3As 3785534 3699 3584 14.84 PRJNA167486 Bacteria Tistrella mobilis KA081020-065 6513401 5785 5841 10.19 PRJNA59395 Bacteria Tolumonas auensis DSM 9187 3471292 3130 3177 8.53 PRJNA67365 Bacteria Treponema azotonutricium ZAS-9 3855671 3474 3397 20.54 PRJNA66607 Bacteria Treponema brennaborense DSM 12168 3055580 2531 2581 15.06 PRJNA57583 Bacteria Treponema denticola ATCC 35405 2843201 2767 2577 23.67 PRJNA87065 Bacteria Treponema pallidum subsp. pallidum DAL-1 1139971 1059 979 18.74 PRJNA58977 Bacteria Treponema pallidum subsp. pallidum SS14 1139457 1028 1003 18.56

PRJNA159543 Bacteria Treponema pallidum subsp. pallidum str. Chicago 1139281 982 980 18.60

PRJNA57585 Bacteria Treponema pallidum subsp. pallidum str. Nichols 1138011 1031 1006 18.95

PRJNA87051 Bacteria Treponema pallidum subsp. pertenue str. CDC2 1139744 1068 982 19.17

PRJNA87067 Bacteria Treponema pallidum subsp. pertenue str. Gauthier 1139417 1068 981 19.23

PRJNA87069 Bacteria Treponema pallidum subsp. pertenue str. SamoaD 1139330 1068 981 19.28

PRJNA68447 Bacteria Treponema paraluiscuniculi Cuniculi A 1133390 1010 988 17.62 PRJNA67367 Bacteria Treponema primitia ZAS-2 4059867 3522 3506 19.19 PRJNA65781 Bacteria Treponema succinifaciens DSM 2489 2897425 2608 2752 25.86 PRJNA57925 Bacteria Trichodesmium erythraeum IMS101 7750108 4451 5265 25.90 PRJNA57705 Bacteria Tropheryma whipplei str. Twist 927303 808 939 19.98

Page 70: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA57961 Bacteria Tropheryma whipplei TW08 27 925938 784 945 19.32 PRJNA49533 Bacteria Truepera radiovictrix DSM 17093 3260398 2945 2989 14.39 PRJNA48829 Bacteria Tsukamurella paurometabola DSM 20162 4479724 4242 4331 15.26 PRJNA168321 Bacteria Turneriella parva DSM 21527 4409302 4139 4181 26.59

PRJNA59059 Bacteria uncultured Termite group 1 bacterium phylotype Rs-D17 1148570 776 1149 19.84

PRJNA58887 Bacteria Ureaplasma parvum serovar 3 str. ATCC 27815 751679 609 608 23.58

PRJNA57711 Bacteria Ureaplasma parvum serovar 3 str. ATCC 700970 751719 611 620 24.37

PRJNA59011 Bacteria Ureaplasma urealyticum serovar 10 str. ATCC 33699 874478 646 660 26.72

PRJNA62107 Bacteria Variovorax paradoxus EPS 6550056 5952 6007 12.15 PRJNA59437 Bacteria Variovorax paradoxus S110 6754997 6279 6294 10.46 PRJNA41927 Bacteria Veillonella parvula DSM 2008 2132142 1844 1866 12.70 PRJNA58675 Bacteria Verminephrobacter eiseniae EF01-2 5597943 4947 4944 11.30 PRJNA66297 Bacteria Verrucosispora maris AB-18-032 6732271 6009 5996 18.86 PRJNA68057 Bacteria Vibrio anguillarum 775 4052047 3731 3671 12.69 PRJNA89389 Bacteria Vibrio cholerae IEC224 4079586 3664 3611 9.48 PRJNA159541 Bacteria Vibrio cholerae LMA3984-4 3738715 3153 3447 7.86 PRJNA59355 Bacteria Vibrio cholerae M66-2 3938905 3693 3445 10.87 PRJNA59387 Bacteria Vibrio cholerae MJ-1236 4236368 3774 3758 11.26 PRJNA57623 Bacteria Vibrio cholerae O1 biovar El Tor str. N16961 4033464 3828 3592 12.57 PRJNA78933 Bacteria Vibrio cholerae O1 str. 2010EL-1786 4077740 3845 3587 11.61 PRJNA159869 Bacteria Vibrio cholerae O395 4135300 3934 3738 14.17 PRJNA58425 Bacteria Vibrio cholerae O395 4132319 3875 3736 12.61 PRJNA58163 Bacteria Vibrio fischeri ES114 4273718 3823 3821 10.98 PRJNA58907 Bacteria Vibrio fischeri MJ11 4503336 4039 3994 14.13 PRJNA82347 Bacteria Vibrio furnissii NCTC 11218 4916408 4462 4544 9.37 PRJNA58957 Bacteria Vibrio harveyi ATCC BAA-1116 6058377 6064 5633 21.13 PRJNA57969 Bacteria Vibrio parahaemolyticus RIMD 2210633 5165770 4832 4583 14.14 PRJNA83161 Bacteria Vibrio sp. EJY3 5452646 4786 4814 10.49 PRJNA41601 Bacteria Vibrio sp. Ex25 5089025 4518 4486 10.90

Page 71: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59353 Bacteria Vibrio splendidus LGP32 4974818 4434 4330 13.13 PRJNA62909 Bacteria Vibrio vulnificus CMCP6 5126696 4433 4518 11.78 PRJNA62243 Bacteria Vibrio vulnificus MO6-24 O 5007768 4562 4415 13.01 PRJNA58007 Bacteria Vibrio vulnificus YJ016 5260086 5028 4639 16.04 PRJNA49531 Bacteria Waddlia chondrophila WSU 86-1044 2131905 1956 1915 26.35 PRJNA63627 Bacteria Weeksella virosa DSM 16922 2272954 2049 2118 17.35

PRJNA57853 Bacteria Wigglesworthia glossinidia endosymbiont of Glossina brevipalpis 703004 611 644 0.96

PRJNA88075 Bacteria Wigglesworthia glossinidia endosymbiont of Glossina morsitans morsitans (Yale colony) 719535 635 652 1.48

PRJNA61645 Bacteria Wolbachia endosymbiont of Culex quinquefasciatus Pel 1482455 1385 1416 28.85

PRJNA57851 Bacteria Wolbachia endosymbiont of Drosophila melanogaster 1267782 1195 1273 27.15

PRJEA171829 Bacteria Wolbachia endosymbiont of Onchocerca ochengi 957990 647 795 17.48

PRJNA58107 Bacteria Wolbachia endosymbiont strain TRS of Brugia malayi 1080084 805 1146 29.78

PRJNA59371 Bacteria Wolbachia sp. wRi 1445873 1150 1395 23.54 PRJNA61591 Bacteria Wolinella succinogenes DSM 1740 2110355 2044 2074 11.66 PRJNA58453 Bacteria Xanthobacter autotrophicus Py2 5625098 5035 5178 14.25 PRJNA43163 Bacteria Xanthomonas albilineans GPE PC73 3768695 3114 3117 14.93 PRJNA57889 Bacteria Xanthomonas axonopodis pv. citri str. 306 5274174 4429 4475 15.99 PRJNA73179 Bacteria Xanthomonas axonopodis pv. citrumelo F1 4967469 4181 4100 15.14 PRJNA61643 Bacteria Xanthomonas campestris pv. campestris 5079002 4513 4317 16.58

PRJNA57595 Bacteria Xanthomonas campestris pv. campestris str. 8004 5148708 4273 4353 14.99

PRJNA57887 Bacteria Xanthomonas campestris pv. campestris str. ATCC 33913 5076188 4181 4294 14.43

PRJNA159539 Bacteria Xanthomonas campestris pv. raphani 756C 4941214 4520 4144 18.87

PRJNA58321 Bacteria Xanthomonas campestris pv. vesicatoria str. 85-10 5420152 4726 4680 18.42

PRJNA58155 Bacteria Xanthomonas oryzae pv. oryzae KACC 10331 4941439 4538 4834 18.29 PRJNA58547 Bacteria Xanthomonas oryzae pv. oryzae MAFF 311018 4940217 4372 4746 16.67

Page 72: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA59131 Bacteria Xanthomonas oryzae pv. oryzae PXO99A 5240075 4988 4980 22.75 PRJNA54411 Bacteria Xanthomonas oryzae pv. oryzicola BLS256 4831739 4474 4303 19.78 PRJNA46345 Bacteria Xenorhabdus bovienii SS-2004 4225498 4362 3793 21.07 PRJNA49133 Bacteria Xenorhabdus nematophila ATCC 19061 4587917 4767 4135 19.87 PRJNA41935 Bacteria Xylanimonas cellulosilytica DSM 15894 3831380 3443 3463 17.04 PRJNA57849 Bacteria Xylella fastidiosa 9a5c 2731750 2832 2659 31.20 PRJNA58763 Bacteria Xylella fastidiosa M12 2475130 2104 2348 24.10 PRJNA58809 Bacteria Xylella fastidiosa M23 2573987 2201 2453 25.18 PRJNA162023 Bacteria Xylella fastidiosa subsp. fastidiosa GB514 2517383 2216 2420 25.43 PRJNA57869 Bacteria Xylella fastidiosa Temecula1 2521148 2036 2412 23.16

PRJNA57741 Bacteria Yersinia enterocolitica subsp. enterocolitica 8081 4683620 4137 4249 7.74

PRJNA63663 Bacteria Yersinia enterocolitica subsp. palearctica 105.5R(r) 4621811 4021 4179 8.77

PRJEA162069 Bacteria Yersinia enterocolitica subsp. palearctica Y11 4625880 4459 4257 11.35 PRJNA158119 Bacteria Yersinia pestis A1122 4658411 4247 4143 14.30 PRJNA58485 Bacteria Yersinia pestis Angola 4687014 4045 4286 16.08 PRJNA58607 Bacteria Yersinia pestis Antiqua 4879836 4364 4368 14.12 PRJNA158537 Bacteria Yersinia pestis biovar Medievalis str. Harbin 35 4709501 4436 4254 16.57 PRJNA58037 Bacteria Yersinia pestis biovar Microtus str. 91001 4803217 4142 4250 13.20 PRJNA57621 Bacteria Yersinia pestis CO92 4829855 4217 4299 13.57 PRJNA158071 Bacteria Yersinia pestis D106004 4812922 3783 4358 10.35 PRJNA158073 Bacteria Yersinia pestis D182038 4802263 3787 4405 10.90 PRJNA57875 Bacteria Yersinia pestis KIM10+ 4701745 4205 4150 14.90 PRJNA58609 Bacteria Yersinia pestis Nepal516 4646286 4094 4112 12.49 PRJNA58619 Bacteria Yersinia pestis Pestoides F 4725862 4069 4223 11.71 PRJNA47317 Bacteria Yersinia pestis Z176003 4725788 3693 4233 10.94 PRJNA58487 Bacteria Yersinia pseudotuberculosis IP 31758 4935125 4324 4265 13.92 PRJNA58157 Bacteria Yersinia pseudotuberculosis IP 32953 4840898 4116 4189 10.36 PRJNA59153 Bacteria Yersinia pseudotuberculosis PB1 + 4765431 4237 4112 11.95 PRJNA59151 Bacteria Yersinia pseudotuberculosis YPIII 4689441 4192 4085 12.40 PRJNA70621 Bacteria Zobellia galactanivorans 5521712 4738 4550 19.68

Page 73: Table S1. List of 2110 prokaryotic genomes used in this ...oksana/PhD_Thesis/Supplementary... · PRJNA168259 Thermococcus sp. CL1Archaea 1950313 2017 2079 20.12 PRJNA58563 Thermofilum

PRJNA48073 Bacteria Zunongwangia profunda SM-A87 5128187 4653 4489 21.03

PRJNA55403 Bacteria Zymomonas mobilis subsp. mobilis ATCC 10988 2143461 1803 1865 11.53

PRJNA170612 Bacteria Zymomonas mobilis subsp. mobilis ATCC 29191 2008345 1709 1781 11.29

PRJNA41019 Bacteria Zymomonas mobilis subsp. mobilis NCIMB 11163 2223520 1884 1918 10.99

PRJNA58095 Bacteria Zymomonas mobilis subsp. mobilis ZM4 2056363 1737 1753 9.20

PRJNA68445 Bacteria Zymomonas mobilis subsp. pomaceae ATCC 29192 2061413 1748 1774 10.16