oasis: an automated program for global investigation of ... · oasis: an automated program for...

49
OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson 1 , Ming-Chun Lee 1 and Christopher J. Marx 1,2, * 1 Department of Organismic and Evolutionary Biology and 2 Faculty of Arts and Sciences Center for Systems Biology, Harvard University, Cambridge, MA 02138, USA Received March 31, 2012; Revised July 6, 2012; Accepted July 24, 2012 ABSTRACT Insertion sequences (ISs) are simple transposable elements present in most bacterial and archaeal genomes and play an important role in genomic evo- lution. The recent expansion of sequenced genomes offers the opportunity to study ISs comprehensively, but this requires efficient and accurate tools for IS annotation. We have developed an open-source program called OASIS, or Optimized Annotation System for Insertion Sequences, which automatic- ally annotates ISs within sequenced genomes. OASIS annotations of 1737 bacterial and archaeal genomes offered an unprecedented opportunity to examine IS evolution. At a broad scale, we found that most IS families are quite widespread; however, they are not present randomly across taxa. This may indicate differential loss, barriers to exchange and/or insufficient time to equilibrate across clades. The number of ISs increases with genome length, but there is both tremendous vari- ation and no increase in IS density for genomes >2 Mb. At the finer scale of recently diverged genomes, the proportion of shared IS content falls sharply, suggesting loss and/or emergence of barriers to successful cross-infection occurs rapidly. Surprisingly, even after controlling for 16S rRNA sequence divergence, the same ISs were more likely to be shared between genomes labeled as the same species rather than as different species. INTRODUCTION The ever-increasing number of sequenced bacterial and archaeal genomes provides a valuable opportunity to understand genome architecture and evolution. However, as new high-throughput sequencing methods are developed, genome annotation quickly becomes the bottleneck for genomic research. Despite the development of various annotation programs for particular genomic features, some important features such as insertion se- quences (ISs), the smallest and simplest autonomous mobile genetic elements, remain poorly annotated. ISs are short regions of DNA, usually between 700 and 3000 bp long that can move or copy themselves within a genome through self-transposition. The majority of ISs possess one or two open reading frames (ORFs) that encode a transposase. These ORFs are surrounded by linker regions that frequently end with short-terminal inverted repeats (IRs) ranging from 7 to 20 bp in length. Upon insertion, ISs often generate short directed repeats from 2 to 14 bp immediately outside the IRs (1). Despite considerable sequence divergence, ISs can be classified into 26 families based on transposase homology and overall organization, with some families divided further into groups (2). Due to their movement within and across genomes, ISs not only represent an important source of genetic vari- ation within genomes but also mediate horizontal gene transfer (HGT) among organisms and thus play a key role in genome evolution. Through transposition, ISs can interrupt the coding region of a gene, or disrupt promoter regions and alter gene expression. Given that there can be hundreds of copies of the same IS in a genome, they can also serve as sites of rearrangements *To whom correspondence should be addressed. Tel: +1 617 496 8103; Fax:+1 617 495 8848; Email: [email protected] The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors. Present addresses: David G. Robinson, Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA. Ming-Chun Lee, Department of Biochemistry, The University of Hong Kong, Pok Fu Lam, Hong Kong. Nucleic Acids Research, 2012, 1–11 doi:10.1093/nar/gks778 ß The Author(s) 2012. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. Nucleic Acids Research Advance Access published August 16, 2012 by guest on August 17, 2012 http://nar.oxfordjournals.org/ Downloaded from

Upload: others

Post on 06-Oct-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

OASIS: an automated program for globalinvestigation of bacterial and archaeal insertionsequencesDavid G. Robinson1, Ming-Chun Lee1 and Christopher J. Marx1,2,*

1Department of Organismic and Evolutionary Biology and 2Faculty of Arts and Sciences Center for SystemsBiology, Harvard University, Cambridge, MA 02138, USA

Received March 31, 2012; Revised July 6, 2012; Accepted July 24, 2012

ABSTRACT

Insertion sequences (ISs) are simple transposableelements present in most bacterial and archaealgenomes and play an important role in genomic evo-lution. The recent expansion of sequenced genomesoffers the opportunity to study ISs comprehensively,but this requires efficient and accurate tools for ISannotation. We have developed an open-sourceprogram called OASIS, or Optimized AnnotationSystem for Insertion Sequences, which automatic-ally annotates ISs within sequenced genomes.OASIS annotations of 1737 bacterial and archaealgenomes offered an unprecedented opportunity toexamine IS evolution. At a broad scale, we foundthat most IS families are quite widespread;however, they are not present randomly acrosstaxa. This may indicate differential loss, barriers toexchange and/or insufficient time to equilibrateacross clades. The number of ISs increases withgenome length, but there is both tremendous vari-ation and no increase in IS density for genomes>2 Mb. At the finer scale of recently divergedgenomes, the proportion of shared IS content fallssharply, suggesting loss and/or emergence ofbarriers to successful cross-infection occursrapidly. Surprisingly, even after controlling for 16SrRNA sequence divergence, the same ISs weremore likely to be shared between genomes labeledas the same species rather than as differentspecies.

INTRODUCTION

The ever-increasing number of sequenced bacterial andarchaeal genomes provides a valuable opportunity tounderstand genome architecture and evolution.However, as new high-throughput sequencing methodsare developed, genome annotation quickly becomes thebottleneck for genomic research. Despite the developmentof various annotation programs for particular genomicfeatures, some important features such as insertion se-quences (ISs), the smallest and simplest autonomousmobile genetic elements, remain poorly annotated.ISs are short regions of DNA, usually between 700 and

3000 bp long that can move or copy themselves within agenome through self-transposition. The majority of ISspossess one or two open reading frames (ORFs) thatencode a transposase. These ORFs are surrounded bylinker regions that frequently end with short-terminalinverted repeats (IRs) ranging from 7 to 20 bp in length.Upon insertion, ISs often generate short directed repeatsfrom 2 to 14 bp immediately outside the IRs (1). Despiteconsiderable sequence divergence, ISs can be classifiedinto 26 families based on transposase homology andoverall organization, with some families divided furtherinto groups (2).Due to their movement within and across genomes, ISs

not only represent an important source of genetic vari-ation within genomes but also mediate horizontal genetransfer (HGT) among organisms and thus play a keyrole in genome evolution. Through transposition, ISscan interrupt the coding region of a gene, or disruptpromoter regions and alter gene expression. Given thatthere can be hundreds of copies of the same IS in agenome, they can also serve as sites of rearrangements

*To whom correspondence should be addressed. Tel: +1 617 496 8103; Fax: +1 617 495 8848; Email: [email protected]

The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.

Present addresses:David G. Robinson, Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA.Ming-Chun Lee, Department of Biochemistry, The University of Hong Kong, Pok Fu Lam, Hong Kong.

Nucleic Acids Research, 2012, 1–11doi:10.1093/nar/gks778

� The Author(s) 2012. Published by Oxford University Press.This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Nucleic Acids Research Advance Access published August 16, 2012 by guest on A

ugust 17, 2012http://nar.oxfordjournals.org/

Dow

nloaded from

Page 2: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

such as deletions, duplications and inversions throughhomologous recombination. On a practical level, trans-position has been a classic genetic technique to generatemutant alleles, generally loss-of-function insertions intogene products. ISs are thus often regarded as ‘selfish’genomic parasites proliferating at the cost of their hostand surviving only through horizontal transfer (3). Incontrast, experimental evolution in the laboratory hasdemonstrated that both transpositions (4,5) and re-arrangements (6–10) can also generate beneficial muta-tions and commonly represent a large portion of themutations identified in genomes following a period ofadaptation (11,12).The most comprehensive resource for ISs is the ISfinder

database (13). ISfinder annotations have been submittedby users and manually verified by the curators. Therefore,we assumed this database (as of 17 September 2011) to bean accurate set of all ISs, but incomplete due to the factthat genomes are being sequenced faster than they areannotated to this extent. The most common practice ingenome annotation has been to stop at the point oflabeling ORFs as ‘transposase’ or ‘integrase’ where suffi-cient homology was observed. Without classification ofISs into families and enumeration within genomes,neither broad-scale studies across taxa nor dynamicswithin closely related strains are possible.Previous approaches to annotate ISs across a wide

range of genomes have been either involved internal pipe-lines, require manual annotation, or have identified veryfew elements. An annotation program was used for ananalysis of 19 cyanobacterial and 31 archaeal genomes,but this has yet to be made publicly available as an auto-mated pipeline (2). ISSaga is a web application pipelinethat allows semi-automated annotation based on BLASTagainst the ISfinder database (14). While ISSaga providesuseful tools both for recognizing transposase ORFs andfor manually curating their edges, it cannot automaticallyidentify novel ISs not already present in ISfinder. Due tothe requirements for manual annotation of novel elementsand individual submission of genomes online, ISSaga isimpractical for comprehensive evolutionary analysesinvolving a large number of sequenced genomes. To ourknowledge, the only publically available program that hasbeen developed to identify new ISs is IScan (15). Thissystem utilizes BLAST with a single reference transposasesequence per IS family to locate novel transposases. Aninvestigation of ISs in 438 prokaryotic genomes concludedthat, given the limited number of ISs identified by thisapproach in most taxa, most IS families are quitelimited in their phylogenetic distribution (16). Wholetaxa, such as the a-proteobacteria, had almost no identi-fiable ISs via this method, whereas manual annotation hasfound genomes of many of these organisms, for exampleMethylobacterium extorquens (17), are rather denselypopulated with ISs.To perform a global investigation of ISs in bacterial and

archaeal genomes, we developed OASIS, or OptimizedAnnotation System for Insertion Sequences, a computa-tional tool for automated annotation of ISs. OASIS takesadvantage of widely available transposase annotations toidentify candidate ISs and then uses a computationally

efficient maximum likelihood method of multiple sequencealignment to identify the edges of each element. Thanksto its speed and flexibility, OASIS is capable not onlyof providing detailed IS information for a single genomebut also of annotating thousands of genomes withinhours, making it a valuable high-throughput tool for aglobal investigation of IS distribution across diversetaxa. We applied OASIS to 1737 sequenced bacterialand archaeal genomes. Through comparisons across1319 genomes to a benchmark of ISfinder annotations,OASIS performed approximately an order of magnitudebetter than IScan. With a more comprehensive andaccurate overall picture of IS distribution acrossgenomes, we were able to address the pattern of ISs evo-lution at scales ranging from all sequenced Bacteria andArchaea to diversity between sets of extremely closelyrelated lineages. Broadly, it is clear that IS familiesare quite widespread; however, they are not presentrandomly across phyla suggesting either borders toexchange or insufficient time to equilibrate. ConsideringIS number and density with regard to genome length, wefind tremendous variation, and with the exception ofgenomes <2Mb which have considerably fewer ISs,there is no positive correlation of IS density and genomelength. Finally, we find that the proportion of shared IScontent between recently diverged genomes drops quitequickly. Quite surprisingly, even after 16S divergence istaken into account, strains labeled as the same speciesshare more IS elements than those termed as being differ-ent, suggesting that IS exchange/survival is correlated withcurrent taxonomy.

MATERIALS AND METHODS

OASIS algorithm

OASIS is a free open source program implemented inPython. It is available at https://github.com/dgrtwo/OASIS. The input data in this study consisted of the1737 curated microbial genomes available in NCBI as of17 September 2011. These genomes were downloadedfrom NCBI in GenBank format from the publicly avail-able server. OASIS uses a library of 3703 ISs from ISfinder(13), in the form of an amino acid FASTA file, which areused to identify the family and group of each IS. Thesetransposase sequences were automatically downloadedfrom the ISfinder database on 17 September 2011.

ISs may occur once in a genome or may consist of a setof almost identical copies (2). ISs in a particular genomecan therefore be classified into two major groups in termsof copy number: multicopy and single-copy ISs. As thereare distinct levels of information available in each of thesecases, different algorithms perform better with each class.As such, we have designed OASIS to find these two groupsof ISs in two separate steps: first finding multicopy ISs andthen single-copy ISs. The overall schematic pipeline isshown in Figure 1.

OASIS identifies multicopy ISs in each genome byfinding conserved regions surrounding already-annotatedtransposase genes, which are identified by the word‘transposase’ in the ‘product’ field of GenBank files.

2 Nucleic Acids Research, 2012

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 3: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

The choice to require prior annotation limits use onunannotated genomic data; however, as publishedcomplete genomes are nearly always annotated to thislevel of detail via other pipelines, we feel that the compu-tational speed gained makes this decision quite worthwhile.One additional issue is that transposases are occasionallymisannotated as integrases, a term more commonly foundfor prophage. In order to avoid false positives fromprophage, which are generally present at single copies ina genome, genes containing the word ‘integrase’ are con-sidered only when present in multiple copies. Groups oftransposase genes that compose multiple copies of an ISare identified by length and sequence similarity. TwoORFsare considered similar if the identity between the se-quences, assuming no gaps, is >95%, This thresholdallows for small differences between the transposase se-quences in multiple copies of the same ISs, though itshould be noted that the great majority of cases ofmultiple copies are within 2% of each other. Genes thatfit this similarity threshold are combined into the same ISset. OASIS then uses a maximum likelihood algorithm,described in Supplementary Methods, to determine theedges of multicopy ISs based on conservation betweentheir surrounding regions.

To define the edges of single-copy ISs, we use anapproach first developed by IScan to find IRs aroundthe transposases, which are present for the majority ofISs (15). Briefly, a Smith–Waterman alignment, with amatch score of 1, a mismatch penalty of �3 and a gappenalty of �4, is performed comparing the regionupstream of the transposase (500 bp) with the reverse com-plement of the downstream region (500 bp) and thehighest match with a score >10 is assumed to be thepair of terminal IRs.Multicopy ISs are also checked for IRs using a Smith–

Waterman alignment with the same parameters,comparing the regions within 100 bp of each edge. If theIRs disagree with the edges determined by the maximumlikelihood algorithm, the edges are changed to match theIRs. If none are found, the region immediately inside theIS is checked with a mismatch penalty of only �1, in caseOASIS had already identified the edges correctly.As a result, each full-length IS is composed of a

transposase, a protein of one or more ORFs, andupstream and downstream sequences defined as linkerregions, typically ranging from 0 to 500 bp. The extremeedges of the IS can include a partially conserved IR oneach end ranging from 8 to 20 bp in length.

Figure 1. Flowchart portraying the full workflow of OASIS.

Nucleic Acids Research, 2012 3

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 4: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Once groups of ISs are identified, BLASTN (NCBI) isused with one example from each set of ISs, selected basedon the presence of inferred IRs and the mode length of theISs, against the genome sequence to identify missing andpartial copies of the IS (for which there is often notransposase annotated). Thus, when present in multiplecopies OASIS finds partial ISs; it is not capable offinding these small IS fragments when no intact copywith an annotated transposase is present. RedundantBLAST results within a set are filtered out. OASIS thenuses hierarchical agglomerative clustering to identifygroups of IS lengths, clustering together groups whosemean lengths are closer than 100 bp apart. The modecluster is then assumed to be the true size of the IS andany fragments that are shorter than that threshold or600 bp are classified as partials. One intact ORF wasselected from each set of ISs and BLASTP was usedagainst the ISfinder database. If there were matches withan e-value <10�12, the IS set is classified according to thefamily and group of the best match, otherwise the IS set isclassified as ‘None’.The final output of OASIS includes two files for each

annotated genome: a file in GFF format of the ISs in eachgenome and each ISs characteristics, including thechromosome ID, start and end positions, direction,family and group, IRs (if found) and whether theelement is a partial element, and a file containing the nu-cleotide sequence of each identified IS and the amino acidsequence of each transposase in FASTA format.

Evaluation

The performance of OASIS was compared to IScan, usingISfinder database as the benchmark annotation. Eachgenome was then annotated using three methods:OASIS, IScan and a BLASTN against ISfinderdatabase. OASIS was performed using its defaultsettings and IScan annotations were obtained by runningIScan with its default settings on each genome. IScan usesBLAST with reference transposase sequences to findtransposases. In this run, we extracted the same referenceISs as in the original IScan analysis (16). Benchmark an-notations were obtained by mapping ISfinder sequencesonto genomes. BLASTN was used with ISfinder nucleo-tide sequences to search genomes in the same genus as thequery sequence’s origin. BLAST hits with identity <90%were removed, as were hits that were redundant (the startsand ends are within 100 bp of others in the same genome).Only 1311 of the genomes shared a genus with anyISfinder element and were included in the evaluationanalysis.Sensitivity and specificity were assessed by matching

elements between OASIS or IScan against the benchmarkdata set of ISfinder. Partial and truncated ISs are usuallynot annotated in ISfinder and thus were excluded from theevaluation analysis. As edges could be mis-annotatedeither by the BLAST from ISfinder or by one of the auto-mated annotation methods, any elements that overlappedwere considered matches, though the effect of requiringaccurate matches is shown in Supplementary Figure S1.

OASIS+ data set

While OASIS found two-thirds of the ISs obtained bymapping ISfinder elements back to genomes and foundnearly half as many new elements, there are many it didnot find (see ‘Discussion’ section). In order to make thesubsequent biological analysis as comprehensive aspossible, we combined the OASIS annotations with theISfinder annotations to form the OASIS+ data set. Toremove any redundancy between the annotation sets,any copy in ISfinder that overlapped a copy in OASISwas not included.

Phylogenetic analysis

Aligned 16S rRNA sequences were extracted fromRibosomal Database Project Release 10 on 14November 2011, by matching the genbank accessionnumber of each genome (18). For genomes that havemultiple annotated 16S sequences, the 16S sequencefrom each genome with the highest average similarity toall other 16S sequences was selected. In total, 1502 16Ssequences were extracted from RDP database and anextra 195 sequences were aligned first and added to theRDP alignment by ClustalW2 profile alignment (19).Fourteen sequences were deleted due to poor quality ofthe 16S sequences (e.g. multiple Ns) after manual verifica-tion. An overall phylogeny of the 1682 genomes was con-structed using Neighbor-joining method (20) anddisplayed by Interactive Tree of Life (21). The completetabulation of IS family content per genome is available inSupplementary Table S3.

We used SourceCluster, a pairwise distance test (22), totest the non-randomness of IS distribution across genomesfor each family. The sum of all pairwise distances between16S sequences of the genomes with at least one IS of thatfamily was calculated to represent the degree of clusteringeffect. To generate a corresponding null distribution foreach family, 1000 Monte Carlo simulations were per-formed by sampling the same number of pairs ofgenomes randomly and the position of the test statisticwithin the null distribution was used as the p-value.

IS content comparison for closely related genomes

To investigate shared IS content between closely relatedgenomes, we computed the numbers of matching ISsbetween each pair of genomes whose 16S divergence is<10%. Since the maximum 16S divergence within bacter-ial species is commonly cited to be 3%, we conservativelyexcluded genomes with 16S sequences that, on an average,diverged >3% with other genomes in their species toprevent cases of misannotation or poor alignments fromskewing our results (23). We used an alternative indexrather than the actual genomic position due to the lackof consistency in nucleotide coordinates between even re-cently diverged genomes. We considered ISs between twogenomes to match if their families are identical, theirlengths are within 200 bp of each other and if the ratioof the Levenshtein edit distance (the number of insertions,substitutions and deletions necessary to transform onestring into another (24) to the average length is <0.2.

4 Nucleic Acids Research, 2012

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 5: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

For each genome in the pair, we computed the number ofelements that match an element in the other genome. Wethen fit the following logistic model:

pi,j ¼1

1+e�zi,j

zi,j ¼ �0+�1 log10ðDi,jÞ+�2Si,j+BF

where pi,j is the probability an IS in one of the twogenomes i and j appears in the other, Di,j is the 16Sdivergence between the genomes, Si,j is whether the twogenomes are the same species and �F is a coefficientspecific to that IS family. We added 0.0001 to the diver-gence so that the logarithm could be taken, as the diver-gence can be zero. We performed logistic regression withthe number of shared and lost elements in each pair as thebinomial response variables. A higher coefficient indicatesthat a factor is associated with a higher probability of ISsharing between two genomes.

RESULTS AND DISCUSSION

OASIS performance

A total number of 41 821 copies representing 6829 uniqueIS sets were identified by OASIS in 1240 genomes out ofthe 1737 analyzed (Table 1 and Figure 2). Among those,16.4% of the unique IS sets are single-copy ISs. The re-maining sets belong to multicopy ISs and comprise 97.3%of the total copies, with the largest multicopy set in asingle genome containing 232 elements. OASIS took atotal of 9 h and 40min to annotate all 1737 genomes ona 4-core 2.8-Ghz processor, with a maximum per-genomerunning time of 6min.

To evaluate the performance of OASIS, we comparedthe program’s annotation to those of IScan by testing eachagainst benchmark annotations obtained from theISfinder database. Given that many genomes are not rep-resented in ISfinder, we considered sensitivity as a measureof annotation accuracy. Within the 1311 benchmarkedgenomes, OASIS found �66.0% of the 40 078 ISfinderannotations, whereas IScan found only 8.5%. OASISand IScan frequently identify ISs in ISfinder but disagreeabout the boundaries; OASIS annotated 42.3% of theISfinder annotations to within 10 bp of each edge, whileIScan found 3.1% at that error tolerance (the effect oferror tolerance on sensitivity is shown in SupplementaryFigure S1). The low sensitivity of IScan was not an artifactof our execution of the software, as the number and

distribution of ISs qualitatively matches the previouslyreported results (16,25). In addition, OASIS identified14 264 ISs that were not present in ISfinder, which isover half of the size of the database (Figure 3). Wemanually validated the novel ISs in a subset ofdata (1012 genomes, downloaded on February 2010) andfound that at least 85% of the 2625 unique IS typescontain real, intact transposes that had not submitted tothe ISfinder database, confirming the incompleteness ofISfinder database. Interestingly, despite the great differ-ence in numbers and sensitivity between OASIS andIScan, IScan found 1146 ISfinder elements that OASISdid not.The sensitivities of both OASIS and IScan depend very

strongly on the genus of the annotated genome.Supplementary Table S1 shows the individual sensitivitiesfor genera that include 10 or more genomes. IScan anno-tations were most sensitive in Escherichia genomes, finding25.1% of E. coli ISs, while finding no ISfinder elementsat all in several other large genera that contain ISs, suchas Streptococcus, Bacillus, Clostridium, Mycoplasma andMycobacterium. This specificity to particular taxa is likely

Figure 2. Venn diagram illustrating the identified number of ISs inISfinder, OASIS and IScan. OASIS found a total of 37 427 ISs(in copy numbers) in the 1319 benchmarked genomes while IScanonly found 2902 ISs, demonstrating a better performance of OASISover IScan. In addition to identifying 18 112 ISfinder elements,OASIS found 19 365 new ISs, indicating the advantage of OASIS infinding novel ISs.

Table 1. The size and characteristics of the four sets of IS annotations in this analysis

Annotation set Numberof copies

Numberof sets

Number ofgenomes

Number ofpartial elements

% of partialelements

Sensitivity(%)

Specificity(%)

OASIS 41 821 6829 1240 5655 13.5 66 54.7IScan 3252 428 257 335 10.3 8.5 77.5ISfinder BLAST 40 078 6012 932 10 584 26.4 – –OASIS+ 56 786 10 529 1347 11 022 19.4 – –

OASIS+ is the combination of the OASIS and ISfinder data sets. ISfinder and OASIS+ have no sensitivity or specificity since ISfinder is ourbenchmark and OASIS+ incorporates ISfinder.

Nucleic Acids Research, 2012 5

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 6: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Figure 3. IS distribution across bacterial and archaeal genomes. (A) A neighbor-joining tree of 16S rDNA from 1682 completely sequenced bacterialand archaeal genomes, colored by phylum (proteobacteria are separated by class). Length of colored bars outside the tree is proportional to thenumbers of ISs identified by OASIS+, colored by 26 IS families (ISNCY: not classified yet; None: no hit in ISfinder database). The OASIS+ data setconsists of the combined annotations of OASIS and a BLASTN from ISfinder (see ‘Materials and Methods’ section, complete data in SupplementaryTable S3). (B) A heatmap showing the non-random occurrence of each IS family across genomes. Genomes are ordered according to the 16S tree in(A). The final four columns indicate the distribution of ISs present in the various data sets we compare.

6 Nucleic Acids Research, 2012

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 7: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

an artifact of IScan for identifying transposases, whichdepends upon BLAST homology with selected referencesequences (mostly from E. coli and other Proteobacteria).In terms of the total number of annotations and sensitivityto ISfinder, we found OASIS has better performance thanIScan overall and also within every genus with more than10 genomes.

Both OASIS and IScan have a much lower rate of de-tecting single-copy ISs than multicopy ISs, particularly atannotating their edges accurately. They find 10.2 and3.9% of single-copy ISs, respectively, whereas at anerror tolerance of 10 bp, these fall to only 6.0 and 1.3%.OASIS outperforms IScan by using sequence conservationbetween multiple copies to identify the edge of theelement, which gives a much higher signal. Amongelements that overlapped a benchmark element, OASISmissed the edges of the benchmark by a median of 2 bp,while IScan missed the edges by a median of 51 bp.

Despite the significant improvement of OASIS on ISannotation, the major limitation of our program is thatit depends on the quality of NCBI annotation. Throughmanual inspection, we found many elements do not haveannotated transposases in the NCBI genomes, whichmakes OASIS incapable of identifying them as candidates.OASIS is thus limited by the efficacy of transposase an-notation and identification algorithms, which can be in-accurate. Furthermore, that the sensitivity of OASIS

increases as the error tolerance increases indicates thatOASIS finds some elements but misannotates theiredges. It is also possible in some cases that the edges areincorrectly annotated in the benchmark data set.

IS element abundance and distribution

For further analysis of broad-scale patterns of ISs acrossbacteria and archaea we combined our OASIS annota-tions of ISs with those already in ISfinder to generate acollection, ‘OASIS+’, of 10 529 ISs (due to multicopy ISs,56 786 elements in total) across 26 IS families (Table 2).ISs appeared in 77.5% of the genomes, indicating a globalIS distribution across archaeal and bacterial domains(Figure 3 and Supplementary Table S2). This resultconcurs with the prevailing view of ISs being widespreadbut absent from a moderate number of microbialgenomes (26); however, it is in contrast to the resultsof IScan (Supplementary Figure S2). The proportionof ISs identified across taxa varied substantially. Largelydue to the large number of sequenced genomes from thesegroups, >60% of the ISs we found are from Proteo-bacteria and �20% are from Firmicutes. For phyla thathave more than 10 genomes, the average number pergenome ranges from 15 to 50 except the Chlamydiae/Verrucomicrobia and the Epsilonproteobacteria groups,which have fewer than 3 ISs per genome on average.

Table 2. The size and prevalence of each family in OASIS+, sorted by the number of IS copies in each

Family Numberof copies

Numberof IS sets

Numberof genomes

Number ofgenera

Number ofclasses

Number ofphyla

Average copynumber

IS3 9630 1902 799 246 28 18 5.06IS5 6692 1073 521 213 35 20 6.24None 3951 742 483 237 40 22 5.32IS110 3535 829 488 177 26 16 4.26IS256 2890 512 337 131 25 16 5.64IS200/IS605 2770 499 334 107 25 13 5.55IS630 2721 424 252 126 23 11 6.42ISL3 2262 426 268 85 19 12 5.31IS1 2247 152 110 27 11 7 14.78IS4 2245 521 295 118 21 12 4.31IS21 2225 386 251 115 18 12 5.76IS481 1984 270 207 96 17 10 7.35IS30 1916 355 259 76 15 8 5.4IS66 1654 360 204 87 17 9 4.59IS1182 1564 292 224 95 16 9 5.36ISAs1 1143 259 136 47 12 7 4.41IS1380 1075 133 106 52 13 6 8.08IS982 907 133 100 45 16 11 6.82IS701 882 126 91 57 18 11 7IS1634 862 140 89 54 20 11 6.16IS6 725 219 177 60 18 12 3.31IS1595 674 162 117 52 11 7 4.16ISNCY 660 182 148 80 22 13 3.63IS607 498 178 94 42 18 12 2.8ISH3 491 62 25 10 4 3 7.92IS91 314 94 72 35 13 6 3.34Tn3 213 85 72 47 8 4 2.51ISAzo13 56 13 13 9 7 5 4.31OASIS+ 56 786 10 529 1347 448 50 26 5.39All – – 1737 563 58 30 –

Note that many ISs had families that could not be identified and were recorded as None. The OASIS+ row describes the total number of taxa thatcontain ISs, while the ‘All’ row represents the total number of taxa used in the analysis.

Nucleic Acids Research, 2012 7

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 8: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

In addition, our results show no ISs in the 13 genomes inBuchnera or the 12 genomes in Prochlorococcus, which isconsistent with previous reports (27). This indicates theabsence of ISs in reduced genomes, which is consistentwith previous observations (28–32).Given that certain phyla, particularly those with smaller

genomes, have few IS elements we directly explored theeffect of genome size on IS number and density in theOASIS+ data set. In a previous study of ISs in 262genomes, a positive correlation was found between IScontent and genome size, which they attributed to thelower density of essential genes and thus of deleteriousinsertion sites (26). In the OASIS+ data, we found apositive Spearman’s coefficient of 0.46 (p& 0) betweeneach genome’s length and its number of ISs, though thisis weaker than the correlation found earlier using fewergenomes (Figure 4). Much of this correlation is driven bythe rarity of ISs in small genomes: the correlation amonggenomes of length >2Mb drops to just 0.21. The previousstudy also discovered a correlation between IS density andgenome length. While we discovered a weak correlationbetween density and length (Spearman’s r=0.25, p=0),it was entirely due to the rarity of ISs in small genomes.The correlation drops to �0.040 when only genomes>2Mb are considered. This suggests that ISs have diffi-culty surviving in genomes <2Mb, but that the density ofISs permitted does not further increase for genomes>2MB.

Beyond considering IS prevalence as a whole, we alsodetermined that there is some degree of clade specificityand a non-random distribution for nearly every individualIS family (Figure 3B and Supplementary Table S3). ISfamilies differ greatly in both their frequency and diversityof hosts, appearing in as few as 13 genomes (ISAzo13) toas many as 799 genomes (IS3) (Table 2). Each IS familyalso differs in its copy number per IS set (unique IS type).For example, IS1 has an average of 14.8 copies per setwhile Tn3 (technically a transposon that has additionalpassenger genes, but it is treated as an IS family byISfinder) has an average of only 2.5 copies per set, sug-gesting different replicative transposition rates across dif-ferent types of ISs. Interestingly, several families werelimited to only a single domain. IS1380, Tn3, ISAs1 andIS30 were present only in bacterial genomes, and IS3,despite being the most abundant IS family, was absentin all archaeal genomes except two. On the other hand,ISH3 was found only in Archaea. To statistically examinethe clustering effects of IS families, we applied phylogen-etic clustering analysis to compare the average pairwisedistance of 16S sequences within each IS family and testit against a null distribution (see ‘Materials and Methods’section). Consistent with the above observations, theresults showed 22 out of 26 families are significantlyphylogenetically clustered (p< 0.05) (SupplementaryTable S4), indicating a non-random distribution of ISsacross genomes. This non-random distribution of ISs

Figure 4. Plot of genome length versus number of ISs per genome, (Spearman’s r=0.459, p& 0). A best-fit line from a linear regression is shown,but it captures relatively little of the variation present (slope 8.42, R2=0.074). Note that the y axis is restricted to 100 for clarity, though some pointslie above the plot’s range.

8 Nucleic Acids Research, 2012

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 9: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

across genomes might be due to either boundaries to suc-cessful horizontal transfer, distinct selective forces facedby different IS families in different taxa, or recent emer-gence and insufficient time to equilibrate across taxa.

IS element survival in closely related genomes

The large number of taxa in our OASIS+ data set thatrepresent closely related genomes (strains within thesame species or genus) allowed us to address the extentto which IS turnover is seen at a finer scale. Previousstudies have determined that IS counts can vary widelyeven between closely related genomes (26,33,34). Mostof these studies concluded that ISs are short lived innatural lineages and that rapid HGT is required forthem to persist. To investigate the short-term proliferationand survival of ISs, we compared the fraction of ISsshared in each pair of closely related genomes with their16S divergence. The results show that the probability of anIS being shared between genomes decreases dramaticallywith increasing 16S distance: an IS has a 35.1% chance ofbeing shared between two genomes that have identical 16Ssequences, while it has only a 0.13% chance of beingshared between two genomes with 9–10% 16S sequencedivergence. Also notably, the probability of an IS beingshared between two genomes in the same species is 24.7%,while two genomes of different species within 10% 16Sdivergence have only a 2.2% chance of being shared.Figure 5 compares the divergence between each genomicpair to the percentage of ISs shared between them, forboth intraspecies and interspecies pairs and shows thatprobability of sharing decreases very quickly withincreasing divergence. This rapid decrease in shared IScontent during the early divergence of genomes could be

due to a decline in the probability of vertical inheritance orin the chance of being acquired horizontally. Figure 5 alsoshows for a given degree of 16S sequence divergence, pairsof taxa that are defined as the same species have moresimilar IS content than if they are defined as differentspecies. As an example, the probability of an IS beingshared between two Escherichia genomes is 17.4% andthe probability of being shared between two Shigellagenomes is 41.1%, while the probability of being sharedbetween an Escherichia genome and a Shigella genome isonly 4.7%. Shigella is a polyphyletic genus completelywithin the Escherichia genus (35) (the average 16S diver-gence between the two genera is 1.6%), so the difference inIS content might reflect biological differences rather thanjust the time since divergence.In order to quantify the effect of these factors on the

probability of shared IS content, we performed logisticregression to predict the proportion of matched IScopies between two genomes, using the log of the 16Sdistance, IS family and whether a pair was within asingle species. Different IS families have very differentprobabilities of being shared between closely relatedgenomes (Supplementary Table S5). Families such asIS1595 and IS1380 were particularly likely to appear inmultiple closely related genomes, while families such asIS91, Tn3 and IS66 were unlikely to be shared. Whilethe log genomic distance was found to have a very signifi-cant effect, particularly important was whether the twogenomes were considered to be part of the same species(p& 0). The intraspecies factor had a logistic regressioncoefficient of 1.85±0.01, which can be interpreted as anodds ratio of e1.85=6.36, indicating that there is a strongeffect of being defined as part of the same species on top ofthe effect of the pairwise 16S distance. The wide range of

Figure 5. A plot of the probability of two genomes sharing an IS copy compared to their 16S distance, for: (A) intraspecies and (B) interspeciespairs. An average using a Nadaraya–Watson kernel smoother with a bandwidth of 1% is shown in red. Note that at all 16S sequence distances theintraspecies value remains at least 2-fold higher than interspecies.

Nucleic Acids Research, 2012 9

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 10: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

coefficients among IS families shows that family does havea large effect on the probability of being shared, suggest-ing that elements in different families have differentprobabilities of net gain or loss.Part of the difference causing certain IS families to be

shared across closely related genomes can be attributed totheir differing copy numbers, For example, the IS familywith the lowest average copy number, Tn3, has one ofthe lowest coefficients in the regression and there-fore the lowest probability of IS sharing. Also importantis the effect of sequencing bias, as a family’s presence inhighly sequenced taxa allows its shared presence in closelyrelated pairs to be observed. While the effects of distanceand species are controlled for in the logistic regression, thehighly sequenced taxa can still inflate the estimates. Forexample, IS1380 has one of the highest coefficients in theregression, but >80% of the pairs that share IS1380elements are within closely related genomes in theAcetobacter pasteurianus or Streptococcus pneumoniaspecies. If those species had not been highly sequencedthe tendency would not have been discovered. However,other families, such as IS256 and IS200/605, are found tohave shared pairs in a wider variety of taxa. OASIS mightalso be capable of annotating some families more accur-ately than others, as its accuracy is affected by theproperties such as the presence of IRs and the level ofconservation between copies. The various reasons thatIS families have different rates of being shared acrossclosely related genomes deserves further study.

CONCLUSIONS

As the sequencing technology progresses, the need foruser-friendly, high-throughput annotation systems con-tinues to grow. We developed OASIS, an automated an-notation system for ISs, which is capable not only ofproviding detailed IS information for a single genome,but also of annotating thousands of genomes withinhours. A tradeoff inherent to computationally efficientautomated annotation at this scale is that, althoughOASIS fares better than previous software platforms,some ISs present in the manually curated ISfinderdatabase were missed. In developing our algorithms weerred on the side of caution, trying to minimize false posi-tives so that these would not be further propagated.By applying OASIS for high-throughput IS annotation

across genomes, this study examined IS evolution at bothbroad and narrow phylogenetic scales. Looking acrossall taxa, we revealed a nearly global distribution of ISsacross both Bacteria and Archaea and a non-randomnessof IS family distribution across taxa. Interestingly, withthe fuller OASIS+data set it becomes clear that, althoughsmall genomes <2Mb have relatively few ISs, beyond thissize the density of ISs per genome size is relativelyconstant. The clearest finding of ISs versus genome size;however, is just how large the variation is for any givengenome size, in accord with the variance previously notedwithin a single species (31). At a finer scale of closelyrelated genomes, we found that the probability of an ISbeing present drops precipitously, indicating either rapid

loss and/or emergence of increasing barriers to successfulexchange across diverging lineages. We also found, to oursurprise, that for a given 16S divergence, strains con-sidered to be of the same species have greater IS contentsimilarity than strains that are named as different species.This perhaps suggests that our current bacterial taxonomymanages to capture some real differences, at least as far asthey relate to differential exchange and survival of ISsconditions and thresholds relevant for IS survival andhorizontal transfer. The availability of the OASISplatform will hopefully aid in future work to tease apartthe individual contributions of IS loss and exchange thatgive rise to these global patterns and to explore other bio-logical and evolutionary characteristics of ISs.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online:Supplementary Tables 1–5, Supplementary Figures 1 and2 and Supplementary Methods.

FUNDING

Taiwan Ministry of Education [Study Abroad Scholarshipto M.-C.L.]; Harvard University Microbial SciencesInitiative [fellowship to D.G.R.]; Harvard University[Harvard Merit Fellowship to M.-C.L. and set-up fundsto C.J.M.]. Funding for open access charge: HarvardUniversity.

Conflict of interest statement. None declared.

REFERENCES

1. Chandler,M. (2002) Insertion sequences revisited. Mobile DNA II.In: Craig,N.L., Craigie,R., Gellert,M. and Lambowitz,A.M. (eds),Mobile DNA II. American Society for Microbiology, WashingtonD.C., pp. 305–366.

2. Zhou,F. and Olman,V. (2008) Insertion Sequences show diverserecent activities in Cyanobacteria and Archaea. BMC Genomics,9, 36.

3. Schaack,S., Gilbert,C. and Feschotte,C. (2010) PromiscuousDNA: horizontal transfer of transposable elements and whyit matters for eukaryotic evolution. Trends Ecol. Evol., 25,537–546.

4. Chou,H.-H., Berthet,J. and Marx,C.J. (2009) Fast growthincreases the selective advantage of a mutation arising recurrentlyduring evolution under metal limitation. PLoS Genet., 5,e1000652.

5. Schneider,D., Duperchy,E. and Coursange,E. (2000) Long-termexperimental evolution in Escherichia coli. IX. Characterization ofinsertion sequence-mediated mutations and rearrangements.Genetics, 156, 477–488.

6. Chou,H.-H. and Marx,C.J. (2012) Optimization of geneexpression through divergent mutational paths. Cell Rep., 1,133–140.

7. Lee,M.-C. and Marx,C.J. (2012) Repeated, selection-drivengenome reduction of accessory genes in experimental populations.PLoS Genet., 8, e1002651.

8. Cooper,V.S., Schneider,D., Blot,M. and Lenski,R.E. (2001)Mechanisms causing rapid and parallel losses of ribose catabolismin evolving populations of Escherichia coli B. J. Bacteriol., 183,2834–2841.

9. Dunham,M.J., Badrane,H., Ferea,T., Adams,J., Brown,P.O.,Rosenzweig,F. and Botstein,D. (2002) Characteristic genome

10 Nucleic Acids Research, 2012

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 11: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

rearrangements in experimental evolution of Saccharomycescerevisiae. Proc. Natl Acad. Sci. USA, 99, 16144–16149.

10. Zhong,S., Khodursky,A., Dykhuizen,D.E. and Dean,A.M. (2004)Evolutionary genomics of ecological specialization. Proc. NatlAcad. Sci. USA, 101, 11719–11724.

11. Barrick,J.E., Yu,D.S., Yoon,S.H., Jeong,H., Oh,T.K.,Schneider,D., Lenski,R.E. and Kim,J.F. (2009) Genome evolutionand adaptation in a long-term experiment with Escherichia coli.Nature, 461, 1243–1247.

12. Chou,H.-H., Chiu,H.-C., Delaney,N.F., Segre,D. and Marx,C.J.(2011) Diminishing returns epistasis among beneficial mutationsdecelerates adaptation. Science, 332, 1190–1192.

13. Siguier,P., Perochon,J., Lestrade,L., Mahillon,J. and Chandler,M.(2006) ISfinder: the reference centre for bacterial insertionsequences. Nucleic Acids Res., 34, D32–D36.

14. Varani,A.M., Siguier,P., Gourbeyre,E., Charneau,V. andChandler,M. (2011) ISsaga is an ensemble of web-based methodsfor high throughput identification and semi-automatic annotationof insertion sequences in prokaryotic genomes. Genome Biol., 12,R30.

15. Wagner,A. and Lewis,C. (2007) A survey of bacterial insertionsequences using IScan. Nucleic Acids Res., 35, 5284–5293.

16. Wagner,A. (2008) Distant horizontal gene transfer is rare formultiple families of prokaryotic insertion sequences. Mol. Genet.Genomics, 280, 397–408.

17. Vuilleumier,S., Chistoserdova,L., Lee,M.-C., Bringel,F., Lajus,A.,Zhou,Y., Gourion,B., Barbe,V., Chang,J., Cruveiller,S. et al.(2009) Methylobacterium genome sequences: a reference blueprintto investigate microbial metabolism of C1 compounds fromnatural and industrial sources. PloS One, 4, e5584.

18. Cole,J.R., Wang,Q., Cardenas,E., Fish,J., Chai,B., Farris,R.J.,Kulam-Syed-Mohideen,A.S., McGarrell,D.M., Marsh,T.,Garrity,G.M. et al. (2009) The Ribosomal Database Project:improved alignments and new tools for rRNA analysis. NucleicAcids Res., 37, D141–D145.

19. Larkin,M., Blackshields,G. and Brown,N. (2007) Clustal W andClustal X version 2.0. Bioinformatics, 23, 2947–2948.

20. Saitou,N. and Nei,M. (1987) The neighbor-joining method: a newmethod for reconstructing phylogenetic trees. Mol. Biol. Evol., 4,406–425.

21. Letunic,I. and Bork,P. (2011) Interactive Tree Of Life v2: onlineannotation and display of phylogenetic trees made easy. NucleicAcids Res., 39, W475–W478.

22. Nightingale,K.K., Lyles,K., Ayodele,M., Jalan,P., Nielsen,R. andWiedmann,M. (2006) Novel method to identify source-associatedphylogenetic clustering shows that Listeria monocytogenes includesniche-adapted clonal groups with distinct ecological preferences.J. Clin. Microbiol., 44, 3742–3751.

23. Stackebrandt,E. (1994) Taxonomic note: a place for DNA-DNAreassociation and 16S rRNA sequence analysis in the presentspecies definition in bacteriology. Int. J. Syst. Bacteriol., 44,846–849.

24. Navarro,G. (2001) A guided tour to approximate string matching.ACM Comput. Surv., 33, 31–88.

25. Wagner,A. (2009) Transposable elements as genomic diseases.Mol. BioSystems, 5, 32–35.

26. Touchon,M. and Rocha,E.P.C. (2007) Causes of insertionsequences abundance in prokaryotic genomes. Mol. Biol. Evol.,24, 969–981.

27. Sakaki,Y., Shigenobu,S., Watanabe,H., Hattori,M. andIshikawa,H. (2000) Genome sequence of the endocellularbacterial symbiont of aphids: Buchnera sp. APS. Nature, 407,81–86.

28. Andersson,S.G. and Kurland,C.G. (1998) Reductive evolution ofresident genomes. Trends Microbiol., 6, 263–268.

29. Luo,H., Friedman,R., Tang,J. and Hughes,A.L. (2011) Genomereduction by deletion of paralogs in the marine cyanobacteriumProchlorococcus. Mol. Biol. Evol., 28, 2751–2760.

30. van Ham,R.C.H.J., Kamerbeek,J., Palacios,C., Rausell,C.,Abascal,F., Bastolla,U., Fernandez,J.M., Jimenez,L., Postigo,M.,Silva,F.J. et al. (2003) Reductive genome evolution in Buchneraaphidicola. Proc. Natl Acad. Sci. USA, 100, 581–586.

31. Moran,N. and Plague,G. (2004) Genomic changes following hostrestriction in bacteria. Curr. Opin. Genet. Dev., 14, 627–633.

32. Ochman,H. and Davalos,L.M. (2006) The nature and dynamicsof bacterial genomes. Science, 311, 1730–1733.

33. Wagner,A. (2006) Periodic extinctions of transposable elements inbacterial lineages: evidence from intragenomic variation inmultiple genomes. Mol. Biol. Evol., 23, 723–733.

34. Lawrence,J.G., Ochman,H. and Hartl,D.L. (1992) The evolutionof insertion sequences within enteric bacteria. Genetics, 131, 9–20.

35. Pupo,G.M., Lan,R. and Reeves,P.R. (2000) Multiple independentorigins of Shigella clones of Escherichia coli and convergentevolution of many of their characteristics. Proc. Natl Acad. Sci.USA, 97, 10567–10572.

Nucleic Acids Research, 2012 11

by guest on August 17, 2012

http://nar.oxfordjournals.org/D

ownloaded from

Page 12: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

1

Supplementary Table 1. Sensitivity of OASIS and IScan within each genus for all genera containing at

least 10 genomes.

Supplementary Table 2. The distribution of ISs in the OASIS+ dataset across phyla. Phyla were

determined by a search in the Entrez Taxonomy database, and sorted by the total number of genomes.

The sensitivity was determined against the ISfinder benchmark annotations.

Supplementary Table 3. The number of IS elements in each family in each genome, according to the

OASIS+ dataset; used to generate Figure 3B.

Supplementary Table 4. Results of a Monte Carlo test (see Methods) to detect phylogenetic clustering in

each family.

Supplementary Table 5. Logistic regression coefficient estimates and p-values for the effect of each

family, the log of the genomic distance, and whether two genomes are of the same species, sorted by the

coefficient size.

Supplementary Figure 1. The sensitivity of OASIS and IScan as a function of different error tolerances.

Supplementary Figure 2. 16S tree showing the distribution of IScan annotated ISs in 1 682 genomes.

This tree is analogous to Figure 3A, which shows the distribution of the OASIS+ dataset. Note that while

IScan identifies ISs in several genera, including Escherichia, Shigella, and Xanthomonas, it identifies

none or almost none in the vast majority of taxa, which is contradicted by the OASIS+ dataset.

Page 13: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Supplementary  Table  1

Genus Number  of  genomes OASIS IScan ISfinder OASIS  Sensitivity IScan  SensitivityStreptococcus 74 1211 2 1190 0.551260504 0Escherichia 48 1529 582 2603 0.67038033 0.251248559Bacillus 44 605 1 481 0.467775468 0Clostridium 44 615 0 397 0.901763224 0Lactobacillus 38 1553 31 562 0.800711744 0.053380783Mycoplasma 37 429 1 467 0.957173448 0Helicobacter 36 28 1 29 0.24137931 0Staphylococcus 34 470 64 478 0.80125523 0.131799163Mycobacterium 29 1068 0 1207 0.755592378 0Burkholderia 29 1478 91 1568 0.888392857 0.030612245Pseudomonas 24 543 70 459 0.79956427 0.150326797Shewanella 24 952 43 1069 0.857811038 0.03554724Bifidobacterium 23 255 0 194 0.62371134 0Salmonella 23 238 47 282 0.656028369 0Yersinia 19 1788 26 1784 0.938340807 0Neisseria 18 481 0 685 0.750364964 0Corynebacterium 18 195 4 108 0.305555556 0Vibrio 16 128 36 176 0.568181818 0.068181818Rickettsia 15 171 0 46 0.956521739 0Listeria 15 5 0 17 0.117647059 0Haemophilus 14 36 1 26 0.576923077 0Campylobacter 14 7 0 0 -­‐ -­‐Sulfolobus 13 653 0 667 0.676161919 0Francisella 13 469 0 620 0.648387097 0Brucella 13 203 0 239 0.790794979 0Acinetobacter 11 474 2 431 0.974477958 0Xanthomonas 10 913 174 1457 0.569663693 0.182566918

Page 14: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Supplemental  Table  2

Phylum Total  genome  number Genomes  with  ISs IS  Sets Total  IS  Number Median  IS  Number Minimum  IS  Number Maximum  IS  NumberFirmicutes 378 334 2346 11223 20 0 219Gammaproteobacteria 365 314 2797 18055 21 0 811Alphaproteobacteria 166 131 1223 6136 14 0 238Actinobacteria 164 130 1223 4687 14 0 373Betaproteobacteria 113 101 979 5600 35 0 483Euryarchaeota 75 47 380 1686 5 0 184Bacteroidetes/Chlorobi  group 72 54 282 1285 7.5 0 120Epsilonproteobacteria 62 31 44 183 0.5 0 24Tenericutes 46 28 91 829 5 0 182Deltaproteobacteria 45 40 266 1075 17 0 80Cyanobacteria 42 21 260 2067 5.5 0 648Chlamydiae/Verrucomicrobia  group 41 8 28 120 0 0 48Crenarchaeota 37 17 204 1183 0 0 238Spirochaetes 37 21 69 660 4 0 139Chloroflexi 15 14 50 346 13 0 114Deinococcus-­‐Thermus 14 13 99 540 27.5 0 181Thermotogae 12 9 33 180 5 0 60Aquificae 10 6 17 156 3 0 68Fibrobacteres/Acidobacteria  group 7 5 29 156 10 0 62Fusobacteria 5 4 15 76 16 0 35Planctomycetes 5 3 9 53 3 0 37Deferribacteres 4 4 34 266 52 21 141Other 3 2 2 2 1 0 1Synergistetes 3 1 3 3 0 0 3Dictyoglomi 2 1 1 2 1 0 2Elusimicrobia 2 1 1 1 0.5 0 1Nitrospirae 2 2 7 15 7.5 3 12Thermodesulfobacteria 2 1 4 22 11 0 22Chrysiogenetes 1 1 4 30 30 30 30Gemmatimonadetes 1 1 6 12 12 12 12Korarchaeota 1 0 0 0 0 0 0Nanoarchaeota 1 0 0 0 0 0 0Thaumarchaeota 1 0 0 0 0 0 0unclassified  Archaea 1 1 14 80 80 80 80unclassified  Bacteria 1 0 0 0 0 0 0unclassified  Proteobacteria 1 1 9 57 57 57 57Total 1737 1347 10529 56786 -­‐ -­‐ -­‐

Page 15: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Supplemental  Table  3Species IS1182 ISH3 IS1595 ISL3 IS1634 IS1380 IS200/IS605 IS630 ISNCY Tn3 ISAs1 IS30 ISAzo13 IS481 None IS701 IS256 IS21 IS91 IS607 IS982 IS3 IS1 IS110 IS6 IS5 IS4 IS66Acaryochloris_marina_MBIC11017_uid12997 0 0 0 32 0 1 0 63 0 5 4 0 0 0 4 27 8 3 0 0 0 5 28 0 1 45 23 0Acetobacter_pasteurianus_IFO_3283_01_42C_uid31141 0 0 0 0 0 55 0 0 2 0 0 0 0 3 0 18 0 2 0 0 0 28 0 1 0 32 0 0Acetobacter_pasteurianus_IFO_3283_01_uid31129 0 0 0 0 0 61 0 0 2 0 0 0 0 4 0 19 0 2 0 0 0 29 0 1 0 34 0 1Acetobacter_pasteurianus_IFO_3283_03_uid31131 0 0 0 0 0 62 0 0 2 0 0 0 0 5 0 17 0 2 0 0 0 29 0 1 0 34 0 1Acetobacter_pasteurianus_IFO_3283_07_uid31133 0 0 0 0 0 61 0 0 2 0 0 0 0 4 0 18 0 2 0 0 0 29 0 1 0 34 0 1Acetobacter_pasteurianus_IFO_3283_12_uid32203 0 0 0 0 0 61 0 0 2 0 0 0 0 4 0 17 0 2 0 0 0 29 0 1 0 34 0 1Acetobacter_pasteurianus_IFO_3283_22_uid31135 0 0 0 0 0 62 0 0 2 0 0 0 0 5 0 17 0 2 0 0 0 29 0 1 0 34 0 1Acetobacter_pasteurianus_IFO_3283_26_uid31137 0 0 0 0 0 62 0 0 2 0 0 0 0 5 0 17 0 2 0 0 0 29 0 1 0 34 0 1Acetobacter_pasteurianus_IFO_3283_32_uid31139 0 0 0 0 0 61 0 0 2 0 0 0 0 4 0 17 0 2 0 0 0 29 0 1 0 34 0 1Acetohalobium_arabaticum_DSM_5501_uid32769 0 0 0 0 0 0 0 0 1 0 0 0 0 0 6 0 0 0 0 14 0 0 0 0 0 0 2 0Acholeplasma_laidlawii_PG_8A_uid19259 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Achromobacter_xylosoxidans_A8_uid48393 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 2 0 0 0 2 0 0Acidaminococcus_fermentans_DSM_20731_uid33685 3 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0Acidianus_hospitalis_W1_uid60875 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Acidilobus_saccharovorans_345_15_uid39927 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Acidimicrobium_ferrooxidans_DSM_10331_uid29525 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 17 0 0 0 0 0 0Acidiphilium_cryptum_JF-­‐5_uid15753 0 0 4 7 2 3 0 0 0 3 1 0 0 0 0 2 15 8 0 0 0 0 0 3 4 3 0 0Acidiphilium_multivorum_uid60101 0 0 23 0 5 4 0 0 0 4 11 0 0 0 0 0 3 6 2 0 0 2 0 8 1 14 6 0Acidithiobacillus_caldus_SM_1_uid63223 1 0 0 41 0 0 4 5 0 0 0 0 0 2 0 0 0 14 0 0 0 0 0 0 0 25 0 8Acidithiobacillus_ferrivorans_SS3_uid61509 0 0 0 5 7 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 7 0 4 0 0 0 1Acidithiobacillus_ferrooxidans_ATCC_23270_uid53 0 0 0 0 0 0 1 0 0 0 0 0 0 0 3 0 1 2 0 2 0 2 0 10 0 0 0 4Acidithiobacillus_ferrooxidans_ATCC_53993_uid16689 0 0 0 0 0 0 0 0 0 0 0 0 0 3 24 0 0 5 0 0 0 2 0 3 0 0 0 5Acidobacterium_MP5ACTX9_uid47621 0 0 0 9 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 5 0 9 0 26 0 0Acidobacterium_capsulatum_ATCC_51196_uid28085 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 0 0 2 0 0 0 0 0 2 0 4 0 0Acidothermus_cellulolyticus_11B_uid16097 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Acidovorax_JS42_uid15685 0 0 4 7 0 0 0 17 0 6 0 0 0 0 6 1 5 8 0 0 0 1 0 19 0 26 0 3Acidovorax_avenae_ATCC_19860_uid37867 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 23 0 0Acidovorax_avenae_citrulli_AAC00-­‐1_uid15708 0 0 0 0 0 0 0 0 0 11 0 0 0 19 0 0 0 0 0 0 0 29 0 10 0 7 0 0Aciduliprofundum_boonei_T469_uid38403 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0Acinetobacter_DR1_uid46105 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Acinetobacter_baumannii_1656_2_uid42153 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0 2 1 0 0 24 18 0Acinetobacter_baumannii_AB0057_uid21111 0 0 0 2 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 3 2 7 0Acinetobacter_baumannii_AB307_0294_uid30993 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Acinetobacter_baumannii_ACICU_uid17827 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 4 2 0 0 0 0 0Acinetobacter_baumannii_ATCC_17978_uid17477 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 2 0 0 0 2 1 0Acinetobacter_baumannii_AYE_uid28921 0 0 0 2 0 0 0 0 0 0 0 0 0 0 3 0 0 0 2 0 0 1 0 0 3 1 24 0Aeropyrum_pernix_uid211 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Acinetobacter_baumannii_SDF_uid13001 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 165 0 0 0 0 176 0 0Acinetobacter_baumannii_TCDC_AB0715_uid62279 0 0 0 0 0 0 0 0 0 0 0 2 0 0 4 0 0 0 0 0 0 0 0 0 0 1 0 0Acinetobacter_calcoaceticus_PHEA_2_uid51267 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Acinetobacter_sp_ADP1_uid12352 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 6 0 0 0 0 0 0Actinobacillus_pleuropneumoniae_L20_uid18221 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 9 0 0 0 0 0 0Actinobacillus_pleuropneumoniae_serovar_3_JL03_uid19135 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0Actinobacillus_pleuropneumoniae_serovar_7_AP76_uid29909 0 0 0 0 0 0 0 0 0 0 0 6 0 3 3 0 0 0 0 0 0 5 0 0 0 0 0 0Actinobacillus_succinogenes_130Z_uid13370 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Actinosynnema_mirum_DSM_43827_uid19705 1 0 0 0 0 0 0 2 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 8 0 0Aerococcus_urinae_ACS_120_V_Col10a_uid51073 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 26 0 4 0 0 0 0Aeromonas_hydrophila_ATCC_7966_uid16697 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Aeromonas_salmonicida_A449_uid16723 0 0 0 0 0 0 0 33 0 0 2 5 0 0 0 0 4 8 0 0 0 25 2 0 0 0 0 0Aeromonas_veronii_B565_uid63671 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 3 0 3 0 0Aggregatibacter_actinomycetemcomitans_D11S_1_uid40107 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0Aggregatibacter_aphrophilus_NJ8700_uid36579 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Agrobacterium_H13_3_uid50341 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 1 0 0 0 1 0 3 0 0 0 0Agrobacterium_radiobacter_K84_uid13402 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 2 0 1 0 4Agrobacterium_tumefaciens_C58_Cereon_uid283 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 1 0 1 0 0Agrobacterium_vitis_S4_uid13372 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 5 0 0 0 7 0 3 10 11 0 13Akkermansia_muciniphila_ATCC_BAA_835_uid20089 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 16: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Alcanivorax_borkumensis_SK2_uid13005 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Alicycliphilus_denitrificans_BC_uid41663 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 6 0 0 0 16 0 0 0 8 0 4Alicycliphilus_denitrificans_K601_uid50751 0 0 0 0 0 0 0 5 0 0 0 0 0 0 1 0 0 10 0 0 0 18 0 0 0 1 3 8Alicyclobacillus_acidocaldarius_DSM_446_uid29405 4 0 0 0 13 0 0 0 2 0 0 0 0 0 0 0 10 0 0 3 0 0 0 0 0 0 0 0Aliivibrio_salmonicida_LFI1238_uid30703 0 0 0 0 0 0 0 27 1 1 4 9 0 0 0 0 0 0 47 0 50 1 0 0 1 0 1 77Alistipes_shahii_WAL_8301_uid45913 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Alkalilimnicola_ehrlichei_MLHE-­‐1_uid15763 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 2 0 0 0 0 0 2Alkaliphilus_metalliredigens_QYMF_uid13006 2 0 0 0 13 0 3 1 8 0 0 0 0 0 3 0 0 10 3 0 0 0 0 0 0 0 4 0Alkaliphilus_oremlandii_OhILAs_uid16083 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0Allochromatium_vinosum_DSM_180_uid32547 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 8 0Alteromonas_SN2_uid59947 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 3 0 0 0 6 1 0Alteromonas_macleodii__Deep_ecotype__uid13374 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 0 0 0 2 0 0 9 0 9 0 4 0 4Aminobacterium_colombiense_DSM_12261_uid32587 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ammonifex_degensii_KC4_uid12390 3 0 0 0 0 0 9 0 0 0 0 0 0 19 28 0 0 0 0 3 0 0 0 0 0 0 0 0Amycolatopsis_mediterranei_S699_uid43503 0 0 0 0 0 0 0 3 0 0 0 1 0 0 6 2 0 0 0 0 0 10 0 0 0 6 2 1Amycolatopsis_mediterranei_U32_uid46889 0 0 0 0 0 0 0 3 0 0 4 1 0 0 1 2 0 0 0 0 0 10 0 0 0 6 3 1Amycolicicoccus_subflavus_DQS3_9A1_uid63351 0 0 0 0 0 0 0 0 0 0 0 3 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Anabaena_variabilis_ATCC_29413_uid10642 0 0 0 0 1 0 0 13 0 0 0 0 0 0 11 0 0 0 0 0 0 0 0 0 0 28 3 9Anaeromyxobacter_K_uid19743 8 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 2 0 0 0 1 0 0 0 0 1 3Anaerocellum_thermophilum_DSM_6725_uid29407 0 0 0 0 0 0 0 0 0 0 0 0 0 0 12 0 0 4 0 0 0 0 0 0 0 0 0 0Anaerococcus_prevotii_DSM_20548_uid29533 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0Anaerolinea_thermophila_UNI_1_uid60099 0 0 0 0 0 0 0 0 0 0 0 0 0 0 17 0 0 0 0 0 0 0 0 0 0 24 0 0Anaeromyxobacter_Fw109-­‐5_uid17729 0 0 0 0 0 0 0 1 0 0 5 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 2 1Anaeromyxobacter_dehalogenans_2CP-­‐C_uid12634 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Anaeromyxobacter_dehalogenans_2CP_1_uid20095 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 3 0 0 0 2 0 0 0 0 0 2Anaplasma_centrale_Israel_uid32765 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Anaplasma_marginale_Florida_uid16369 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Anaplasma_marginale_St_Maries_uid40 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Anaplasma_phagocytophilum_HZ_uid336 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Anoxybacillus_flavithermus_WK1_uid28245 6 0 0 0 1 0 0 9 0 0 0 0 0 0 9 4 0 0 0 10 4 0 0 2 3 0 0 8Aquifex_aeolicus_uid215 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0Arcanobacterium_haemolyticum_DSM_20595_uid37925 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 5 0 0 0 0Archaeoglobus_fulgidus_uid104 0 0 0 0 0 0 0 2 5 0 0 0 0 8 0 0 0 0 0 0 0 0 0 0 0 1 0 0Archaeoglobus_profundus_DSM_5631_uid32583 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Archaeoglobus_veneficus_SNP6_uid51621 0 0 0 0 0 0 0 0 0 0 0 0 0 5 2 0 0 0 0 0 0 0 0 2 0 3 0 0Arcobacter_L_uid61721 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Arcobacter_butzleri_ED_1_uid61719 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Arcobacter_butzleri_RM4018_uid16319 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Arcobacter_nitrofigilis_DSM_7299_uid32593 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Aromatoleum_aromaticum_EbN1_uid13242 7 0 1 0 3 0 3 3 0 0 9 0 0 0 4 2 1 32 2 0 0 4 0 5 0 12 8 10Arthrobacter_FB24_uid12640 0 0 0 0 0 0 0 0 0 3 0 0 0 4 0 0 1 0 0 0 0 0 0 5 0 0 0 0Arthrobacter_arilaitensis_Re117_uid50353 0 0 5 47 0 3 0 1 0 0 0 0 0 11 0 0 4 4 0 0 0 17 0 4 0 3 2 0Arthrobacter_aurescens_TC1_uid12512 0 0 0 0 0 0 0 0 1 2 0 0 0 0 0 0 0 1 0 0 0 11 0 2 0 1 0 0Arthrobacter_chlorophenolicus_A6_uid20011 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Arthrobacter_phenanthrenivorans_Sphe3_uid20087 0 0 0 2 0 0 0 0 0 0 0 0 0 0 2 0 1 5 0 0 0 10 0 17 0 1 0 0Arthrospira_platensis_NIES_39_uid42161 0 0 0 0 0 0 0 54 13 0 0 0 0 0 1 0 0 0 0 12 0 0 3 0 0 0 0 0Aster_yellows_witches-­‐broom_phytoplasma_AYWB_uid13478 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Asticcacaulis_excentricus_CB_48_uid32595 0 0 1 0 0 3 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 2 0 4 0 0 1 7Atopobium_parvulum_DSM_20469_uid29401 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Azoarcus_BH72_uid13217 0 0 0 0 0 0 0 0 0 0 2 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Azorhizobium_caulinodans_ORS_571_uid19267 0 0 1 0 0 0 0 2 0 0 0 0 0 5 1 0 1 0 0 0 0 4 0 0 0 4 0 1Azospirillum_B510_uid32551 0 0 0 3 0 27 0 11 2 1 23 1 0 1 2 8 5 21 0 0 0 20 0 5 0 35 0 21Azotobacter_vinelandii_DJ_uid16 0 0 2 0 0 0 4 18 0 0 0 0 0 7 0 4 0 4 0 0 0 2 0 6 0 14 8 0Bacillus_amyloliquefaciens_DSM7_uid41719 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_amyloliquefaciens_FZB42_uid13403 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Bacillus_amyloliquefaciens_LL3_uid64659 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_amyloliquefaciens_TA208_uid64581 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_amyloliquefaciens_XH7_uid67079 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_anthracis_A0248_uid33543 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 12 0Bacillus_anthracis_Ames_0581_uid10784 1 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 1 0 0 0 0 0 0 15 0Bacillus_anthracis_Ames_uid309 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 1 0 0 0 0 0 0 2 0

Page 17: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Bacillus_anthracis_CDC_684_uid31329 1 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 1 0 0 0 0 11 0Bacillus_anthracis_CI_uid36309 11 0 0 0 0 0 1 0 0 0 0 0 0 0 9 0 0 0 0 1 0 9 0 0 0 0 5 0Bacillus_anthracis_str_Sterne_uid10878 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 1 0 0 0 0 0 0 2 0Bacillus_atrophaeus_1942_uid46075 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_cellulosilyticus_DSM_2522_uid38423 7 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 5 0 0 1 0Bacillus_cereus_03BB102_uid31307 1 0 0 0 0 0 2 0 0 0 0 0 0 0 7 0 0 0 0 1 0 7 0 2 0 0 4 0Bacillus_cereus_AH187_uid17715 0 0 0 0 0 0 1 0 0 2 0 0 0 0 3 0 0 1 0 0 0 11 0 0 0 0 13 0Bacillus_cereus_AH820_uid17711 0 0 0 0 0 0 1 0 0 0 0 0 0 0 4 0 0 0 0 0 0 6 0 0 0 0 8 0Bacillus_cereus_ATCC14579_uid384 0 0 1 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 3 0Bacillus_cereus_ATCC_10987_uid74 1 0 0 0 0 0 1 0 0 0 0 0 0 0 11 0 0 0 0 2 0 0 0 0 1 0 7 0Bacillus_cereus_B4264_uid17731 0 0 1 0 0 0 2 0 0 0 0 0 0 0 1 0 0 0 0 0 0 7 0 0 0 0 2 0Bacillus_cereus_G9842_uid17733 0 0 0 0 0 0 1 0 0 0 0 0 0 0 5 0 0 0 0 0 0 3 0 0 0 0 4 0Bacillus_cereus_Q1_uid16220 0 0 0 0 0 0 4 0 0 1 0 0 0 0 5 0 0 6 0 1 0 8 0 0 0 0 3 0Bacillus_cereus_ZK_uid12468 0 0 1 0 0 0 2 0 0 0 0 0 0 0 13 0 0 0 0 4 0 3 0 0 0 0 16 0Bacillus_cereus_cytotoxis_NVH_391-­‐98_uid13624 2 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 6 0 0 0 0 6 0Bacillus_clausii_KSM-­‐K16_uid13291 1 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0Bacillus_coagulans_2_6_uid61501 5 0 0 0 2 9 0 0 0 0 0 0 0 0 16 0 0 0 0 0 0 0 0 5 0 9 1 0Bacillus_halodurans_uid235 10 0 0 41 0 1 8 1 0 0 0 3 0 0 0 0 8 1 0 0 0 5 0 8 0 0 1 0Bacillus_licheniformis_ATCC_14580_uid12388 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0Bacillus_licheniformis_DSM_13_uid13082 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_megaterium_DSM319_uid42425 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_megaterium_QM_B1551_uid30165 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 2 0 2 0 0 0 0Bacillus_pseudofirmus_OF4_uid28811 3 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 5 0 0 0 7 0 6 0 0 5 2Bacillus_pumilus_SAFR-­‐032_uid20391 13 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_selenitireducens_MLS10_uid13376 1 0 1 2 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 4 0 8 0 0 0 0 0 0Bacillus_subtilis_BSn5_uid61191 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_subtilis_natto_BEST195_uid38027 0 0 0 0 0 0 0 0 0 0 0 0 0 0 12 0 8 3 0 0 0 14 0 0 0 0 5 0Bacillus_subtilis_spizizenii_W23_uid38713 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0Bacillus_subtilis_uid76 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacillus_thuringiensis_Al_Hakam_uid18255 0 0 0 0 0 0 1 0 0 0 0 0 0 0 4 0 0 0 0 1 0 5 0 0 0 0 2 0Bacillus_thuringiensis_BMB171_uid43631 0 0 1 0 0 0 4 0 0 0 0 0 0 0 2 0 0 0 0 0 0 3 0 0 0 0 10 0Bacillus_thuringiensis_konkukian_uid10877 13 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 7 0 0 0 0 4 0Bacillus_thuringiensis_serovar_chinensis_CT_43_uid43737 1 0 0 0 0 0 7 0 0 0 0 0 0 0 10 0 0 11 0 1 0 8 0 10 0 0 15 2Bacillus_thuringiensis_serovar_finitimus_YBT_020_uid60447 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 4 0 0 1 1 0 0 15 1 0 6 0Bacillus_tusciae_DSM_2912_uid31345 12 0 0 0 2 1 7 0 0 0 0 0 0 0 3 0 0 0 2 9 0 4 0 11 0 0 0 0Bacillus_weihenstephanensis_KBAB4_uid13623 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 1 0Bacteriovorax_marinus_SJ_uid50431 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacteroides_fragilis_638R_uid50405 2 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 1 0 0 0 0 0 0 0 0 0 0Bacteroides_fragilis_NCTC_9434_uid46 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0Bacteroides_fragilis_YCH46_uid13067 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 4Bacteroides_helcogenes_P_36_108_uid41913 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bacteroides_salanitronis_DSM_18170_uid40055 0 0 0 0 0 4 0 0 0 0 3 0 0 0 0 0 1 0 0 0 0 6 0 0 0 3 3 7Bacteroides_thetaiotaomicron_VPI-­‐5482_uid399 1 0 0 0 0 0 0 0 0 0 3 0 0 0 7 0 3 0 0 0 0 15 0 7 0 0 3 2Bacteroides_vulgatus_ATCC_8482_uid13378 10 0 1 8 0 2 0 0 0 0 0 0 0 0 1 0 0 0 0 0 6 0 0 0 0 19 0 9Bacteroides_xylanisolvens_XB1A_uid39177 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 2Bartonella_bacilliformis_KC583_uid16249 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bartonella_clarridgeiae_73_uid42109 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bartonella_grahamii_as4aup_uid34873 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0Bartonella_henselae_Houston-­‐1_uid196 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0Bartonella_quintana_Toulouse_uid44 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Bartonella_tribocorum_CIP_105476_uid28109 0 0 0 0 0 0 0 0 0 0 0 0 0 0 18 0 0 0 0 0 0 0 0 0 0 0 0 0Baumannia_cicadellinicola_Homalodisca_coagulata_uid12513 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bdellovibrio_bacteriovorus_uid9637 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Beijerinckia_indica_ATCC_9039_uid20841 3 0 0 1 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0Beutenbergia_cavernae_DSM_12333_uid20827 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_adolescentis_ATCC_15703_uid16321 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 2 0 0 0 0 0 0Bifidobacterium_animalis_lactis_AD011_uid19423 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_animalis_lactis_BB_12_uid42883 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_animalis_lactis_BLC1_uid71815 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_animalis_lactis_Bl_04_uid32897 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_animalis_lactis_CNCM_I_2494_uid67865 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0

Page 18: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Bifidobacterium_animalis_lactis_DSM_10140_uid32893 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_animalis_lactis_V9_uid32515 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_bifidum_PRL2010_uid42863 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 1 0 0 0 0 2 0 0 0 0 0 0Bifidobacterium_bifidum_S17_uid51963 0 0 0 0 0 0 0 0 0 0 0 2 0 0 2 0 2 0 0 0 0 2 0 0 0 0 0 0Bifidobacterium_breve_ACS_071_V_Sch8b_uid51077 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bifidobacterium_breve_UCC2003_uid13487 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 2 18 0 0 0 0 0 0 0 0 0 0Bifidobacterium_dentium_Bd1_uid17583 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Bifidobacterium_longum_BBMN68_uid53143 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 2 5 0 0 0 0 0 0 0 0 0 0Bifidobacterium_longum_DJO10A_uid18773 0 0 0 0 0 0 0 0 0 0 0 5 0 0 4 0 2 9 0 1 0 3 0 0 0 0 0 0Bifidobacterium_longum_F8_uid45963 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 2 0 0 0 0 0 0Bifidobacterium_longum_JCM_1217_uid32047 0 0 0 0 0 0 0 0 0 0 0 8 0 0 4 0 1 7 0 1 0 2 0 0 0 0 0 0Bifidobacterium_longum_JDM301_uid47579 0 0 0 2 0 0 0 0 2 0 0 1 0 0 3 0 11 3 0 0 0 10 0 0 0 0 0 0Bifidobacterium_longum_KACC_91563_uid66401 0 0 0 1 0 0 1 0 3 0 0 6 0 0 3 0 8 5 0 0 0 2 0 0 0 0 0 0Bifidobacterium_longum_infantis_157F_uid32051 0 0 0 2 0 0 0 0 0 0 0 18 0 0 2 0 1 10 0 0 0 4 0 0 0 0 0 0Bifidobacterium_longum_infantis_ATCC_15697_uid17189 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 3 0 0 1 0 8 0 7 0 0 0 0Bifidobacterium_longum_infantis_ATCC_15697_uid32049 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 4 0 0 1 0 8 0 7 0 1 0 0Bifidobacterium_longum_uid328 0 0 0 2 0 0 0 0 0 0 0 3 0 0 3 0 5 5 0 1 0 5 0 0 0 0 0 0Blattabacterium__Blattella_germanica__Bge_uid31103 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Blattabacterium__Periplaneta_americana__BPLAN_uid32975 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bordetella_avium_197N_uid27 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bordetella_bronchiseptica_uid24 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bordetella_parapertussis_uid25 0 0 0 22 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bordetella_pertussis_CS_uid66287 0 0 0 0 0 0 0 0 0 0 0 0 0 233 0 0 0 0 0 0 0 0 0 18 0 0 0 0Bordetella_pertussis_uid26 0 0 0 0 0 0 0 0 0 0 0 0 0 232 0 0 0 0 0 0 0 0 0 17 0 0 0 0Bordetella_petrii_uid28135 0 0 0 2 0 0 0 1 0 1 0 0 0 0 2 0 0 0 0 0 0 26 0 6 0 5 0 0Borrelia_afzelii_PKo_uid17057 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_afzelii_PKo_uid68149 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_bissettii_DN127_uid29363 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_burgdorferi_JD1_uid29359 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_burgdorferi_N40_uid29357 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_burgdorferi_ZS7_uid19839 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_burgdorferi_uid3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_duttonii_Ly_uid18231 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_garinii_PBi_uid12554 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_hermsii_DAH_uid29637 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_recurrentis_A1_uid18233 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Borrelia_turicatae_91E135_uid13597 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Brachybacterium_faecium_DSM_4810_uid17026 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0Brachyspira_hyodysenteriae_WA1_uid31263 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Brachyspira_intermedia_PWS_A_uid68051 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Brachyspira_murdochii_DSM_12563_uid29543 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0Brachyspira_pilosicoli_95_1000_uid48097 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Bradyrhizobium_BTAi1_uid16137 0 0 0 0 0 0 0 0 0 0 0 0 0 7 7 0 0 0 0 0 0 6 0 7 0 11 0 4Bradyrhizobium_ORS278_uid19575 13 0 0 0 0 0 0 0 0 0 0 0 0 2 1 0 0 0 0 0 0 3 0 0 0 0 0 4Bradyrhizobium_japonicum_uid17 0 0 0 0 0 1 0 15 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 6 0 9 0 2Brevibacillus_brevis_NBRC_100599_uid29147 1 0 0 0 0 0 2 0 0 2 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0Brevundimonas_subvibrioides_ATCC_15264_uid36643 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Brucella_abortus_S19_uid18999 0 0 0 0 0 0 0 0 0 0 0 0 0 2 1 0 0 0 0 0 0 3 0 0 0 6 0 0Brucella_abortus_bv__1_9_941_uid9619 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 3 0 0 2 6 0 0Brucella_canis_ATCC_23365_uid20243 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 2 7 0 0Brucella_melitensis_ATCC_23457_uid30561 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 2 10 0 0Brucella_melitensis_M28_uid61505 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 3 0 0 2 9 0 0Brucella_melitensis_M5_90_uid42891 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 3 0 0 2 9 0 0Brucella_ovis_uid12514 0 0 0 0 0 0 0 0 0 0 0 0 0 2 1 0 0 0 0 0 0 3 0 0 0 40 0 0Brucella_melitensis_biovar_Abortus_uid16203 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 3 0 0 2 7 0 0Brucella_melitensis_bv__1_16M_uid180 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 3 0 0 1 9 0 0Brucella_microti_CCM_4915_uid32233 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 3 0 0 2 13 0 0Brucella_pinnipedialis_B2_94_uid41867 0 0 0 0 0 0 0 2 0 0 0 0 0 2 0 0 0 0 0 0 0 3 0 0 2 33 0 0Brucella_suis_1330_uid320 0 0 0 0 0 0 0 0 0 0 0 0 0 1 7 0 0 0 0 0 0 3 0 0 2 2 0 0Brucella_suis_ATCC_23445_uid20371 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 3 0 0 2 13 0 0Buchnera_aphidicola_5A__Acyrthosiphon_pisum__uid31225 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 19: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Buchnera_aphidicola_Ak__Acyrthosiphon_kondoi__uid65481 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_Cc_Cinara_cedri_uid16372 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_JF98__Acyrthosiphon_pisum__uid43511 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_JF99__Acyrthosiphon_pisum__uid43509 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_LL01__Acyrthosiphon_pisum__uid43505 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_Sg_uid312 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_TLW03__Acyrthosiphon_pisum__uid43507 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_Tuc7__Acyrthosiphon_pisum__uid31223 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_Ua__Uroleucon_ambrosiae__uid65479 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola__Cinara_tujafilina__uid41379 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_aphidicola_uid256 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Buchnera_sp_uid245 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Burkholderia_383_uid10695 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0Burkholderia_CCGE1001_uid37717 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 3 0 0 0 0Burkholderia_CCGE1002_uid37719 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 11 0 6 2 5 0 1Burkholderia_CCGE1003_uid37721 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 2 0 0 0 0 0 0Burkholderia_JV3_uid53943 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 4 0 0 0 0Burkholderia_ambifaria_AMMD_uid13490 0 0 0 0 0 0 11 0 2 0 0 0 0 0 0 0 2 0 0 0 0 5 0 0 0 1 0 0Burkholderia_ambifaria_MC40_6_uid17411 0 0 0 0 0 0 12 0 0 0 0 2 0 0 0 0 4 0 0 0 0 5 0 1 0 0 2 0Burkholderia_cenocepacia_AU_1054_uid13919 0 0 0 0 0 0 6 0 0 0 0 0 0 3 0 0 6 4 0 0 0 4 0 16 0 6 0 0Burkholderia_cenocepacia_HI2424_uid13918 0 0 0 0 0 0 7 0 0 0 0 0 0 4 0 0 4 5 0 0 0 4 0 10 0 5 0 1Burkholderia_cenocepacia_J2315_uid339 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 3 5 0 0 0 36 0 20 0 8 0 6Burkholderia_cenocepacia_MC0_3_uid17929 0 0 0 0 0 0 3 0 2 0 0 6 0 3 0 0 9 0 0 0 0 16 0 6 0 0 0 1Burkholderia_gladioli_BSR3_uid64503 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 9 0 0 0 0 23 0 10 3 2 0 2Burkholderia_glumae_BGR1_uid33901 0 0 0 0 0 0 0 0 2 0 0 0 0 11 0 0 0 9 0 0 0 16 0 5 8 41 0 8Burkholderia_mallei_ATCC_23344_uid171 44 0 0 16 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 89 0 1 0 0 0 0Burkholderia_mallei_NCTC_10229_uid13943 45 0 0 15 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 87 0 1 0 0 0 0Burkholderia_mallei_NCTC_10247_uid13946 45 0 0 16 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 88 0 1 0 0 0 0Burkholderia_mallei_SAVP1_uid13947 38 0 0 12 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 80 0 0 0 0 0 0Burkholderia_multivorans_ATCC_17616_JGI_uid17407 0 0 0 0 0 0 0 3 0 1 0 0 0 0 0 0 14 13 0 0 0 22 0 11 0 15 0 1Burkholderia_multivorans_ATCC_17616_Tohoku_uid19401 0 0 0 0 0 0 0 3 0 1 0 0 0 0 0 0 14 10 0 0 0 19 0 7 0 16 0 1Burkholderia_phymatum_STM815_uid17409 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 3 0 0 3 0 0 2 0 4 0 7 4 0Burkholderia_phytofirmans_PsJN_uid17463 0 0 0 0 0 0 1 0 0 0 0 0 0 6 0 0 6 5 2 0 0 4 0 0 0 1 0 0Burkholderia_pseudomallei_1106a_uid16182 5 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 10 0 1 1 9 0 0Burkholderia_pseudomallei_1710b_uid13954 4 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 11 0 5 1 0 0 0Burkholderia_pseudomallei_668_uid13953 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0Burkholderia_pseudomallei_K96243_uid178 5 0 0 6 0 0 0 0 0 0 0 0 0 1 1 0 0 1 0 0 0 4 0 1 2 1 0 0Burkholderia_rhizoxinica_HKI_454_uid51915 0 0 0 1 0 0 0 0 0 1 0 0 0 0 4 0 0 7 0 0 0 8 0 2 0 3 1 1Burkholderia_thailandensis_E264_uid10774 0 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 17 0 0 0 0 9 0 5 0 14 0 0Burkholderia_vietnamiensis_G4_uid10696 1 0 0 0 0 0 0 0 0 8 0 44 0 0 0 0 18 14 0 0 0 12 0 5 0 31 5 9Burkholderia_xenovorans_LB400_uid254 3 0 0 6 0 0 0 1 0 2 0 1 0 10 0 1 2 13 0 0 0 17 0 7 0 8 0 7Butyrivibrio_fibrisolvens_uid39147 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Butyrivibrio_proteoclasticus_B316_uid29153 4 0 0 0 0 0 6 0 2 0 0 0 0 0 0 0 0 0 0 2 0 0 0 6 0 0 3 0Caldicellulosiruptor_hydrothermalis_108_uid40831 0 0 0 0 0 0 4 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0Caldicellulosiruptor_kristjanssonii_177R1B_uid41727 0 0 0 0 0 0 2 0 3 0 0 13 0 0 29 0 2 0 0 4 0 0 0 10 0 0 0 0Caldicellulosiruptor_kronotskyensis_2002_uid52409 0 0 0 4 3 0 2 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Caldicellulosiruptor_lactoaceticus_6A_uid40219 0 0 0 0 0 0 4 0 3 0 0 6 0 0 10 0 0 2 0 0 0 0 0 12 0 0 0 0Caldicellulosiruptor_obsidiansis_OB47_uid40355 0 0 0 0 0 0 6 0 1 0 0 0 0 5 14 0 0 1 0 0 0 0 0 6 0 0 0 0Caldicellulosiruptor_owensensis_OL_uid40833 0 0 0 1 1 0 0 0 1 0 0 0 0 0 8 0 1 0 0 0 0 0 0 0 0 0 0 0Caldicellulosiruptor_saccharolyticus_DSM_8903_uid13466 0 0 0 0 0 0 0 0 13 0 0 0 0 25 33 0 0 15 0 0 0 0 0 3 0 0 0 0Calditerrivibrio_nitroreducens_DSM_19672_uid49523 0 0 0 0 0 0 2 0 0 0 0 0 0 0 9 0 9 0 0 0 0 0 0 0 0 0 0 0Caldivirga_maquilingensis_IC-­‐167_uid17421 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_concisus_13826_uid17159 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_curvus_525_92_uid17161 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_fetus_82-­‐40_uid16293 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_hominis_ATCC_BAA-­‐381_uid20083 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_81-­‐176_uid16135 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_81116_uid17953 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_IA3902_uid28907 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_ICDCCJ07001_uid47949 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_M1_uid38041 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 20: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Campylobacter_jejuni_RM1221_uid303 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_S3_uid45947 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_doylei_269_97_uid17163 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_jejuni_uid8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Campylobacter_lari_RM2100_uid12517 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Accumulibacter_phosphatis_clade_IIA_UW_1_uid29435 3 0 8 0 7 0 0 0 3 3 1 0 2 34 0 0 0 11 0 0 0 2 0 4 0 2 12 4Candidatus_Amoebophilus_asiaticus_5a2_uid19981 3 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 19 0 24 3 2 45 0 0Candidatus_Blochmannia_floridanus_uid443 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Arthromitus_SFB_mouse_Japan_uid66727 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Azobacteroides_pseudotrichonymphae_genomovar__CFP2_uid29025 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Blochmannia_pennsylvanicus_BPEN_uid13875 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Blochmannia_vafer_BVAF_uid50045 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Carsonella_ruddii_PV_uid17977 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Desulfococcus_oleovorans_Hxd3_uid18007 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 3 0 0Candidatus_Desulforudis_audaxviator_MP104C_uid21047 15 0 0 0 3 0 2 0 9 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 3Candidatus_Hamiltonella_defensa_5AT__Acyrthosiphon_pisum__uid31259 0 0 0 0 0 0 0 23 1 0 0 0 0 0 0 0 0 0 0 0 0 5 0 2 0 6 0 0Candidatus_Hodgkinia_cicadicola_Dsem_uid32135 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Korarchaeum_cryptofilum_OPF8_uid16525 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Koribacter_versatilis_Ellin345_uid15771 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0Candidatus_Liberibacter_asiaticus_psy62_uid29835 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Liberibacter_solanacearum_CLso_ZC1_uid39273 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Methanoregula_boonei_6A8_uid18505 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Methanosphaerula_palustris_E1_9c_uid29381 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Midichloria_mitochondrii_IricVA_uid42125 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 25 0 0 0 0 0 0 0Candidatus_Moranella_endobia_PCIT_uid53213 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Nitrospira_defluvii_uid46433 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 3 0 0 0 4 0 0 0 1 0 0Candidatus_Pelagibacter_IMCC9063_uid62303 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Pelagibacter_ubique_HTCC1062_uid13989 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Phytoplasma_australiense_uid29469 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 12 0 0 0 0 0 0Candidatus_Phytoplasma_mali_uid25335 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Riesia_pediculicola_USDA_uid29065 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Ruthia_magnifica_Cm_Calyptogena_magnifica__uid16841 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Sulcia_muelleri_CARI_uid43087 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Sulcia_muelleri_DMIN_uid37785 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Sulcia_muelleri_GWSS_uid19617 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Sulcia_muelleri_SMDSEM_uid33829 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Tremblaya_princeps_PCIT_uid53211 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Candidatus_Vesicomyosocius_okutanii_HA_uid18267 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Capnocytophaga_canimorsus_Cc5_uid50199 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 1 0 0Capnocytophaga_ochracea_DSM_7271_uid29403 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Carboxydothermus_hydrogenoformans_Z-­‐2901_uid253 0 0 0 0 0 0 0 0 0 0 0 0 0 1 6 0 0 1 0 0 0 0 0 5 0 0 0 0Carnobacterium_17_4_uid60607 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Catenulispora_acidiphila_DSM_44928_uid21085 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 1 0 4 0 0Caulobacter_K31_uid16306 0 0 1 0 0 0 0 7 0 0 0 0 0 0 0 0 0 5 0 0 0 15 0 7 0 3 0 0Caulobacter_crescentus_NA1000_uid32027 2 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 3 0 6 0 9 0 0Caulobacter_crescentus_uid298 2 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 3 0 6 0 9 0 0Caulobacter_segnis_ATCC_21756_uid37277 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Cellulomonas_fimi_ATCC_484_uid33691 0 0 0 0 0 0 0 0 0 0 0 0 0 4 2 0 0 0 0 0 0 0 0 0 0 0 0 0Cellulomonas_flavigena_DSM_20109_uid19707 0 0 0 2 0 0 0 0 0 0 0 4 0 6 2 0 0 0 0 0 0 1 0 0 0 0 0 0Cellulophaga_algicola_DSM_14237_uid41529 3 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0Cellulophaga_lytica_DSM_7489_uid50743 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Cellvibrio_gilvus_ATCC_13127_uid33853 0 0 0 1 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 9 0 0 0 0 0 0Cellvibrio_japonicus_Ueda107_uid28329 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0 13 17Chitinophaga_pinensis_DSM_2588_uid27951 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 10 0 0 0 0 10 0Chlamydia_muridarum_uid229 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_434_Bu_uid28583 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_A_HAR-­‐13_uid13885 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_B_Jali20_OT_uid32655 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_B_TZ1A828_OT_uid32653 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_D_EC_uid48087 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_D_LC_uid48089 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 21: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Chlamydia_trachomatis_D_UW_3_CX_uid45 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_E_11023_uid43141 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_E_150_uid43143 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_G_11074_uid43149 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_G_11222_uid43147 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_G_9301_uid45851 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_G_9768_uid43145 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_L2b_UCH_1_proctitis_uid28585 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_L2c_uid47581 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydia_trachomatis_Sweden2_uid43167 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_abortus_S26_3_uid355 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_caviae_uid228 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_felis_Fe_C-­‐56_uid370 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_pecorum_E58_uid62893 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_pneumoniae_AR39_uid247 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_pneumoniae_CWL029_uid248 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_pneumoniae_J138_uid257 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_pneumoniae_LPCoLN_uid17947 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_pneumoniae_TW_183_uid420 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_psittaci_01DC11_uid66479 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_psittaci_02DC15_uid66481 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_psittaci_08DC60_uid66485 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_psittaci_6BC_uid62889 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_psittaci_6BC_uid63149 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_psittaci_C19_98_uid66483 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlamydophila_psittaci_RD1_uid60445 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlorobaculum_parvum_NCIB_8327_uid29213 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlorobium_chlorochromatii_CaD3_uid13921 0 0 0 14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlorobium_limicola_DSM_245_uid12606 7 0 0 0 6 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Chlorobium_luteolum_DSM_273_uid13012 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chlorobium_phaeobacteroides_BS1_uid12608 0 0 0 0 2 31 0 1 4 0 0 0 2 0 0 0 0 2 0 0 0 3 0 0 0 10 0 0Chlorobium_phaeobacteroides_DSM_266_uid12609 0 0 0 10 3 6 0 9 0 0 0 0 5 0 0 0 0 25 0 0 0 2 0 0 0 0 2 0Chlorobium_tepidum_TLS_uid302 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Chloroflexus_Y_400_fl_uid21119 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 5 0 0 0 0 0 0 0 0 0 10 0 0Chloroflexus_aggregans_DSM_9485_uid16708 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0Chloroflexus_aurantiacus_J_10_fl_uid59 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 5 0 0 0 0 0 0 0 0 0 10 0 0Chloroherpeton_thalassium_ATCC_35110_uid29215 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0Chromobacterium_violaceum_uid444 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Chromohalobacter_salexigens_DSM_3043_uid12636 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Citrobacter_koseri_ATCC_BAA-­‐895_uid12716 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Citrobacter_rodentium_ICC168_uid34685 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 18 0 13 38 0Clavibacter_michiganensis_NCPPB_382_uid19643 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clavibacter_michiganensis_sepedonicus_uid184 0 0 0 0 0 0 0 0 0 0 0 8 0 25 71 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridiales_genomosp__BVAB3_UPII9_5_uid42555 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_SY8519_uid67799 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 3 0 0 0 0 0 0Clostridium_acetobutylicum_DSM_1731_uid63241 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_acetobutylicum_EA_2018_uid50455 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_acetobutylicum_uid77 1 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_beijerinckii_NCIMB_8052_uid12637 0 0 0 0 0 0 0 2 9 0 0 0 0 0 4 0 1 0 0 0 0 2 0 0 0 0 2 0Clostridium_botulinum_A2_Kyoto_uid29859 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_botulinum_A3_Loch_Maree_uid28507 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_botulinum_A_ATCC_19397_uid19517 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_botulinum_A_Hall_uid19521 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Clostridium_botulinum_A_uid193 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Clostridium_botulinum_B1_Okra_uid28505 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_botulinum_BKT015925_uid60407 18 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 8 12 0 0 0 0 0 0 27 0 0 0Clostridium_botulinum_B_Eklund_17B_uid28857 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_botulinum_Ba4_657_uid29077 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0Clostridium_botulinum_E3_Alaska_E43_uid28855 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_botulinum_F_230613_uid47575 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_botulinum_F_Langeland_uid19519 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 22: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Clostridium_botulinum_H04402_065_uid61511 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0Clostridium_cellulolyticum_H10_uid17419 0 0 0 0 0 0 7 0 0 0 0 0 0 26 0 0 3 0 0 0 0 27 0 12 0 0 0 4Clostridium_cellulovorans_743B_uid32609 2 0 0 0 0 0 12 0 0 0 0 5 0 0 0 0 4 0 0 0 31 6 0 0 0 0 0 0Clostridium_cf__saccharolyticum_K10_uid45855 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_2007855_uid42471 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_630_uid78 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_ATCC_43255_uid42473 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_BI1_uid42469 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_CD196_uid38037 0 0 0 0 0 0 17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Clostridium_difficile_CF5_uid42465 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_M120_uid42467 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_M68_uid42463 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_difficile_R20291_uid38039 0 0 0 0 0 0 17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Clostridium_kluyveri_DSM_555_uid19065 0 0 0 0 0 0 0 0 0 0 0 1 0 0 1 0 1 0 0 0 1 8 0 0 0 0 1 0Clostridium_kluyveri_NBRC_12016_uid33271 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_lentocellum_DSM_5427_uid41511 0 0 0 3 0 0 0 6 0 0 0 0 0 0 7 0 0 0 0 0 0 14 0 0 0 0 6 0Clostridium_ljungdahlii_ATCC_49587_uid13492 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0Clostridium_novyi_NT_uid16820 27 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0Clostridium_perfringens_ATCC_13124_uid304 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_perfringens_SM101_uid12521 2 0 0 0 0 0 23 0 0 0 0 16 0 0 2 0 3 0 0 0 0 1 0 0 14 0 0 0Clostridium_perfringens_uid79 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Clostridium_phytofermentans_ISDg_uid16184 0 0 1 12 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 2 0 3 0 0 0 0 0 0Clostridium_saccharolyticum_WM1_uid42097 0 0 0 0 0 0 11 0 0 0 0 4 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Clostridium_sticklandii_DSM_519_uid28575 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0Clostridium_tetani_E88_uid81 0 0 0 0 0 0 2 0 0 0 0 0 0 0 3 0 0 0 0 0 0 9 0 0 1 0 0 0Clostridium_thermocellum_ATCC_27405_uid314 0 0 0 2 0 0 2 0 0 0 0 27 0 0 0 1 24 8 0 0 7 18 0 10 0 0 0 1Clostridium_thermocellum_DSM_1313_uid41553 0 0 0 2 0 0 11 0 0 0 0 0 0 0 0 1 18 0 0 0 4 13 0 1 0 0 0 0Collimonas_fungivorans_Ter331_uid65451 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 1 0 0Colwellia_psychrerythraea_34H_uid275 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 6 0 0 0 0Comamonas_testosteroni_CNB_1_uid29203 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 7 0 0 0 4 0 0 0 8 0 0Conexibacter_woesei_DSM_14684_uid20745 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0Coprococcus_ART55_1_uid45915 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Coprococcus_catus_GD_7_uid45861 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Coprothermobacter_proteolyticus_DSM_5265_uid30729 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0Coraliomargarita_akajimensis_DSM_45221_uid33365 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0Coriobacterium_glomerans_PW2_uid42699 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_aurimucosum_ATCC_700975_uid37279 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 5 4 0 0 0 18 0 2 0 0 0 0Corynebacterium_diphtheriae_uid87 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Corynebacterium_efficiens_YS-­‐314_uid305 4 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 1 0 0 3 1 7 0Coxiella_burnetii_uid41 0 0 0 0 0 0 0 0 0 0 3 5 0 0 0 0 0 0 0 0 0 0 0 19 0 0 0 0Corynebacterium_glutamicum_ATCC_13032_Bielefeld_uid13760 0 0 0 4 0 3 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 4 1 0 0Corynebacterium_glutamicum_ATCC_13032_Kitasato_uid307 0 0 0 3 0 3 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 4 2 0 0Corynebacterium_glutamicum_R_uid19193 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 1 0 0Corynebacterium_jeikeium_K411_uid13967 0 0 0 0 0 0 0 0 0 0 0 5 0 6 1 0 23 0 0 0 0 15 0 7 0 0 0 0Corynebacterium_kroppenstedtii_DSM_44385_uid38011 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_pseudotuberculosis_1002_uid40687 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_pseudotuberculosis_C231_uid40875 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_pseudotuberculosis_I19_uid52845 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_pseudotuberculosis_PAT10_uid61115 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_pseudotuberculosis_uid48979 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_resistens_DSM_45100_uid39683 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 23 0 0 0 0 7 0 2 7 3 0 0Corynebacterium_ulcerans_809_uid61801 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_ulcerans_BR_AD22_uid61803 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Corynebacterium_urealyticum_DSM_7109_uid29211 0 0 0 1 0 1 0 0 0 0 0 4 0 1 0 0 11 0 0 0 0 2 0 0 2 0 0 0Corynebacterium_variabile_DSM_44702_uid50001 0 0 0 13 1 0 0 0 0 2 0 0 0 0 0 0 2 2 0 0 0 12 0 0 0 9 0 0Coxiella_burnetii_CbuG_Q212_uid19137 0 0 0 0 0 0 0 0 0 0 4 0 0 1 0 0 0 0 0 0 0 0 0 28 0 0 0 0Coxiella_burnetii_CbuK_Q154_uid19139 0 0 0 0 0 0 0 0 0 0 3 4 0 0 0 0 0 0 0 0 0 0 0 48 0 0 0 0Coxiella_burnetii_Dugway_7E9-­‐12_uid16721 0 0 0 0 0 0 0 0 0 0 4 11 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0Coxiella_burnetii_RSA_331_uid16791 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 46 0 0 0 0Croceibacter_atlanticus_HTCC2559_uid13570 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Cronobacter_turicensis_uid39965 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 23: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Cryptobacterium_curtum_DSM_15641_uid20739 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Cupriavidus_metallidurans_CH34_uid250 0 0 0 0 0 0 0 0 0 5 0 12 0 16 0 0 4 2 0 0 0 17 0 0 0 5 2 0Cupriavidus_necator_N_1_uid67893 0 0 0 4 0 0 14 3 0 0 0 2 0 0 0 0 17 0 0 0 0 12 0 16 0 1 7 2Cupriavidus_taiwanensis_uid15733 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 10 0 0 0 0 0 3 0 9 2 0Cyanobacteria_bacterium_Yellowstone_A-­‐Prime_uid16251 0 0 0 0 0 0 11 16 0 0 0 0 0 0 0 0 0 0 0 23 0 0 0 0 0 0 0 0Cyanobacteria_bacterium_Yellowstone_B-­‐Prime_uid16252 0 0 0 0 0 0 34 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 17 0 0Cyanothece_ATCC_51142_uid20319 0 0 0 3 0 1 0 4 0 1 7 0 0 0 13 1 0 0 0 0 3 1 0 0 0 1 0 0Cyanothece_PCC_7424_uid20479 0 0 0 1 0 1 15 0 0 0 8 0 0 0 12 0 0 0 0 4 0 0 0 0 0 0 0 0Cyanothece_PCC_7425_uid28337 0 0 0 2 0 0 0 9 0 2 7 0 0 0 0 0 6 0 0 0 0 0 0 0 0 19 1 30Cyanothece_PCC_7822_uid28535 0 0 0 0 0 0 4 3 0 0 1 0 0 2 1 2 0 0 0 2 0 0 0 0 0 9 1 0Cyanothece_PCC_8801_uid20503 0 0 0 0 0 0 12 3 0 0 0 0 0 0 8 1 0 0 0 0 0 0 0 0 0 0 13 0Cyanothece_PCC_8802_uid28339 0 0 0 0 0 0 12 2 0 0 0 0 0 0 6 3 0 0 0 0 0 0 0 0 0 3 0 0Cyclobacterium_marinum_DSM_745_uid51773 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 4 0 0 0 0 6 0 7 11 2Cytophaga_hutchinsonii_ATCC_33406_uid54 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 8 0 0 0 0 0 0Dechloromonas_aromatica_RCB_uid9635 2 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 4 0 3 0 0 1 0Deferribacter_desulfuricans_SSM1_uid37285 0 0 0 0 0 0 17 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 2 0 0Dehalococcoides_BAV1_uid15770 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 1 0 0 0Dehalococcoides_GT_uid36645 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Dehalococcoides_CBDB1_uid15604 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Dehalococcoides_VS_uid18811 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Dehalococcoides_ethenogenes_195_uid214 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 2 0 0 0 0 0 0Dehalogenimonas_lykanthroporepellens_BL_DC_9_uid40221 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 27 0 0 0 0 11 0 2 0 12 0 0Deinococcus_deserti_VCD115_uid16691 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 1 0 0 0 0 2 5 0 0 0 1 4 0Deinococcus_geothermalis_DSM_11300_uid13423 0 0 0 0 0 0 3 1 0 0 0 0 0 0 0 17 0 0 0 1 1 0 19 0 8 12 8 6Deinococcus_maricopensis_DSM_21211_uid43461 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0Deinococcus_proteolyticus_MRP_uid41911 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 2 0 0 0 0 5 13 0Deinococcus_radiodurans_uid65 0 0 0 0 0 0 9 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 25 1Delftia_Cs1_4_uid46303 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 2 0 0 0 1 0 1 0 1 0 1Delftia_acidovorans_SPH-­‐1_uid17413 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 1 0 0 0 10 0 2Denitrovibrio_acetiphilus_DSM_12809_uid29431 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 16 16 0 0 0 16 0 0 0 6 13 0Desulfarculus_baarsii_DSM_2075_uid37955 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0Desulfatibacillum_alkenivorans_AK_01_uid19449 0 0 1 0 15 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 12 0 1 18 9Desulfitobacterium_hafniense_DCB_2_uid205 9 0 0 0 3 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 4 0 10 0 1 14 0Desulfitobacterium_hafniense_Y51_uid16639 0 0 0 0 1 0 2 0 0 0 0 0 0 0 0 0 3 0 0 0 0 4 0 13 0 0 18 0Desulfobacca_acetoxidans_DSM_11109_uid51777 1 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 2 0 0 0 2 0 0 0 0 0 0Desulfobacterium_autotrophicum_HRM2_uid20931 0 0 0 0 7 5 0 0 0 0 0 0 0 0 23 0 0 2 4 0 0 3 0 7 0 0 4 3Desulfobulbus_propionicus_DSM_2032_uid32993 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 7 4 0Desulfohalobium_retbaense_DSM_5692_uid29199 3 0 0 10 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 4 0 0 0 2Desulfomicrobium_baculatum_DSM_4028_uid29527 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 10 0 3 0 0 0 0Desulfotalea_psychrophila_LSv54_uid12751 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 1 0 0Desulfotomaculum_acetoxidans_DSM_771_uid27947 0 0 0 0 22 0 18 21 0 0 0 0 0 0 19 0 0 0 0 0 0 2 0 7 0 3 2 0Desulfotomaculum_carboxydivorans_CO_1_SRB_uid50757 0 0 0 0 0 0 2 0 0 0 0 3 0 3 28 0 8 6 0 0 0 0 0 0 0 0 0 0Desulfotomaculum_kuznetsovii_DSM_6115_uid48313 2 0 0 0 12 0 0 0 0 0 0 0 0 0 0 0 0 2 0 6 0 0 0 3 0 0 0 3Desulfotomaculum_reducens_MI-­‐1_uid13424 2 0 0 0 0 16 0 0 0 0 0 0 0 4 0 0 0 3 0 0 0 0 0 14 0 0 7 0Desulfotomaculum_ruminis_DSM_2154_uid47605 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 19 0 5 0 0 0 0Desulfovibrio_aespoeensis_Aspo_2_uid37869 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Desulfovibrio_desulfuricans_ATCC_27774_uid29493 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0Desulfovibrio_desulfuricans_G20_uid329 8 0 0 0 0 0 0 0 11 0 0 0 0 0 0 0 0 2 0 0 0 10 0 0 0 1 0 0Desulfovibrio_magneticus_RS_1_uid32105 13 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 3 1 10Desulfovibrio_salexigens_DSM_2638_uid29541 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Desulfovibrio_vulgaris_DP4_uid17227 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 1 0 0 0 0 0 0Desulfovibrio_vulgaris_Hildenborough_uid51 0 0 0 1 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 7 0 0 0 0 0 0Desulfovibrio_vulgaris_RCH1_uid32603 0 0 0 1 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 7 0 0 0 0 0 0Desulfovibrio_vulgaris__Miyazaki_F__uid27731 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Desulfurispirillum_indicum_S5_uid38575 0 0 0 4 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 8 0 0 0 15 0 0Desulfurivibrio_alkaliphilus_AHT2_uid33629 0 0 0 0 2 0 0 0 3 0 0 0 0 0 0 0 10 7 0 0 0 0 0 0 0 0 6 0Desulfurococcus_kamchatkensis_1221n_uid28141 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0Desulfurobacterium_thermolithotrophum_DSM_11699_uid51497 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 1 0 0 5 0 0 0 0 0 20 0 0Desulfurococcus_mucosus_DSM_2162_uid48641 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Diaphorobacter_TPSY_uid29975 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 5 0 0 0 4 0 3Dichelobacter_nodosus_VCS1703A_uid50 0 0 0 0 0 0 1 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 24: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Dickeya_dadantii_3937_uid30 0 0 0 0 0 0 6 0 0 0 0 1 0 0 0 0 0 0 0 0 0 2 0 15 0 0 2 0Dickeya_dadantii_Ech586_uid33667 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 2 0 0 0 0Dickeya_dadantii_Ech703_uid33069 0 0 0 0 0 0 0 2 0 0 0 0 0 4 0 0 1 0 0 0 0 0 0 0 0 0 0 0Dickeya_zeae_Ech1591_uid31295 0 0 0 0 0 0 0 0 2 0 2 0 0 0 0 0 4 0 0 0 0 2 0 10 0 4 13 0Dictyoglomus_thermophilum_H_6_12_uid30731 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0Dictyoglomus_turgidum_DSM_6724_uid29175 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Dinoroseobacter_shibae_DFL_12_uid17417 3 0 1 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 2 0 0 11 0 0 1 2 0 0Dyadobacter_fermentans_DSM_18053_uid20829 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 9 0 6 0 0 0 0Edwardsiella_ictaluri_93_146_uid34853 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 36 0 0 0 0 2 7 0 0 2 0 0Edwardsiella_tarda_EIB202_uid28539 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 10 0 0 0 0 0 0 0 1 0 0Edwardsiella_tarda_FL6_60_uid45969 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 3 0 0 0 0 0 0Eggerthella_YY7918_uid67615 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Eggerthella_lenta_DSM_2243_uid21093 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 10 0 0 0 0 2 0 0 0 0 0 0Ehrlichia_canis_Jake_uid10694 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ehrlichia_chaffeensis_Arkansas_uid325 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ehrlichia_ruminantium_Gardel_uid13356 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ehrlichia_ruminantium_Welgevonden_UPSA_uid9614 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ehrlichia_ruminantium_str 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Elusimicrobium_minutum_Pei191_uid19701 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Enterobacter_638_uid17461 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0Enterobacter_aerogenes_KCTC_2190_uid66537 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Enterobacter_asburiae_LF7a_uid41509 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Enterobacter_cloacae_ATCC_13047_uid45793 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 47 3 4 3 19 7 0Enterobacter_cloacae_NCTC_9394_uid45967 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Enterobacter_cloacae_SCF1_uid53533 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 5 0Enterobacter_sakazakii_ATCC_BAA-­‐894_uid12720 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Enterococcus_7L76_uid39181 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Enterococcus_faecalis_62_uid61185 0 0 0 1 0 0 1 0 0 1 0 13 0 0 0 0 7 0 0 0 0 5 0 0 2 0 0 0Enterococcus_faecalis_OG1RF_uid20843 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Enterococcus_faecalis_V583_uid70 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 16 0 0 0 0 1 0 0 9 0 0 0Erwinia_Ejp617_uid51141 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 3 0 0 0 0 21 0 0 0 1 0 0Erwinia_amylovora_ATCC_49946_uid43757 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Erwinia_amylovora_CFBP1430_uid46805 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Erwinia_billingiae_Eb661_uid34781 0 0 0 0 0 0 0 0 0 1 0 5 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0Erwinia_carotovora_atroseptica_SCRI1043_uid350 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 1 0 0 0 0 0 0Erwinia_pyrifoliae_DSM_12163_uid37877 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 14 0 0 0 0 23 0 0 0 0 0 0Erwinia_pyrifoliae_Ep1_96_uid34779 0 0 0 0 0 0 0 11 0 0 0 0 0 0 8 0 14 0 0 0 0 22 0 0 0 0 0 0Erwinia_tasmaniensis_uid20585 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Erysipelothrix_rhusiopathiae_uid38421 0 0 0 0 0 0 0 0 0 0 0 13 0 0 1 0 0 0 0 0 0 7 0 0 0 0 0 0Erythrobacter_litoralis_HTCC2594_uid13480 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0Escherichia_coli_0127_H6_E2348_69_uid32571 0 0 0 0 0 0 8 0 0 0 0 1 0 5 0 0 1 2 1 0 0 9 1 6 0 0 32 4Escherichia_coli_042_uid40647 0 0 0 0 0 0 3 0 1 0 3 2 0 1 2 0 2 0 0 0 0 18 12 3 0 0 0 6Escherichia_coli_536_uid16235 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 9 0 0 0 14 0 3 0 0 2 4Escherichia_coli_55989_uid33413 0 0 0 0 0 0 3 0 0 0 5 3 0 1 2 0 0 1 0 0 0 14 18 11 0 0 8 7Escherichia_coli_ABU_83972_uid38725 0 0 0 0 0 0 2 1 0 0 0 1 0 0 0 0 0 2 0 0 0 10 3 4 0 0 2 2Escherichia_coli_APEC_O1_uid16718 0 0 0 0 0 0 2 0 0 0 1 0 0 0 2 0 0 6 1 0 0 11 8 1 3 4 3 1Escherichia_coli_BL21_DE3__uid20713 0 0 0 0 0 0 1 0 0 0 4 1 0 0 0 0 0 0 0 0 0 13 28 0 0 0 6 0Escherichia_coli_BL21_DE3__uid28965 0 0 0 0 0 0 1 0 0 0 4 1 0 0 0 0 0 0 0 0 0 13 29 0 0 0 6 0Escherichia_coli_BW2952_uid33775 0 0 0 0 0 0 1 0 0 0 3 3 0 0 0 0 0 0 0 0 0 18 7 0 0 14 5 0Escherichia_coli_B_REL606_uid18281 0 0 0 0 0 0 1 0 0 0 5 1 0 0 0 0 0 0 0 0 0 14 27 0 0 0 6 0Escherichia_coli_CFT073_uid313 0 0 0 0 0 0 19 1 0 0 0 1 0 0 2 0 0 6 0 0 0 15 2 1 0 0 1 5Escherichia_coli_C_ATCC_8739_uid18083 0 0 0 0 0 0 2 0 0 0 6 4 0 0 0 0 0 0 0 0 0 7 20 0 0 2 4 0Escherichia_coli_DH1_uid30031 0 0 0 0 0 0 1 0 0 0 3 3 0 1 0 0 0 0 0 0 0 12 8 0 0 16 4 0Escherichia_coli_DH1_uid52077 0 0 0 0 0 0 1 0 0 0 2 3 0 1 0 0 0 0 0 0 0 13 7 0 0 19 4 0Escherichia_coli_E24377A_uid13960 0 0 0 0 0 0 2 2 0 0 7 1 0 0 0 0 1 9 4 0 0 15 9 10 0 0 1 16Escherichia_coli_ED1a_uid33409 0 0 0 0 0 0 3 0 0 0 1 0 0 0 2 0 0 0 1 0 0 12 26 2 0 2 1 7Escherichia_coli_ETEC_H10407_uid42749 0 0 0 0 0 0 2 0 0 0 4 0 0 0 2 0 0 0 2 0 0 5 0 1 0 0 3 89Escherichia_coli_HS_uid13959 0 0 0 0 0 0 2 0 0 0 6 0 0 0 0 0 0 0 0 0 0 15 25 15 0 0 3 2Escherichia_coli_IAI1_uid33373 0 0 0 0 0 0 3 0 0 0 7 0 0 0 0 0 0 0 0 0 0 5 1 7 0 0 0 0Escherichia_coli_IAI39_uid33411 0 0 0 5 0 0 2 0 0 0 0 0 0 0 2 0 0 0 0 0 0 59 11 2 2 0 15 0Escherichia_coli_IHE3034_uid43693 0 0 0 0 0 0 2 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 2 2 0 0 2 0

Page 25: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Escherichia_coli_KO11_uid33875 0 0 0 0 0 0 4 0 0 0 4 0 0 0 2 0 0 0 2 0 0 8 1 5 0 0 0 0Escherichia_coli_K_12_substr__DH10B_uid20079 0 0 0 0 0 0 1 0 0 0 3 3 0 1 0 0 0 0 0 0 0 24 11 0 0 15 5 0Escherichia_coli_K_12_substr__MG1655_uid225 0 0 0 0 0 0 1 0 0 0 3 3 0 1 0 0 0 0 0 0 0 12 7 0 0 11 4 0Escherichia_coli_K_12_substr__W3110_uid16351 0 0 0 0 0 0 1 0 0 0 7 3 0 1 0 0 0 0 0 0 0 21 7 0 0 35 4 0Escherichia_coli_LF82_uid33825 0 0 0 0 0 0 2 0 0 0 4 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 1 0Escherichia_coli_NA114_uid66975 0 0 0 0 0 0 3 0 0 0 3 1 0 0 0 0 0 2 0 0 0 7 2 1 0 0 3 3Escherichia_coli_O103_H2_12009_uid32511 0 0 0 0 0 0 2 0 0 0 7 0 0 0 4 0 0 0 0 0 0 34 2 13 0 0 2 5Escherichia_coli_O111_H__11128_uid32513 0 0 0 0 0 0 3 0 0 0 9 1 0 0 0 0 0 0 5 0 0 51 7 1 4 0 1 4Escherichia_coli_O157H7_EDL933_uid259 0 0 0 0 0 0 2 1 0 0 7 10 0 0 2 0 0 0 1 0 0 25 3 0 0 0 1 5Escherichia_coli_O157H7_uid226 0 0 0 0 0 0 2 0 0 0 7 1 0 1 0 0 0 0 2 0 0 37 2 0 0 0 1 10Escherichia_coli_O157_H7_EC4115_uid27739 0 0 0 0 0 0 2 0 0 0 5 1 0 1 0 0 0 0 2 0 0 22 11 0 0 0 1 11Escherichia_coli_O157_H7_TW14359_uid30045 0 0 0 0 0 0 2 1 0 0 5 1 0 1 5 0 0 0 2 0 0 23 3 0 0 0 1 11Escherichia_coli_O26_H11_11368_uid32509 0 0 0 2 0 0 3 0 0 0 9 1 0 0 2 0 0 0 4 0 0 24 5 13 0 0 1 19Escherichia_coli_O55_H7_CB9615_uid42729 0 0 0 0 0 0 2 0 0 0 6 0 0 0 1 0 0 1 0 0 0 7 1 1 0 0 2 4Escherichia_coli_O83_H1_NRG_857C_uid41221 0 0 0 0 0 0 2 0 0 0 3 0 0 0 0 0 0 0 0 0 0 4 7 1 2 1 1 0Escherichia_coli_S88_uid33375 0 0 0 0 0 0 2 0 0 0 1 1 0 0 3 0 0 4 0 0 0 4 4 1 0 0 2 3Escherichia_coli_SE11_uid18057 0 0 0 0 0 0 4 0 0 0 9 0 0 0 0 0 0 0 1 0 0 5 2 8 2 2 0 2Escherichia_coli_SE15_uid19053 0 0 0 0 0 0 1 1 0 0 4 0 0 0 0 0 0 9 1 0 0 6 3 1 1 0 1 8Escherichia_coli_SMS_3_5_uid19469 0 0 0 2 0 0 4 0 0 0 0 0 0 1 0 0 1 0 0 0 0 19 11 5 9 3 8 3Escherichia_coli_UM146_uid50883 0 0 0 0 0 0 2 0 0 0 1 0 0 0 0 0 0 3 1 0 0 6 3 4 0 0 3 1Escherichia_coli_UMN026_uid33415 0 0 0 0 0 0 7 0 1 0 4 1 0 0 0 0 0 0 0 0 0 12 5 0 4 4 3 4Escherichia_coli_UMNF18_uid48455 0 0 0 0 0 0 1 0 0 0 9 0 0 0 1 0 0 3 13 0 0 21 9 1 3 0 14 9Escherichia_coli_UMNK88_uid42137 0 0 0 0 0 1 1 0 0 0 8 3 0 1 0 0 1 0 1 0 0 19 7 2 0 3 4 5Escherichia_coli_UTI89_uid16259 0 0 0 0 0 0 2 0 0 0 1 0 0 0 0 0 0 3 1 0 0 6 4 2 0 0 2 3Escherichia_coli_W_uid48011 0 0 0 0 0 0 4 0 0 0 4 0 0 0 0 0 0 0 2 0 0 7 1 5 0 0 0 0Escherichia_coli__BL21_Gold_DE3_pLysS_AG__uid30681 0 0 0 0 0 0 1 0 0 0 3 1 0 0 0 0 0 0 0 0 0 14 28 0 0 2 12 0Escherichia_fergusonii_ATCC_35469_uid33369 0 0 0 0 0 0 1 0 1 2 2 0 0 0 0 0 0 0 0 0 0 3 1 3 0 2 5 0Ethanoligenens_harbinense_YUAN_3_uid39729 0 0 0 0 0 0 4 0 0 0 0 0 0 0 2 0 0 9 0 0 0 0 0 0 0 0 0 0Eubacterium_cylindroides_T2_87_uid45917 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 0 0 0 0 0 0 0 0 0Eubacterium_eligens_ATCC_27750_uid29073 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0Eubacterium_limosum_KIST612_uid52281 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Eubacterium_rectale_ATCC_33656_uid29071 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 4 0 0 0 1Eubacterium_rectale_uid39159 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 1Eubacterium_rectale_uid39161 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0Eubacterium_siraeum_V10Sc8a_uid45919 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Eubacterium_siraeum_uid39157 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Exiguobacterium_AT1b_uid27759 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Exiguobacterium_sibiricum_255_15_uid10649 5 0 0 0 0 0 7 0 0 0 0 11 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Faecalibacterium_prausnitzii_L2_6_uid45961 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Faecalibacterium_prausnitzii_uid39151 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ferrimonas_balearica_DSM_9799_uid30799 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ferroglobus_placidus_DSM_10642_uid33635 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 6 0 0 0Fervidobacterium_nodosum_Rt17-­‐B1_uid16719 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 8 20 0 0 0Fibrobacter_succinogenes_S85_uid32617 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Fibrobacter_succinogenes_S85_uid47 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Filifactor_alocis_ATCC_35896_uid30485 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Finegoldia_magna_ATCC_29328_uid18981 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Flavobacteriaceae_bacterium_3519_10_uid38559 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0Flavobacteriales_bacterium_HTCC2170_uid13595 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Flavobacterium_johnsoniae_UW101_uid16082 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Flavobacterium_psychrophilum_JIP02_86_uid19979 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Flexistipes_sinusarabici_DSM_4947_uid45817 0 0 0 0 18 28 0 0 0 0 0 0 0 0 27 0 0 5 0 0 0 2 0 2 0 44 0 0Fluviicola_taffensis_DSM_16823_uid47603 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0Francisella_TX077308_uid67139 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0Francisella_cf__novicida_3523_uid62753 0 0 0 0 0 0 0 1 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0Francisella_cf__novicida_Fx1_uid62751 0 0 16 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 3 0 1 0 0Francisella_philomiragia_ATCC_25017_uid27853 0 0 0 0 0 0 0 0 0 0 0 46 0 0 0 0 2 0 0 0 0 0 0 0 0 1 0 0Francisella_tularensis_FSC_198_uid17375 0 0 2 0 0 0 0 50 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 16 1 0Francisella_tularensis_NE061598_uid38289 0 0 3 0 0 0 0 47 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 13 1 0Francisella_tularensis_WY96-­‐3418_uid18459 0 0 3 0 0 0 0 64 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0Francisella_tularensis_holarctica_FTNF002_00_uid20197 0 0 4 0 0 0 0 58 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 39 1 0

Page 26: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Francisella_tularensis_holarctica_LVS_uid16421 0 0 3 0 0 0 0 59 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 38 1 0Francisella_tularensis_holarctica_OSU18_uid17265 0 0 2 0 0 0 0 54 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 36 1 0Francisella_tularensis_mediasiatica_FSC147_uid19571 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 16 1 0Francisella_tularensis_novicida_U112_uid16088 0 0 4 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 16 0 0Francisella_tularensis_tularensis_uid9 0 0 2 0 0 0 0 50 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 16 1 0Frankia_CcI3_uid13963 0 0 0 0 0 0 2 5 0 0 27 0 1 0 27 2 0 0 0 4 0 1 0 6 2 14 0 2Frankia_EAN1pec_uid13915 0 0 0 4 0 10 3 1 0 0 0 0 1 0 2 5 0 0 0 0 0 1 0 0 0 6 5 0Frankia_EuI1c_uid37913 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 3 0 0 0 0Frankia_alni_ACN14a_uid17403 0 0 0 1 0 0 1 0 0 0 1 0 0 0 1 0 0 0 0 1 0 0 0 0 2 2 0 0Frankia_symbiont_of_Datisca_glomerata_uid39113 0 0 0 0 0 0 0 67 0 1 0 0 0 0 2 0 3 0 0 0 0 58 0 0 0 0 16 31Fusobacterium_nucleatum_uid295 0 0 0 2 0 0 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0 1 0 0 0 0 0 0Gallibacterium_anatis_UMN179_uid65483 9 0 1 0 0 0 17 0 0 0 0 0 0 0 18 0 1 0 0 0 0 12 0 0 0 0 2 0Gallionella_capsiferriformans_ES_2_uid32827 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 2 0 0 0 0 0 0 0 0 11 3Gardnerella_vaginalis_409_05_uid31001 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Gardnerella_vaginalis_ATCC_14019_uid31473 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Gardnerella_vaginalis_HMP9231_uid51067 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0Gemmatimonas_aurantiaca_T_27_uid18501 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 9 0 0 0 0 0 0Geobacillus_C56_T3_uid41701 0 0 0 0 11 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 6 0 0 1 0 7 0 0Geobacillus_WCH70_uid20805 0 0 0 9 7 9 60 3 3 0 0 0 0 4 19 11 17 0 0 0 17 0 0 17 3 1 0 10Geobacillus_Y412MC10_uid27777 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Geobacillus_Y412MC52_uid30797 0 0 0 6 16 3 15 0 0 0 0 0 0 0 0 19 0 0 0 0 11 0 0 16 0 1 0 0Geobacillus_Y412MC61_uid30537 0 0 0 6 16 3 15 0 0 0 0 0 0 0 0 17 0 0 0 0 11 0 0 15 0 1 0 0Geobacillus_Y4_1MC1_uid33183 3 0 0 4 0 0 0 0 5 0 0 0 0 0 15 0 21 19 0 0 0 2 0 0 0 0 0 0Geobacillus_kaustophilus_HTA426_uid13233 0 0 0 4 15 0 9 0 0 0 0 0 0 7 0 0 2 0 0 0 5 0 0 6 0 1 5 5Geobacillus_thermodenitrificans_NG80-­‐2_uid18655 0 0 0 10 0 0 0 0 9 0 0 0 0 0 0 0 6 2 0 0 1 6 0 0 0 0 0 0Geobacillus_thermoglucosidasius_C56_YS93_uid40781 0 0 0 0 0 0 0 0 6 0 0 0 0 2 4 0 9 19 0 0 5 3 0 7 0 0 0 0Geobacter_FRC_32_uid16263 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0Geobacter_M18_uid33159 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 3 0Geobacter_M21_uid20729 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 4 0 0 7 0Geobacter_bemidjiensis_Bem_uid17707 0 0 0 0 0 0 0 0 0 0 0 0 0 9 3 0 0 0 0 0 0 0 0 0 0 0 0 0Geobacter_lovleyi_SZ_uid17423 0 0 3 5 0 0 0 0 0 0 0 0 0 2 0 0 3 9 0 0 0 4 0 5 0 10 4 0Geobacter_metallireducens_GS-­‐15_uid177 0 0 0 24 0 0 0 0 0 0 0 0 0 6 0 0 0 8 0 0 0 1 0 14 0 9 10 0Geobacter_sulfurreducens_KN400_uid39745 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 7 0 1 0 1 0 0Geobacter_sulfurreducens_uid192 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 10 0 3 0 4 4 0Geobacter_uraniumreducens_Rf4_uid15768 0 0 4 15 0 0 3 0 0 0 0 0 0 1 0 6 12 4 2 0 0 13 0 0 0 0 15 0Geodermatophilus_obscurus_DSM_43160_uid29547 0 0 0 0 0 0 0 2 0 0 0 0 0 0 1 5 4 0 0 0 1 0 0 0 0 18 0 0Glaciecola_agarilytica_4H_3_7_YE_5_uid62887 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Gloeobacter_violaceus_uid9606 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 9 0Gluconacetobacter_diazotrophicus_PAl_5_FAPERJ_uid377 7 0 0 0 0 0 0 14 0 0 0 0 0 35 0 3 4 8 0 0 0 10 0 6 0 22 0 0Gluconacetobacter_diazotrophicus_PAl_5_JGI_uid21071 10 0 0 0 0 0 0 18 0 0 0 0 0 14 0 5 6 4 0 0 0 14 0 12 0 24 0 0Gluconobacter_oxydans_621H_uid13325 0 0 0 2 0 0 0 0 0 0 0 0 0 24 0 0 0 0 0 0 0 13 0 0 0 32 0 0Gordonia_bronchialis_DSM_43247_uid29549 0 0 0 6 3 7 0 13 0 0 0 0 0 0 0 0 5 4 0 0 0 30 0 0 0 0 0 0Gordonibacter_pamelaeae_7_10_1_b_uid39175 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Gramella_forsetii_KT0803_uid19061 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 5 0 10 0 0 0 0Granulobacter_bethesdensis_CGDNIH1_uid17111 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 4 0 0Haemophilus_ducreyi_35000HP_uid38 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Haemophilus_influenzae_10810_uid50409 0 0 1 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Haemophilus_influenzae_86_028NP_uid11752 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Haemophilus_influenzae_F3031_uid50729 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Haemophilus_influenzae_F3047_uid61001 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Haemophilus_influenzae_PittEE_uid16400 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Haemophilus_influenzae_PittGG_uid16401 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Haemophilus_influenzae_R2846_uid9620 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Haemophilus_influenzae_R2866_uid9621 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Haemophilus_influenzae_uid219 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Haemophilus_parainfluenzae_T3T1_uid50411 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0Haemophilus_parasuis_SH0165_uid31099 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Haemophilus_somnus_129PT_uid322 0 0 6 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Haemophilus_somnus_2336_uid388 0 0 4 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Hahella_chejuensis_KCTC_2396_uid16064 0 0 1 0 0 0 0 6 0 0 0 0 0 0 1 0 0 4 0 0 0 0 0 0 0 8 1 0Halalkalicoccus_jeotgali_B3_uid49517 0 0 0 0 0 0 0 1 2 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 1 0 0

Page 27: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Halanaerobium__sapolanicus__uid43059 1 0 0 0 0 0 0 0 2 0 0 10 0 0 14 0 8 0 0 20 0 11 0 0 6 0 0 0Halanaerobium_praevalens_DSM_2228_uid32591 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Haliangium_ochraceum_DSM_14365_uid28711 0 0 0 0 0 7 0 0 6 0 0 0 0 0 17 0 1 0 0 0 0 2 0 0 0 0 3 0Haliscomenobacter_hydrossis_DSM_1100_uid48289 0 0 0 0 0 0 0 14 0 0 1 0 0 0 3 0 0 2 0 0 7 4 0 4 0 8 6 0Haloarcula_hispanica_ATCC_33960_uid68523 0 4 0 0 0 0 1 2 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 2 0 0 0Haloarcula_marismortui_ATCC_43049_uid105 0 1 3 0 0 0 2 5 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 8 12 4 0Halobacterium_salinarum_R1_uid106 0 12 4 0 0 0 2 1 5 0 0 0 0 0 20 0 0 0 0 0 0 0 0 0 1 6 4 5Halobacterium_sp_uid217 0 24 2 0 0 0 3 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 12 26 6Haloferax_volcanii_DS2_uid12524 0 37 1 0 0 0 0 2 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 7 0 3 21 0Halogeometricum_borinquense_DSM_11551_uid20743 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0Halomicrobium_mukohataei_DSM_12286_uid27945 0 1 6 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 2 0 0Halomonas_elongata_DSM_2581_uid49333 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Halopiger_xanaduensis_SH_6_uid56049 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Haloquadratum_walsbyi_C23_uid49335 0 8 5 0 0 0 0 1 1 0 0 0 0 0 1 10 0 0 0 0 0 0 0 0 0 5 18 4Haloquadratum_walsbyi_uid17185 0 1 0 0 0 0 3 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 3 0Halorhabdus_utahensis_DSM_12940_uid29305 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0Halorhodospira_halophila_SL1_uid15767 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 1 0 0Halorubrum_lacusprofundi_ATCC_49239_uid18455 0 41 0 0 0 0 11 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 15 3 0Haloterrigena_turkmenica_DSM_5511_uid30411 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 1 0 0Halothermothrix_orenii_H_168_uid16377 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Halothiobacillus_neapolitanus_c2_uid31049 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0Helicobacter_acinonychis_Sheeba_uid17251 0 0 0 0 0 0 2 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_bizzozeronii_CIII_1_uid65019 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0Helicobacter_felis_ATCC_49179_uid61189 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_hepaticus_uid185 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_mustelae_12198_uid40677 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0Helicobacter_pylori_2017_uid62769 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_2018_uid62771 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_26695_uid233 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_35A_uid34603 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_51_uid9627 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0Helicobacter_pylori_83_uid34609 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_908_uid50869 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_B38_uid39685 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0Helicobacter_pylori_B8_uid41831 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_Cuz20_uid49029 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0Helicobacter_pylori_F16_uid50589 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 2 1 0 0 0 0 0 0 0 0Helicobacter_pylori_F30_uid50591 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_F32_uid50593 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_F57_uid50595 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_G27_uid31341 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_Gambia94_24_uid53317 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_HPAG1_uid16183 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_India7_uid53321 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_J99_uid234 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_Lithuania75_uid53323 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_P12_uid32291 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_PeCan4_uid49027 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_Puno120_uid69917 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_Puno135_uid69919 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_SJM180_uid49025 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_Santal49_uid69921 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_Sat464_uid49453 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0Helicobacter_pylori_Shi470_uid29045 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_SouthAfrica7_uid53319 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_uid39507 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Helicobacter_pylori_v225d_uid36609 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Heliobacterium_modesticaldum_Ice1_uid13427 31 0 0 0 0 13 0 0 0 0 0 0 0 0 4 0 0 0 2 0 0 6 0 0 0 0 0 0Herbaspirillum_seropedicae_SmR1_uid47945 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Herminiimonas_arsenicoxydans_uid13467 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 9 0 2 0 0 0 0Herpetosiphon_aurantiacus_ATCC_23779_uid16523 0 0 0 0 0 0 0 0 0 0 0 0 0 0 18 0 2 0 0 0 0 0 0 0 0 0 5 0

Page 28: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Hippea_maritima_DSM_10411_uid48195 0 0 0 0 0 0 0 0 0 0 0 0 0 6 8 0 9 0 0 0 0 0 0 0 0 0 0 0Hirschia_baltica_ATCC_49814_uid33191 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Hydrogenobacter_thermophilus_TK_6_uid34131 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0Hydrogenobacter_thermophilus_TK_6_uid41547 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0Hydrogenobaculum_Y04AAS1_uid18891 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Hyperthermus_butylicus_uid208 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Hyphomicrobium_denitrificans_ATCC_51888_uid33261 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0Hyphomicrobium_uid67153 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Hyphomonas_neptunium_ATCC_15444_uid15721 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 1 0 9 0 0 0 0Idiomarina_loihiensis_L2TR_uid10790 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 5 0Ignicoccus_hospitalis_KIN4_I_uid13914 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ignisphaera_aggregans_DSM_17230_uid33361 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ilyobacter_polytropus_DSM_2926_uid32577 0 0 0 0 0 0 2 0 0 0 0 0 0 0 11 0 0 0 0 0 0 0 0 0 0 0 0 0Intrasporangium_calvum_DSM_43043_uid43527 0 0 0 7 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 2 0 0 0 0 0 0Isoptericola_variabilis_225_uid49657 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 4 0 0 0 0Isosphaera_pallida_ATCC_43644_uid32825 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Jannaschia_CCS1_uid12733 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 3 0 0 0 0Janthinobacterium_Marseille_uid16549 0 0 0 0 0 0 0 0 0 0 0 4 0 0 1 0 0 3 0 0 0 1 0 0 0 0 0 0Jonesia_denitrificans_DSM_20603_uid20833 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Kangiella_koreensis_DSM_16069_uid29443 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ketogulonicigenium_vulgare_Y25_uid51787 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0Ketogulonigenium_vulgarum_WSH_001_uid48079 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0Kineococcus_radiotolerans_SRS30216_uid10689 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Klebsiella_pneumoniae_342_uid28471 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 5 1 7 0 0Klebsiella_pneumoniae_KCTC_2242_uid67293 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 7 0 1 0 0 0 0Klebsiella_pneumoniae_MGH_78578_uid31 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 6 4 13 7 0 0Klebsiella_pneumoniae_NTUH_K2044_uid21069 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 8 0 0Klebsiella_variicola_At_22_uid37701 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Kocuria_rhizophila_DC2201_uid27833 0 0 0 1 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0Kosmotoga_olearia_TBF_19_5_1_uid29419 0 0 0 1 0 0 0 0 7 0 0 0 0 0 11 2 7 9 0 0 0 9 0 0 0 0 0 0Kribbella_flavida_DSM_17836_uid21089 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Krokinobacter_diaphorus_4H_3_7_5_uid62895 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Kytococcus_sedentarius_DSM_20547_uid21067 0 0 0 20 0 0 0 0 0 0 0 2 0 0 9 0 7 0 0 0 0 8 0 0 0 0 0 0Lacinutrix_5H_3_7_4_uid62899 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Lactobacillus_acidophilus_30SC_uid53145 6 0 0 4 0 0 5 0 0 0 0 2 0 0 0 0 1 0 0 0 3 7 0 13 0 0 5 6Lactobacillus_acidophilus_NCFM_uid82 2 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 1 0 0 0 0Lactobacillus_amylovorus_GRL_1112_uid42073 6 0 0 3 0 0 7 0 0 0 0 8 0 0 1 0 2 0 0 0 7 4 0 2 0 0 3 7Lactobacillus_amylovorus_GRL_1118_uid42079 3 0 0 0 0 0 2 0 0 0 0 3 0 0 5 0 0 0 0 0 0 1 0 2 0 0 1 0Lactobacillus_brevis_ATCC_367_uid404 0 0 0 3 0 0 0 0 0 0 0 24 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0Lactobacillus_buchneri_NRRL_B_30929_uid29003 0 0 0 19 0 0 0 0 0 0 0 18 0 0 0 0 2 0 0 0 0 6 0 0 0 11 0 0Lactobacillus_casei_ATCC_334_uid402 0 0 0 10 0 0 0 0 0 0 0 57 0 0 0 0 0 0 0 0 0 13 0 0 1 16 0 4Lactobacillus_casei_BD_II_uid64431 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 5 0 0 0 6 0 0Lactobacillus_casei_LC2W_uid64433 0 0 0 0 0 0 0 0 0 0 0 19 0 0 0 0 0 0 0 0 0 9 0 0 5 13 0 3Lactobacillus_casei_Zhang_uid28537 0 0 0 16 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Lactobacillus_casei_uid30359 0 0 0 0 0 0 0 0 0 0 0 19 0 0 1 0 0 0 0 0 0 8 0 0 4 11 0 3Lactobacillus_crispatus_ST1_uid46813 0 0 0 3 0 0 6 0 0 0 0 5 0 0 5 0 7 0 0 0 2 1 0 2 0 0 0 0Lactobacillus_delbrueckii_bulgaricus_2038_uid16120 0 0 0 4 0 0 0 0 0 0 0 12 0 0 0 0 0 0 0 3 0 1 0 8 0 0 10 0Lactobacillus_delbrueckii_bulgaricus_ATCC_BAA-­‐365_uid403 0 0 0 2 0 0 0 0 0 0 0 10 0 0 2 0 2 0 0 0 3 1 0 4 0 0 1 0Lactobacillus_delbrueckii_bulgaricus_ND02_uid49147 0 0 0 3 2 0 0 0 0 0 0 7 0 0 6 0 7 0 0 10 0 4 0 12 0 0 10 7Lactobacillus_delbrueckii_bulgaricus_uid16871 0 0 0 1 0 0 0 0 0 0 0 11 0 0 2 0 0 0 0 0 1 1 0 1 0 0 15 0Lactobacillus_fermentum_CECT_5716_uid43533 0 0 0 8 0 0 5 0 0 0 0 14 0 0 1 0 27 0 0 0 0 23 0 0 0 0 0 0Lactobacillus_fermentum_IFO_3956_uid18979 0 0 0 8 0 0 7 0 0 0 0 18 0 0 3 0 28 0 0 0 0 25 0 0 0 0 0 0Lactobacillus_gasseri_ATCC_33323_uid84 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Lactobacillus_helveticus_DPC_4571_uid17811 11 0 0 27 0 0 32 0 0 0 0 7 0 0 17 0 28 0 0 7 15 3 0 2 0 18 2 2Lactobacillus_helveticus_H10_uid49153 0 0 0 22 0 0 26 0 0 0 0 6 0 0 9 0 15 0 0 0 10 5 0 8 0 2 0 0Lactobacillus_johnsonii_DPC_6026_uid56053 1 0 0 6 0 0 4 0 0 0 0 0 0 0 1 0 0 0 0 0 3 4 0 8 0 0 0 2Lactobacillus_johnsonii_FI9785_uid36575 0 0 0 1 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0 3 5 0 2 0 0 0 0Lactobacillus_johnsonii_NCC_533_uid9638 0 0 0 5 0 0 2 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Lactobacillus_kefiranofaciens_ZW3_uid66525 0 0 0 7 0 0 6 0 0 0 0 3 0 0 12 0 9 0 0 7 3 7 0 3 0 3 0 0Lactobacillus_plantarum_JDM1_uid32969 11 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Lactobacillus_plantarum_ST_III_uid52949 6 0 0 10 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0

Page 29: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Lactobacillus_plantarum_uid356 5 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0Lactobacillus_reuteri_DSM_20016_uid15766 3 0 0 2 0 0 17 0 0 0 0 6 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 5Lactobacillus_reuteri_F275_Kitasato_uid19011 5 0 0 4 0 0 17 0 0 0 0 6 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 5Lactobacillus_reuteri_SD2112_uid30643 3 0 0 22 0 0 14 0 0 0 0 83 0 0 0 0 0 4 0 0 0 6 0 6 0 0 0 4Lactobacillus_rhamnosus_GG_uid32195 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 5 0 0 0 33 0 0Lactobacillus_rhamnosus_GG_uid40637 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 5 0 0 0 31 0 0Lactobacillus_rhamnosus_Lc_705_uid32197 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 3 0 7 0 0Lactobacillus_sakei_23K_uid13435 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0Lactobacillus_salivarius_CECT_5713_uid43535 0 0 0 22 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 3 0 0 0 0 0 0Lactobacillus_salivarius_UCC118_uid13280 0 0 0 22 0 0 2 0 0 0 0 0 0 0 0 0 1 0 0 0 0 4 0 0 0 0 0 0Lactobacillus_sanfranciscensis_TMW_1_1304_uid43579 0 0 0 0 0 0 0 0 0 0 0 1 0 0 13 0 1 0 0 0 0 5 0 0 0 0 0 0Lactococcus_garvieae_ATCC_49156_uid18705 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 12 0 0 0 0 0 0Lactococcus_garvieae_Lg2_uid18707 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 12 0 0 0 0 0 0Lactococcus_lactis_CV56_uid60377 0 0 0 0 0 0 0 0 0 3 0 9 0 0 0 0 0 0 0 0 1 6 0 0 4 0 0 0Lactococcus_lactis_KF147_uid41115 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 0 3 0 0 0Lactococcus_lactis_cremoris_MG1363_uid18797 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 14 7 0 0 1 27 0 0 2 0 1 0Lactococcus_lactis_cremoris_NZ9000_uid46623 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 14 7 0 0 1 34 0 0 2 0 1 0Lactococcus_lactis_cremoris_SK11_uid401 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0 56 20 0 0 12 0 0 0Lactococcus_lactis_uid72 0 0 0 0 0 0 0 0 0 0 0 15 0 0 0 0 1 0 0 0 2 26 0 0 0 0 0 0Laribacter_hongkongensis_HLHK9_uid30817 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0Lawsonia_intracellularis_PHE_MN1-­‐00_uid183 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Leadbetterella_byssophila_DSM_17132_uid38283 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 23 0 0 0 0 0 0Legionella_longbeachae_NSW150_uid39579 3 0 0 4 0 0 0 2 0 0 1 0 0 0 5 0 0 0 0 0 0 0 0 3 4 2 3 0Legionella_pneumophila_130b_uid51615 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 2 0Legionella_pneumophila_2300_99_Alcoy_uid18743 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0Legionella_pneumophila_Corby_uid17491 0 0 0 5 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0 0 0 0 0 0 0 0 2 0Legionella_pneumophila_Lens_uid13126 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 6 0Legionella_pneumophila_Paris_uid13127 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 1 0 0 0 2 9 0Legionella_pneumophila_Philadelphia_1_uid22 0 0 0 1 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 2 0 0 0 1 7 0Leifsonia_xyli_xyli_CTCB0_uid212 0 0 0 0 0 0 0 0 0 0 0 15 0 26 2 0 1 6 0 0 0 1 0 4 0 2 0 0Leptospira_biflexa_serovar_Patoc__Patoc_1__Ames__uid16153 0 0 0 0 0 0 0 0 5 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Leptospira_biflexa_serovar_Patoc__Patoc_1__Paris__uid20133 0 0 0 0 0 0 0 0 2 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Leptospira_borgpetersenii_serovar_Hardjo-­‐bovis_JB197_uid16148 0 0 0 0 0 0 0 0 0 0 0 0 0 3 93 0 0 0 0 0 0 8 0 1 0 0 18 0Leptospira_borgpetersenii_serovar_Hardjo-­‐bovis_L550_uid16146 0 0 0 0 0 0 0 0 0 0 0 0 0 3 95 0 0 0 0 0 2 9 0 0 0 0 18 0Leptospira_interrogans_serovar_Copenhageni_uid10687 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 5 0 0 0 0Leptospira_interrogans_serovar_Lai_uid293 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 12 0 41 0 0 0 0Leptothrix_cholodnii_SP_6_uid20039 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 10 0 0 0 2 0 0Leptotrichia_buccalis_DSM_1135_uid29445 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0 0 0 0 0 0 0 0 0 0 0Leuconostoc_C2_uid67911 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Leuconostoc_citreum_KM20_uid16062 0 0 0 0 0 0 0 0 0 0 0 1 0 0 2 0 0 0 0 0 0 5 0 0 0 0 0 0Leuconostoc_gasicomitatum_LMG_18811_uid48123 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Leuconostoc_kimchii_IMSNU11154_uid40837 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Leuconostoc_mesenteroides_ATCC_8293_uid315 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Listeria_innocua_uid86 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Listeria_monocytogenes_08_5923_uid36363 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Listeria_monocytogenes_10403S_uid17495 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Listeria_monocytogenes_4b_F2365_uid85 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Listeria_monocytogenes_Clip81459_uid32207 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Magnetococcus_MC-­‐1_uid262 0 0 0 0 2 2 0 1 0 0 0 0 0 0 7 0 0 0 1 0 0 3 0 0 0 1 11 9Listeria_monocytogenes_FSL_R2_561_uid17317 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Listeria_monocytogenes_Finland_1988_uid17319 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Listeria_monocytogenes_HCC23_uid29409 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Listeria_monocytogenes_J0161_uid17493 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Listeria_monocytogenes_M7_uid66625 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Listeria_monocytogenes_serotype_4a_L99_uid31327 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Listeria_monocytogenes_uid276 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Listeria_monocytogenes_uid36361 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Listeria_seeligeri_serovar_1_2b_SLCC3954_uid41123 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0Listeria_welshimeri_serovar_6b_SLCC5334_uid13443 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Listonella_anguillarum_775_uid51883 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 8 0 0 5 0 0 0 15 0 16Lysinibacillus_sphaericus_C3_41_uid19619 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0

Page 30: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Macrococcus_caseolyticus_JCSC5402_uid20209 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0Magnetospirillum_magneticum_AMB-­‐1_uid16217 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 1 0 0 0 0 0 14 0 1 0 0 0 2Mahella_australiensis_50_1_BON_uid42243 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0Mannheimia_succiniciproducens_MBEL55E_uid13068 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Maricaulis_maris_MCS10_uid17333 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Marinithermus_hydrothermalis_DSM_14884_uid50827 0 0 0 0 0 0 0 0 0 0 0 0 0 0 29 0 0 0 0 1 0 0 0 0 0 0 0 0Marinobacter_aquaeolei_VT8_uid13239 2 0 0 2 0 6 0 4 4 4 0 0 0 0 0 0 5 23 0 0 0 11 0 4 0 7 0 0Marinomonas_MWYL1_uid17445 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 15 0 0 0 1 0 0Marinomonas_mediterranea_MMB_1_uid51765 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 4 0 0 0 0 0 0 3 0 5 0 0Marinomonas_posidonica_IVIA_Po_181_uid52545 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Marivirga_tractuosa_DSM_4126_uid37901 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 3 0 0 0 0Megamonas_hypermegale_uid39163 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Megasphaera_elsdenii_uid70535 3 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 1 0 0 0 0 11 0 0 0 0 0 0Meiothermus_ruber_DSM_1279_uid28827 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 1 0Meiothermus_silvanus_DSM_9946_uid29551 0 0 0 0 4 0 2 0 0 0 0 0 0 76 21 34 16 0 0 6 7 8 0 0 0 0 0 0Melissococcus_plutonius_ATCC_35311_uid61383 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mesoplasma_florum_L1_uid10650 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mesorhizobium_BNC1_uid10690 0 0 1 0 0 1 0 0 0 2 0 0 0 0 4 0 0 0 0 0 0 0 0 0 1 0 0 0Mesorhizobium_ciceri_biovar_biserrulae_WSM1271_uid48991 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0Mesorhizobium_loti_uid18 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 4 0 0 0 0Mesorhizobium_opportunistum_WSM2075_uid33861 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0 0Metallosphaera_cuprina_Ar_4_uid65497 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Metallosphaera_sedula_DSM_5348_uid17447 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0Methanobacterium_AL_21_uid53495 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanobacterium_SWAN_1_uid53561 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanobacterium_thermoautotrophicum_uid289 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanobrevibacter_ruminantium_M1_uid34681 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 6 0 0Methanobrevibacter_smithii_ATCC_35061_uid18653 0 0 0 0 0 0 0 2 13 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0Methanocaldococcus_FS406_22_uid37943 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 0Methanocaldococcus_fervens_AG86_uid32615 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanocaldococcus_infernus_ME_uid32611 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanocaldococcus_vulcanius_M7_uid33047 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanocella_paludicola_SANAE_uid20361 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 6 0 0 0 0 1 0 0 0 0 0 0Methanococcoides_burtonii_DSM_6242_uid9634 0 6 0 3 0 0 0 0 10 0 0 0 0 0 10 0 4 0 2 0 0 0 10 0 0 22 0 4Methanococcus_aeolicus_Nankai-­‐3_uid18641 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0Methanococcus_jannaschii_uid102 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0Methanococcus_maripaludis_C5_uid17641 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanococcus_maripaludis_C6_uid19639 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanococcus_maripaludis_C7_uid18819 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanococcus_maripaludis_S2_uid10632 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanococcus_vannielii_SB_uid17889 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanococcus_voltae_A3_uid20131 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanocorpusculum_labreanum_Z_uid18109 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanoculleus_marisnigri_JR1_uid16330 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0Methanohalobium_evestigatum_Z_7303_uid37945 0 0 0 0 0 0 42 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0Methanohalophilus_mahii_DSM_5219_uid30711 0 0 0 0 0 0 1 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 4 0 0Methanoplanus_petrolearius_DSM_11571_uid40773 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0Methanopyrus_kandleri_uid294 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanosaeta_concilii_GP_6_uid63185 0 0 0 3 8 0 15 4 4 0 0 0 4 3 0 1 5 2 0 0 0 7 0 19 0 7 0 2Methanosaeta_thermophila_PT_uid15765 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 9 0 0Methanosalsum_zhilinae_DSM_4017_uid40771 0 0 0 0 3 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 4 0 0Methanosarcina_acetivorans_uid290 0 0 0 2 14 0 4 3 7 0 0 0 0 19 30 0 0 19 0 0 0 0 19 4 0 8 1 3Methanosarcina_barkeri_fusaro_uid103 0 1 0 0 6 0 19 1 1 0 0 0 0 0 12 3 3 0 0 0 0 0 5 2 0 16 5 0Methanosarcina_mazei_uid300 1 5 0 0 11 0 12 9 6 0 0 0 0 0 7 0 0 2 0 0 0 0 1 7 0 1 0 15Methanosphaera_stadtmanae_uid15579 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanospirillum_hungatei_JF-­‐1_uid13015 0 0 0 0 12 0 0 0 0 0 0 0 0 0 27 7 0 0 0 0 0 0 9 0 0 1 5 0Methanothermobacter_marburgensis_Marburg_uid40103 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanothermococcus_okinawensis_IH1_uid43653 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0Methanothermus_fervidus_DSM_2088_uid33689 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methanotorris_igneus_Kol_5_uid51821 0 0 0 0 0 0 0 0 0 0 0 0 0 0 22 3 0 0 0 0 0 0 0 0 0 0 0 0Methylacidiphilum_infernorum_V4_uid28995 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0

Page 31: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Methylibium_petroleiphilum_PM1_uid10789 0 0 0 0 0 0 0 0 0 3 0 0 0 0 1 0 0 3 0 0 0 5 0 3 0 15 0 0Methylobacillus_flagellatus_KT_uid10647 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methylobacterium_4_46_uid18809 0 0 0 0 0 0 0 0 0 0 0 16 0 4 0 1 6 9 0 0 0 11 0 8 10 4 0 0Methylobacterium_chloromethanicum_CM4_uid19527 0 0 0 2 0 0 0 6 0 0 0 0 0 2 0 1 4 1 0 0 0 17 0 7 0 6 0 0Methylobacterium_extorquens_AM1_uid20 1 0 0 10 0 8 4 5 2 3 0 0 0 16 0 0 25 10 0 0 0 53 0 5 2 6 0 0Methylobacterium_extorquens_DM4_uid16093 1 0 2 4 0 0 1 8 0 0 0 1 0 8 3 1 5 3 0 0 0 18 0 12 0 15 0 0Methylobacterium_extorquens_PA1_uid18637 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 2 0 4 0 4 0 0Methylobacterium_nodulans_ORS_2060_uid20477 41 0 0 8 0 0 0 15 0 0 1 0 0 9 5 0 0 0 1 0 0 2 0 32 16 23 0 13Methylobacterium_populi_BJ001_uid19559 1 0 2 0 0 15 0 0 1 2 0 0 0 0 0 2 14 0 0 0 0 4 0 3 2 5 0 0Methylobacterium_radiotolerans_JCM_2831_uid18817 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 15 0 0 0 8 0 0Methylocella_silvestris_BL2_uid20635 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 4 0 0 0 0 0 0Methylococcus_capsulatus_Bath_uid21 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 2 0 0 0 0 7 0 0 0 21 0 0Methylomonas_methanica_MC09_uid53917 4 0 0 0 0 6 2 9 0 0 4 0 0 0 1 0 0 5 0 0 0 44 0 0 0 0 0 0Methylotenera_301_uid39983 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 2 0 0 0 0Methylotenera_mobilis_JLW8_uid33319 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methylovorus_MP688_uid52541 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Methylovorus_SIP3_4_uid33241 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Microbacterium_testaceum_StLB037_uid62249 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 4 0 0Micrococcus_luteus_NCTC_2665_uid20655 0 0 0 11 0 0 0 0 0 0 0 0 0 0 0 0 12 0 0 0 0 0 0 2 0 0 0 0Microcystis_aeruginosa_NIES_843_uid27835 0 0 0 5 66 12 10 63 0 0 16 7 8 0 76 2 0 0 0 5 0 0 0 0 0 42 18 0Microlunatus_phosphovorus_NM_1_uid66879 0 0 0 0 1 0 0 1 0 0 11 0 0 0 0 0 0 4 0 0 0 22 0 0 0 0 4 0Micromonospora_L5_uid38291 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 2 0 0 0 0 4 0 13 0 6 0 0Micromonospora_aurantiaca_ATCC_27029_uid37957 0 0 0 0 0 0 0 0 0 0 0 0 0 4 5 2 2 0 0 0 0 8 0 3 0 8 0 0Mobiluncus_curtisii_ATCC_43063_uid31519 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Moorella_thermoacetica_ATCC_39073_uid10648 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 5 0 0 0 0 0 0Moraxella_catarrhalis_RH4_uid46869 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Muricauda_ruestringensis_DSM_13258_uid52467 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 4 0 0 0 0Mycobacterium_JDM601_uid51513 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 5 0 0 0 0 4 0 0 0 0 0 0Mycobacterium_JLS_uid16079 0 0 0 0 0 0 0 0 0 0 0 3 0 0 2 0 0 0 0 0 0 15 0 1 0 0 0 0Mycobacterium_KMS_uid16081 0 0 1 0 0 0 0 0 0 0 0 2 0 9 1 0 16 0 0 0 0 14 0 6 0 0 0 0Mycobacterium_MCS_uid15762 0 0 0 0 0 0 0 0 0 0 0 2 0 11 0 0 14 0 0 0 0 11 0 3 0 0 0 0Mycobacterium_Spyr1_uid28521 5 0 0 8 11 8 0 0 0 0 0 3 0 4 3 0 19 0 0 0 0 36 0 8 0 0 8 0Mycobacterium_abscessus_ATCC_19977_uid15691 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Mycobacterium_africanum_GM041182_uid68095 0 0 0 2 0 0 0 0 1 0 0 0 0 1 3 0 7 2 0 6 0 7 0 2 0 2 0 0Mycobacterium_avium_104_uid88 0 0 0 0 0 1 0 0 0 0 0 0 0 5 1 0 25 4 0 0 0 11 0 0 0 0 0 0Mycobacterium_avium_paratuberculosis_uid91 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 0 7 0 0 0 0 0 0 20 0 0 0 0Mycobacterium_bovis_BCG_Pasteur_1173P2_uid18059 0 0 0 2 0 0 0 0 1 0 0 0 0 1 2 0 7 1 0 6 0 1 0 3 0 2 0 0Mycobacterium_bovis_BCG_Tokyo_172_uid31211 0 0 0 2 0 0 0 0 1 0 0 0 0 1 2 0 7 1 0 6 0 2 0 3 0 2 0 0Mycobacterium_bovis_uid89 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 7 2 0 5 0 1 0 2 0 2 0 0Mycobacterium_canettii_uid68135 0 0 0 6 2 0 0 0 0 0 0 0 0 0 7 0 7 1 0 5 0 14 0 7 0 4 0 0Mycobacterium_gilvum_PYR-­‐GCK_uid15760 5 0 1 7 0 3 0 0 0 0 0 2 0 0 11 0 12 0 0 0 0 23 0 4 0 0 3 0Mycobacterium_leprae_Br4923_uid31271 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycobacterium_leprae_uid90 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycobacterium_marinum_M_uid16725 0 0 0 0 0 5 0 0 0 0 0 0 0 4 2 0 0 0 0 0 0 11 0 0 0 0 0 0Mycobacterium_smegmatis_MC2_155_uid92 0 0 0 27 6 4 0 9 0 0 0 1 0 1 2 0 2 2 0 0 0 13 0 8 0 0 0 0Mycobacterium_tuberculosis_CCDC5079_uid19585 0 0 0 2 0 0 0 0 1 0 0 0 0 1 2 0 7 2 0 5 0 15 0 2 0 2 0 0Mycoplasma_genitalium_uid97 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycobacterium_tuberculosis_CCDC5180_uid19583 0 0 0 2 0 0 0 0 1 0 0 0 0 1 2 0 7 2 0 5 0 18 0 2 0 2 0 0Mycobacterium_tuberculosis_CDC1551_uid223 0 0 0 2 0 0 0 0 1 0 0 0 0 1 2 0 7 2 0 6 0 4 0 3 0 2 0 0Mycobacterium_tuberculosis_CTRI_2_uid43171 0 0 0 2 0 0 0 0 1 0 0 0 0 1 3 0 7 1 0 5 0 15 0 2 0 2 0 0Mycobacterium_tuberculosis_F11_uid15642 0 0 0 2 0 0 0 0 1 0 0 0 0 1 2 0 7 2 0 6 0 17 0 3 0 2 0 0Mycobacterium_tuberculosis_H37Ra_uid18883 0 0 0 2 0 0 0 0 1 0 0 0 0 1 3 0 7 2 0 5 0 17 0 0 0 2 0 0Mycobacterium_tuberculosis_H37Rv_uid224 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 7 3 0 5 0 16 0 2 0 2 0 0Mycobacterium_tuberculosis_KZN_1435_uid21055 0 0 0 2 0 0 0 0 3 0 0 0 0 0 2 0 7 2 0 6 0 14 0 2 0 2 0 0Mycobacterium_tuberculosis_KZN_4207_uid21053 0 0 0 2 0 0 0 0 3 0 0 0 0 0 2 0 7 2 0 6 0 12 0 2 0 2 0 0Mycobacterium_ulcerans_Agy99_uid16230 0 0 0 0 0 0 0 0 0 0 200 0 0 0 0 0 79 0 0 0 0 0 0 0 0 0 0 0Mycobacterium_vanbaalenii_PYR-­‐1_uid15761 4 0 6 0 5 3 1 0 0 0 0 2 0 5 4 0 13 0 0 0 0 12 0 8 0 2 0 0Mycoplasma_agalactiae_PG2_uid16095 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_arthritidis_158L3_1_uid1422 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_bovis_Hubei_1_uid62195 0 0 0 0 3 0 0 0 0 0 0 11 0 0 7 0 0 0 0 0 0 9 0 0 0 0 0 0Mycoplasma_bovis_PG45_uid12525 0 0 0 0 10 0 0 0 0 0 0 15 0 0 21 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 32: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Mycoplasma_capricolum_ATCC_27343_uid16208 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_conjunctivae_HRC_581_uid32285 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_crocodyli_MP145_uid29021 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_fermentans_JER_uid36551 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0 0 0 0 3 0 0 0 0 0 0Mycoplasma_fermentans_M64_uid13415 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Mycoplasma_fermentans_PG18_uid28473 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Mycoplasma_gallisepticum_F_uid43301 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 12 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_gallisepticum_R_high__uid43299 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_gallisepticum_uid409 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_haemofelis_Langford_1_uid61503 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_haemofelis_Ohio2_uid50029 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_hyopneumoniae_168_uid53881 0 0 0 0 1 0 0 0 0 0 0 0 0 0 12 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_hyopneumoniae_232_uid13120 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_hyopneumoniae_7448_uid10639 0 0 0 0 1 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_hyopneumoniae_J_uid10675 0 0 0 0 1 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 2 0 0 0 0 0 0Mycoplasma_hyorhinis_HUB_1_uid51405 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 25 0 0 0 0 0 0Mycoplasma_hyorhinis_MCLD_uid61173 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 22 0 0 0 0 0 0Mycoplasma_leachii_99_014_6_uid50599 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Mycoplasma_leachii_PG50_uid27715 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Mycoplasma_mobile_163K_uid10697 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_mycoides_SC_Gladysdale_uid27713 0 0 0 0 56 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 22 0 0 0 0 0 0Mycoplasma_mycoides_capri_GM12_uid19245 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_mycoides_capri_GM12_uid39245 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_mycoides_capri_LC_95010_uid50391 0 0 0 0 6 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 8 0 0 0 0 0 0Mycoplasma_mycoides_uid10616 0 0 0 0 1 0 0 0 0 0 0 0 0 0 86 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_penetrans_uid176 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_pneumoniae_FH_uid49525 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_pneumoniae_uid99 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_pulmonis_uid100 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_putrefaciens_KS1_uid46453 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_suis_Illinois_uid49421 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Mycoplasma_synoviae_53_uid10676 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Myxococcus_fulvus_HW_1_uid27837 0 0 0 0 0 0 0 3 0 0 0 0 4 5 0 0 4 3 0 0 0 0 0 0 0 0 0 1Myxococcus_xanthus_DK_1622_uid1421 0 0 0 0 0 0 0 2 0 0 0 2 0 0 0 6 0 3 0 0 0 4 0 0 0 4 0 1Nakamurella_multipartita_DSM_44233_uid29537 13 0 0 18 0 12 0 0 2 0 0 3 0 5 12 4 20 2 0 0 0 17 0 36 0 0 0 0Nanoarchaeum_equitans_uid9599 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Natranaerobius_thermophilus_JW_NM_WN_LF_uid20207 0 0 0 0 0 0 0 0 9 0 0 0 0 0 1 0 2 0 0 0 0 13 0 12 11 0 0 0Natrialba_magadii_ATCC_43099_uid30691 0 0 0 0 0 0 2 0 1 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 3 0 0Natronomonas_pharaonis_uid15742 0 0 2 0 0 0 21 0 1 0 0 0 0 0 2 0 0 0 0 2 0 0 0 0 1 1 0 0Nautilia_profundicola_AmH_uid32573 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Neisseria_gonorrhoeae_FA_1090_uid23 0 0 16 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 9 0 0 0 0Neisseria_gonorrhoeae_NCCP11945_uid29335 0 0 17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 9 0 7 0 0Neisseria_gonorrhoeae_TCDC_NG08107_uid61377 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 9 0 1 0 0Neisseria_lactamica_020_06_uid13472 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 3 0 0Neisseria_meningitidis_053442_uid16393 0 0 9 0 0 0 0 1 0 0 0 6 0 0 1 0 0 0 0 0 0 1 0 2 0 15 0 0Neisseria_meningitidis_8013_uid34687 0 0 8 0 0 0 0 2 0 0 0 10 0 0 0 0 0 0 0 0 0 1 0 2 0 18 0 0Neisseria_meningitidis_FAM18_uid255 0 0 5 0 0 0 0 1 0 0 0 9 0 0 0 0 0 0 0 0 0 1 0 8 0 14 0 0Neisseria_meningitidis_G2136_uid61085 0 0 6 0 0 0 0 1 0 0 0 8 0 0 0 0 0 0 0 0 0 1 0 4 0 29 0 0Neisseria_meningitidis_H44_76_uid61079 0 0 10 0 0 0 0 0 0 0 0 15 0 0 0 0 0 0 0 0 0 1 0 4 0 7 0 0Neisseria_meningitidis_M01_240149_uid61069 0 0 9 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 1 0 2 0 13 0 0Neisseria_meningitidis_M01_240355_uid61063 0 0 10 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0 0 0 0 1 0 2 0 20 0 0Neisseria_meningitidis_M04_240196_uid61073 0 0 8 0 0 0 0 0 0 0 0 12 0 0 0 0 0 0 0 0 0 1 0 4 0 19 0 0Neisseria_meningitidis_MC58_uid251 0 0 5 0 0 0 0 0 0 0 0 14 0 0 0 0 0 0 0 0 0 1 0 4 0 7 0 0Neisseria_meningitidis_NZ_05_33_uid61065 0 0 9 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 1 0 4 0 2 0 0Neisseria_meningitidis_WUE_2594_uid61527 0 0 6 0 0 0 0 0 0 0 0 12 0 0 0 0 0 0 0 0 0 1 0 1 0 16 0 0Neisseria_meningitidis_Z2491_uid252 0 0 10 0 0 0 0 2 0 0 0 7 0 0 0 0 0 0 0 0 0 1 0 2 0 15 0 0Neisseria_meningitidis_alpha14_uid39689 0 0 5 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 1 0 1 0 39 0 0Neisseria_meningitidis_alpha710_uid36319 0 0 1 0 0 0 0 0 0 0 0 14 0 0 0 0 0 0 0 0 0 1 0 3 0 12 0 0Neorickettsia_risticii_Illinois_uid19099 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Neorickettsia_sennetsu_Miyayama_uid357 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Nitratifractor_salsuginis_DSM_16511_uid46883 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0 0 0 0 0 0 0 0 0

Page 33: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Nitratiruptor_SB155-­‐2_uid18963 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Nitrobacter_hamburgensis_X14_uid13473 0 0 4 0 0 3 0 23 0 0 0 0 0 0 3 3 10 5 0 0 0 1 0 19 0 32 0 1Nitrobacter_winogradskyi_Nb-­‐255_uid13474 0 0 5 0 0 0 0 26 0 0 0 0 0 24 0 0 0 0 0 0 0 25 0 0 0 15 0 0Nitrosococcus_halophilus_Nc4_uid36589 0 0 0 0 2 0 15 2 0 0 0 0 0 0 0 0 12 0 0 2 0 0 5 0 0 0 10 0Nitrosococcus_oceani_ATCC_19707_uid13993 0 0 0 0 0 0 10 16 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0 0Nitrosococcus_watsoni_C_113_uid40331 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0Nitrosomonas_AL212_uid32989 0 0 1 0 0 0 0 0 0 0 0 0 0 0 1 0 2 29 0 0 0 0 0 0 0 33 0 12Nitrosomonas_Is79A3_uid52837 0 0 0 0 0 5 0 11 3 0 0 0 0 0 7 0 0 0 0 0 0 21 0 6 0 32 8 9Nitrosomonas_europaea_uid52 0 0 3 0 0 0 0 0 0 0 0 0 0 41 0 0 0 0 0 0 5 14 0 0 0 20 0 0Nitrosomonas_eutropha_C71_uid13913 0 0 1 0 0 0 0 7 0 2 0 0 0 24 0 0 2 0 0 0 0 19 0 0 0 22 0 0Nitrosopumilus_maritimus_SCM1_uid19265 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Nitrosospira_multiformis_ATCC_25196_uid13912 0 0 0 0 0 0 0 0 0 0 0 2 0 6 0 0 0 0 0 0 0 16 0 0 0 18 0 0Nocardia_farcinica_IFM10152_uid13117 0 0 0 0 0 0 0 0 0 1 0 0 0 2 1 0 0 0 0 0 0 0 0 0 0 7 0 0Nocardioides_JS614_uid12738 0 0 0 0 0 1 0 0 0 0 0 0 0 0 5 0 2 2 0 0 0 6 0 6 0 0 0 0Nocardiopsis_dassonvillei_DSM_43111_uid19709 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Nostoc_azollae__0708_uid30807 0 0 0 0 0 0 0 14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 47 0 0Nostoc_punctiforme_PCC_73102_uid216 1 0 0 0 0 0 0 29 0 5 0 0 12 0 9 9 0 0 0 0 0 0 1 0 0 29 25 0Nostoc_sp_uid244 0 0 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0Novosphingobium_PP1Y_uid66901 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 3 0 0 0 0 0 0 1 2 0 0Novosphingobium_aromaticivorans_DSM_12444_uid204 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 3 3 0 0Oceanithermus_profundus_DSM_14977_uid40223 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Oceanobacillus_iheyensis_uid284 1 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ochrobactrum_anthropi_ATCC_49188_uid19485 0 0 0 0 0 0 0 11 0 3 0 0 0 0 0 0 0 0 0 0 0 6 0 2 5 5 0 5Odoribacter_splanchnicus_DSM_20712_uid43469 2 0 0 0 3 0 0 0 0 0 4 0 0 0 0 0 0 4 0 0 1 0 0 0 0 22 0 2Oenococcus_oeni_PSU-­‐1_uid317 0 0 0 3 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0Oligotropha_carboxidovorans_OM4_uid66839 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 3 0 0Oligotropha_carboxidovorans_OM5_uid28805 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Oligotropha_carboxidovorans_OM5_uid41025 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 2 0 0Olsenella_uli_DSM_7084_uid36641 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Onion_yellows_phytoplasma_uid9615 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0Opitutus_terrae_PB90_1_uid19989 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0Orientia_tsutsugamushi_Boryong_uid16180 0 0 0 0 0 0 0 3 0 0 0 0 0 0 73 0 0 0 0 0 2 0 0 10 0 73 0 0Orientia_tsutsugamushi_Ikeda_uid18983 0 0 0 0 0 0 0 27 11 0 0 0 0 0 35 0 0 0 0 0 17 0 0 2 0 1 0 0Oxalobacteraceae_symbiotic_bacterium_CARI_uid51243 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Paenibacillus_JDR_2_uid20399 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Paenibacillus_mucilaginosus_KNP414_uid67335 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 8 0 0 0 0Paenibacillus_polymyxa_E681_uid16065 3 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 9 0 0 0 8 0 0Paenibacillus_polymyxa_SC2_uid52019 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 8 0 0 0 0 0 0Paenibacillus_polymyxa_uid68665 0 0 0 0 0 0 3 0 0 0 0 0 0 0 1 0 0 0 0 0 0 9 0 0 0 1 0 0Paludibacter_propionicigenes_WB4_uid42009 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pantoea_At_9b_uid33803 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 2Pantoea_ananatis_AJ13355_uid60793 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0Pantoea_ananatis_LMG_20103_uid43085 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 0 0Pantoea_vagans_C9_1_uid43531 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Parabacteroides_distasonis_ATCC_8503_uid13485 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 6 0 0 1 0 0 0 0 0 0 0Parachlamydia_acanthamoebae_UV7_uid49033 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 2 5 0 0 0 0 0 0 0 0 4 0 0Parachlamydia_sp_UWE25_uid10700 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0 0Paracoccus_denitrificans_PD1222_uid13020 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 2 0 1 0 8 0 1Parvibaculum_lavamentivorans_DS-­‐1_uid17639 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Parvularcula_bermudensis_HTCC2503_uid13509 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pasteurella_multocida_uid39 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pectobacterium_carotovorum_PC1_uid31289 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pectobacterium_wasabiae_WPP163_uid31293 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0Pediococcus_pentosaceus_ATCC_25745_uid398 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pedobacter_heparinus_DSM_2366_uid27949 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pedobacter_saltans_DSM_12145_uid49337 1 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 3 0 0 6 0 0 2 0Pelobacter_carbinolicus_uid13337 5 0 0 2 0 0 0 0 0 0 0 0 0 9 4 0 0 0 0 0 0 1 0 5 0 0 1 0Pelobacter_propionicus_DSM_2379_uid13384 25 0 1 0 0 0 0 1 0 3 0 0 0 5 6 0 0 5 0 0 0 0 0 0 0 1 3 0Pelodictyon_phaeoclathratiforme_BU_1_uid13011 0 0 0 0 7 3 1 4 6 0 0 0 0 0 2 0 0 0 0 0 0 4 0 0 0 0 0 0Pelotomaculum_thermopropionicum_SI_uid19023 0 0 0 0 2 0 0 0 2 0 0 0 0 4 7 0 7 0 0 0 0 2 0 2 0 6 7 0Persephonella_marina_EX_H1_uid12526 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 34: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Petrotoga_mobilis_SJ95_uid17679 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0Phenylobacterium_zucineum_HLK1_uid19931 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 10 0 4 0 0 0 0Photobacterium_profundum_SS9_uid13128 0 0 13 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 11 0 0 2 0 74 9Photorhabdus_asymbiotica_uid30577 0 0 0 0 1 0 0 10 0 0 0 1 0 0 0 0 0 0 5 0 3 7 0 2 0 0 0 0Photorhabdus_luminescens_uid9605 0 0 0 0 7 0 2 44 1 0 2 6 0 0 0 0 0 0 0 0 17 0 0 3 0 6 11 0Picrophilus_torridus_DSM_9790_uid10641 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Pirellula_sp_uid413 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pirellula_staleyi_DSM_6068_uid29845 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0Planctomyces_brasiliensis_DSM_5305_uid47863 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 7 0 26 3 0Planctomyces_limnophilus_DSM_3776_uid29411 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0Polaromonas_JS666_uid13121 0 0 2 5 0 1 0 1 0 2 0 0 0 1 0 1 0 0 0 0 0 8 0 4 2 8 3 0Polaromonas_naphthalenivorans_CJ2_uid13418 2 0 1 0 0 0 2 19 2 0 0 0 0 0 0 2 0 3 0 0 0 0 0 5 0 14 0 2Polymorphum_gilvum_SL003B_26A1_uid63293 0 0 0 0 0 0 0 0 0 0 0 0 0 9 3 0 0 11 0 0 0 3 0 0 0 5 0 0Polynucleobacter_necessarius_STIR1_uid19991 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Polynucleobacter_necessarius_asymbioticus_QLW_P1DMWA_1_uid16679 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Porphyromonas_asaccharolytica_DSM_20707_uid51745 0 0 0 0 3 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 8 0 3 0 0 0 0Porphyromonas_gingivalis_ATCC_33277_uid19051 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 25 0 0 0 0 38 0 0Porphyromonas_gingivalis_TDC60_uid66755 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 10 0 0 0 0 23 0 0Porphyromonas_gingivalis_W83_uid48 0 0 0 0 0 0 0 0 0 0 6 0 0 0 2 0 0 0 0 0 5 11 0 0 0 4 10 0Prevotella_denticola_F0289_uid49293 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 2 0 1 2 0Prevotella_melaninogenica_ATCC_25845_uid31383 2 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 5 0Prevotella_ruminicola_23_uid10619 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_AS9601_uid13548 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_CCMP1375_uid419 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MED4_uid213 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MIT9313_uid220 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MIT_9211_uid13551 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MIT_9215_uid18633 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MIT_9301_uid15746 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MIT_9303_uid13496 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MIT_9312_uid13910 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_MIT_9515_uid13617 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_NATL1A_uid15660 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Prochlorococcus_marinus_NATL2A_uid13911 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Propionibacterium_acnes_266_uid56091 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Propionibacterium_acnes_6609_uid66845 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Propionibacterium_acnes_KPA171202_uid12460 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Propionibacterium_acnes_SK137_uid31005 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Propionibacterium_freudenreichii_shermanii_CIP_103027_uid46291 0 0 0 23 0 0 0 0 0 0 0 3 0 8 0 0 0 0 0 0 0 20 0 0 0 0 0 0Prosthecochloris_aestuarii_DSM_271_uid12749 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 6 0 0 0 0Prosthecochloris_vibrioformis_DSM_265_uid12607 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0Proteus_mirabilis_uid12624 0 0 0 0 0 0 13 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pseudoalteromonas_SM9913_uid39311 1 0 1 0 0 0 0 0 0 0 0 0 0 0 1 0 5 0 0 0 0 9 0 0 0 0 0 0Pseudoalteromonas_atlantica_T6c_uid13454 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 10 0Pseudoalteromonas_haloplanktis_TAC125_uid15713 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pseudomonas_aeruginosa_LESB58_uid31101 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Pseudomonas_aeruginosa_PA7_uid16720 1 0 0 0 0 0 0 0 1 0 0 0 0 0 1 0 3 2 0 0 0 8 0 0 2 0 1 0Pseudomonas_aeruginosa_UCBPP-­‐PA14_uid386 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 1Pseudomonas_aeruginosa_uid331 1 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 3 0 6 0 0 0 0Pseudomonas_brassicacearum_NFM421_uid63495 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 2 0 7 0 0 0 2Pseudomonas_entomophila_L48_uid16800 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 1 0 0 0 0 1 0 0 0 0 1 0Pseudomonas_fluorescens_Pf-­‐5_uid327 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pseudomonas_fluorescens_Pf0_1_uid12 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 1 0 0 0 0Pseudomonas_fluorescens_SBW25_uid31229 1 0 0 0 0 0 0 0 0 0 0 0 0 4 1 0 0 0 0 0 0 3 0 4 0 0 0 0Pseudomonas_fulva_12_X_uid49675 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 2 0 0 0 0 0 0Pseudomonas_mendocina_NK_01_uid64797 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0Pseudomonas_mendocina_ymp_uid17457 1 0 0 13 0 0 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0 0 0 0 0 0 0 0Pseudomonas_putida_BIRD_1_uid54033 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 0 1Pseudomonas_putida_F1_uid13909 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 6 0 1 0 0Pseudomonas_putida_GB_1_uid17629 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 3 0 3Pseudomonas_putida_KT2440_uid267 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 14 0 1 6 10

Page 35: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Pseudomonas_putida_S16_uid67881 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 9 0 0 0 11 0 28 0 8 0 4Pseudomonas_putida_W619_uid17053 1 0 0 3 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 0 0 0 0 0 2 0 1Pseudomonas_stutzeri_A1501_uid16817 2 0 0 2 0 0 0 5 0 0 0 3 0 0 3 0 0 8 0 0 0 7 0 0 0 1 0 6Pseudomonas_stutzeri_ATCC_17588___LMG_11199_uid68131 0 0 0 7 0 0 0 4 0 0 0 0 0 0 0 0 0 15 0 0 0 9 0 0 0 7 0 0Pseudomonas_stutzeri_DSM_4166_uid63543 0 0 0 11 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 14 0 5Pseudomonas_syringae_phaseolicola_1448A_uid12416 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 48 2 14 0 0 10 0 5 0 8 0 1Pseudomonas_syringae_pv_B728a_uid323 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 14 0 0 0 1 0 1Pseudomonas_syringae_tomato_DC3000_uid359 13 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 23 3 0 0 9 0 10 0 39 0 34Pseudonocardia_dioxanivorans_CB1190_uid40557 0 0 0 0 0 1 0 5 0 1 0 9 0 0 9 3 12 6 0 0 8 0 0 10 0 11 0 0Pseudoxanthomonas_suwonensis_11_1_uid48619 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 0 0 0 0 0 0 0 3 0 0Psychrobacter_PRwf-­‐1_uid15759 0 0 0 0 0 0 18 0 0 0 0 0 0 0 0 0 0 0 0 0 0 15 4 0 0 2 2 3Psychrobacter_arcticum_273-­‐4_uid9633 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 10 14 0 0 0 0 0 0Psychrobacter_cryohalolentis_K5_uid13920 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 8 0 0 0 0 0 0Psychromonas_ingrahamii_37_uid16187 1 0 1 0 0 0 0 6 0 0 5 6 0 0 0 0 0 0 0 0 0 3 0 2 0 1 0 0Pusillimonas_T7_7_uid65659 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4Pyrobaculum_aerophilum_uid172 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrobaculum_arsenaticum_DSM_13514_uid15582 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrobaculum_calidifontis_JCM_11548_uid18111 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrobaculum_islandicum_DSM_4184_uid16743 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrococcus_NA2_uid65431 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrococcus_abyssi_uid179 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrococcus_furiosus_uid287 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 5 0 0 0 23 0 0 0Pyrococcus_horikoshii_uid207 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrococcus_yayanosii_CH1_uid66055 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Pyrolobus_fumarii_1A_uid48579 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rahnella_Y9602_uid50601 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 12 0 0 11 0Ralstonia_eutropha_H16_uid13603 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0Ralstonia_eutropha_JMP134_uid10646 2 0 0 2 0 0 0 0 0 1 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 4 0 0Ralstonia_pickettii_12D_uid18937 0 0 0 1 0 0 0 0 0 5 0 1 0 0 0 0 7 2 0 0 0 9 0 0 0 0 0 3Ralstonia_pickettii_12J_uid17631 0 0 0 0 0 0 0 4 0 4 0 0 0 0 7 0 0 0 0 0 0 8 0 0 0 5 0 3Ralstonia_solanacearum_Po82_uid66837 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 0 0 10 0 0 0 1 0 0Ralstonia_solanacearum_uid13 0 0 0 2 0 0 0 8 0 1 0 0 0 0 3 6 2 4 0 0 0 21 0 0 0 24 7 0Ramlibacter_tataouinensis_TTB310_uid16294 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0Renibacterium_salmoninarum_ATCC_33209_uid19227 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 72 0 0 0 0 0 0Rhizobium_NGR234_uid21101 0 0 0 0 0 9 4 0 5 0 2 0 0 3 2 0 1 24 6 0 0 0 0 17 0 7 6 5Rhizobium_etli_CFN_42_uid13932 0 0 0 0 0 0 0 5 0 0 0 0 0 1 3 0 1 4 0 0 0 1 0 5 0 6 1 12Rhizobium_etli_CIAT_652_uid28021 0 0 0 0 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0 0 3 0 0 0 0 0 6Rhizobium_leguminosarum_bv__trifolii_WSM1325_uid20097 0 0 0 1 0 0 0 0 2 0 0 0 0 4 0 0 0 0 0 0 0 1 0 3 0 0 4 1Rhizobium_leguminosarum_bv__trifolii_WSM2304_uid20179 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 3 0 5 0 5 4 0Rhizobium_leguminosarum_bv_viciae_3841_uid344 1 0 0 0 0 0 0 0 2 0 0 1 0 0 2 0 0 0 6 0 0 7 0 6 15 5 0 3Rhodobacter_capsulatus_SB1003_uid55 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Rhodobacter_sphaeroides_2_4_1_uid56 0 0 0 0 0 0 0 0 4 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1Rhodobacter_sphaeroides_ATCC_17025_uid15755 0 0 0 0 0 0 0 0 0 0 0 0 0 4 2 0 0 2 0 0 0 5 0 9 0 28 0 14Rhodobacter_sphaeroides_ATCC_17029_uid15754 0 0 0 0 0 0 0 0 0 0 0 0 0 5 4 0 0 0 0 0 0 0 0 0 0 0 0 0Rhodobacter_sphaeroides_KD131_uid31111 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 10 0 0 0 0 0 0Rhodococcus_equi_103S_uid41335 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rhodococcus_erythropolis_PR4_uid20395 0 0 0 1 0 0 0 0 0 0 0 0 0 0 2 0 1 7 1 0 0 0 0 2 0 0 0 0Rhodococcus_jostii_RHA1_uid13693 0 0 0 0 2 1 0 8 3 0 0 5 1 1 8 9 8 0 3 0 0 11 0 9 1 8 0 0Rhodococcus_opacus_B4_uid34839 0 0 0 0 1 1 0 0 0 0 0 1 1 0 3 1 2 1 0 0 0 8 0 3 0 4 0 0Rhodoferax_ferrireducens_T118_uid13908 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6Rhodomicrobium_vannielii_ATCC_17100_uid38253 0 0 0 0 0 0 0 11 0 0 0 0 0 19 0 0 0 0 0 0 0 0 0 0 0 0 0 15Rhodopseudomonas_palustris_BisA53_uid15751 3 0 0 0 0 0 0 2 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 7 0 2 0 5Rhodopseudomonas_palustris_BisB18_uid15750 0 0 1 0 0 0 0 0 0 0 1 0 0 0 6 0 2 2 0 0 0 1 0 12 0 1 0 0Rhodopseudomonas_palustris_BisB5_uid15749 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 6 0 0Rhodopseudomonas_palustris_CGA009_uid57 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 4 0 0 0 0 0 0Rhodopseudomonas_palustris_DX_1_uid38503 5 0 0 0 0 0 0 0 0 0 0 0 0 5 1 0 0 0 0 0 0 2 0 6 0 1 0 0Rhodopseudomonas_palustris_HaA2_uid15747 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0Rhodopseudomonas_palustris_TIE_1_uid20167 0 0 0 0 0 0 0 0 0 0 0 0 0 4 1 0 0 0 0 0 0 2 0 2 0 1 0 0Rhodospirillum_centenum_SW_uid18307 6 0 0 0 0 0 0 5 0 0 14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rhodospirillum_rubrum_ATCC_11170_uid58 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 2 0 2 0 2Rhodospirillum_rubrum_F11_uid67413 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 2 0 2

Page 36: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Rhodothermus_marinus_DSM_4252_uid29281 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0Rhodothermus_marinus_SG0_5JP17_172_uid56095 0 0 0 0 0 0 2 5 0 0 0 0 0 0 0 4 0 0 0 0 2 0 0 0 0 3 2 0Rickettsia_africae_ESF_5_uid18269 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_akari_Hartford_uid12953 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_bellii_OSU_85-­‐389_uid17237 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 25 0 0 0 0Rickettsia_bellii_RML369-­‐C_uid13996 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 26 0 0 0 0Rickettsia_canadensis_McKiel_uid12952 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_conorii_uid42 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_felis_URRWXCal2_uid13884 0 0 0 0 0 0 0 1 2 0 0 0 0 2 4 0 42 6 0 0 0 0 0 0 0 3 0 0Rickettsia_heilongjiangensis_054_uid66907 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_massiliae_MTU5_uid18271 0 0 1 0 0 0 10 0 0 0 0 0 0 2 4 0 0 0 0 0 0 0 0 1 0 0 1 0Rickettsia_peacockii_Rustic_uid31309 0 0 0 0 0 0 0 0 0 0 0 0 0 1 42 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_prowazekii_Rp22_uid19813 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_prowazekii_uid43 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_rickettsii_Iowa_uid19943 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_rickettsii_Sheila_Smith_uid9636 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rickettsia_typhi_wilmington_uid10679 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Riemerella_anatipestifer_DSM_15868_uid41989 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0Riemerella_anatipestifer_RA_GD_uid49039 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 4 0 0 0 0 0 0 0Robiginitalea_biformata_HTCC2501_uid13461 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Roseburia_hominis_A2_183_uid33399 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 2 0 0 0 2Roseburia_intestinalis_XB6B4_uid45953 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0Roseburia_intestinalis_uid39165 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1Roseiflexus_RS-­‐1_uid16190 0 0 0 0 2 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0 0 0 40 0 17 0 1Roseiflexus_castenholzii_DSM_13941_uid13462 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 1 0 0 0 0 1 0 0 0 0 1 2 0Roseobacter_denitrificans_OCh_114_uid16426 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0Roseobacter_litoralis_Och_149_uid19357 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 7 0 6 2 0 0 4Rothia_dentocariosa_ATCC_17931_uid48447 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rothia_mucilaginosa_uid38547 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Rubrobacter_xylanophilus_DSM_9941_uid10670 1 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0 0 0 0 0 4 0 0Ruegeria_pomeroyi_DSS_3_uid281 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0Ruminococcus_18P13_uid39179 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ruminococcus_albus_7_uid42255 0 0 0 0 0 0 5 0 0 0 4 0 0 0 9 0 3 0 0 0 0 3 0 5 0 5 0 16Ruminococcus_bromii_uid39153 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ruminococcus_obeum_uid39167 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Ruminococcus_torques_uid39169 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0Ruminococcus_uid39149 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Runella_slithyformis_DSM_19594_uid49125 0 0 0 8 0 0 0 15 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 6 0 11 0 0Saccharomonospora_viridis_DSM_43017_uid20835 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Saccharophagus_degradans_2-­‐40_uid316 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0Saccharopolyspora_erythraea_NRRL_2338_uid18489 0 0 0 0 0 0 2 11 0 1 0 12 0 0 1 3 5 6 0 0 0 5 0 1 0 20 0 0Salinibacter_ruber_DSM_13855_uid16159 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 14 0 0 0 1 0Salinispora_arenicola_CNS-­‐205_uid17109 0 0 0 0 0 0 0 6 0 0 0 0 0 0 1 0 0 6 0 0 0 0 0 0 0 3 1 0Salinispora_tropica_CNB-­‐440_uid16342 0 0 0 0 0 1 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 2 0 0 0 0 0 0Salmonella_bongori_NCTC_12419_uid351 0 0 0 0 0 0 2 1 0 0 0 0 0 0 4 0 0 0 0 0 0 4 0 0 0 0 0 0Salmonella_enterica_Choleraesuis_uid9618 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 1 2 0 0 16 2 0 11 0 10 0Salmonella_enterica_Paratypi_ATCC_9150_uid13086 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Salmonella_enterica_arizonae_serovar_62_z4_z23__RSK2980_uid13030 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Salmonella_enterica_serovar_Agona_SL483_uid20063 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 1 0 0 7 0Salmonella_enterica_serovar_Dublin_CT_02021853_uid19467 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 5 0 1 0 0 0 0Salmonella_enterica_serovar_Enteritidis_P125109_uid30687 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 1 0 0 0 0Salmonella_enterica_serovar_Gallinarum_287_91_uid30689 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 1 0 0 0 0Salmonella_enterica_serovar_Heidelberg_SL476_uid20045 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 3 10 1 0Salmonella_enterica_serovar_Newport_SL254_uid18747 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 2 0 2 4 0 0 0Salmonella_enterica_serovar_Paratyphi_A_AKU_12601_uid30943 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 2 0Salmonella_enterica_serovar_Paratyphi_B_SPB7_uid27803 0 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Salmonella_enterica_serovar_Paratyphi_C_RKS4594_uid20993 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 13 0 0 0 0 0 0Salmonella_enterica_serovar_Schwarzengrund_CVM19633_uid19459 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 3 0 0 9 0 0 0Salmonella_enterica_serovar_Typhi_Ty2_uid371 0 0 0 0 0 0 26 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Salmonella_enterica_serovar_Typhimurium_14028S_uid33067 0 0 0 0 0 0 10 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0Salmonella_enterica_serovar_Typhimurium_4_74_uid56087 0 0 0 0 0 0 7 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0

Page 37: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Salmonella_enterica_serovar_Typhimurium_SL1344_uid50407 0 0 0 0 0 0 7 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0Salmonella_enterica_serovar_Typhimurium_T000240_uid45951 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 13 0 4 0 0 0Salmonella_enterica_serovar_Typhimurium_UK_1_uid63211 0 0 0 0 0 0 9 0 0 0 0 0 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0Salmonella_enterica_serovar_Typhimurium_uid40625 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 1 0 0 0Salmonella_typhi_uid236 0 0 0 0 0 0 26 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 4 0 3 0Salmonella_typhimurium_LT2_uid241 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Sanguibacter_keddieii_DSM_10542_uid19711 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Sebaldella_termitidis_ATCC_33386_uid29539 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Segniliparus_rotundus_DSM_44985_uid37711 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Selenomonas_sputigena_ATCC_35185_uid51247 0 0 0 0 0 0 6 0 0 0 0 8 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Serratia_AS12_uid60453 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Serratia_AS13_uid60455 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Serratia_AS9_uid60457 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Serratia_proteamaculans_568_uid17459 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 7 0 0 0 0 0 0Shewanella_ANA-­‐3_uid13905 0 0 0 0 9 0 3 0 0 1 1 1 0 1 0 0 0 0 1 0 0 9 0 0 0 1 0 0Shewanella_MR-­‐4_uid13904 0 0 0 1 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 5 0 4 2 0Shewanella_MR-­‐7_uid13903 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 11 0 0 0 4 1 0Shewanella_W3-­‐18-­‐1_uid13902 0 0 0 3 12 0 0 1 0 0 0 0 0 0 0 0 1 1 2 0 0 7 0 2 1 3 8 3Shewanella_amazonensis_SB2B_uid13385 0 0 0 0 0 0 8 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0Shewanella_baltica_BA175_uid49719 0 0 1 0 0 0 1 0 0 0 7 5 0 0 0 0 0 1 0 0 1 11 0 0 0 3 3 0Shewanella_baltica_OS117_uid49451 0 0 0 0 0 0 0 1 0 5 0 26 0 0 0 0 2 35 0 0 1 43 0 0 0 10 5 3Shewanella_baltica_OS155_uid13386 0 0 0 0 0 0 0 1 0 4 0 27 0 0 0 0 2 33 0 0 1 46 0 0 0 11 5 0Shewanella_baltica_OS185_uid17643 0 0 0 0 0 0 2 0 0 0 7 0 0 0 0 0 0 0 4 0 1 14 0 4 1 4 2 1Shewanella_baltica_OS195_uid13389 0 0 0 0 0 0 0 0 0 0 9 1 0 0 0 0 2 0 3 0 1 23 0 3 1 5 1 1Shewanella_baltica_OS223_uid17985 0 0 0 0 0 0 0 0 0 2 37 1 0 0 0 0 0 1 1 0 1 11 0 8 2 4 10 0Shewanella_baltica_OS678_uid47019 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0 0 4 0 1 25 0 2 1 4 1 1Shewanella_denitrificans_OS217_uid13390 0 0 0 0 0 0 0 0 0 0 5 8 0 1 0 0 4 0 1 0 8 10 0 10 0 0 6 0Shewanella_frigidimarina_NCIMB_400_uid13391 0 0 2 0 0 0 0 6 0 2 0 0 0 0 0 0 0 3 0 0 0 15 0 6 0 0 3 0Shewanella_halifaxensis_HAW_EB4_uid20241 0 0 0 1 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Shewanella_loihica_PV-­‐4_uid13906 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 5 0 0 0 0Shewanella_oneidensis_uid335 0 0 1 0 0 0 1 6 0 2 1 0 0 5 0 0 50 0 4 0 1 51 0 4 3 15 22 0Shewanella_pealeana_ATCC_700345_uid17415 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 21 0 11 0 0 0 0Shewanella_piezotolerans_WP3_uid17675 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 17 0 0 0 0Shewanella_putrefaciens_200_uid13392 0 0 0 0 0 1 0 8 0 0 2 0 0 0 2 0 2 0 3 0 0 7 0 12 1 3 33 10Shewanella_putrefaciens_CN-­‐32_uid13393 0 0 0 2 0 0 0 4 0 2 2 0 0 1 0 0 0 0 3 0 0 7 0 1 1 2 1 0Shewanella_sediminis_HAW-­‐EB3_uid18789 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 4 0 0 0 0 0 0Shewanella_violacea_DSS12_uid34739 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 2 1Shewanella_woodyi_ATCC_51908_uid17455 0 0 0 2 0 0 13 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 6 0 0 0 0Silicibacter_TM1040_uid13040 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 1 0 0Shigella_boydii_CDC_3083_94_uid15637 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 1 2 0 0 112 223 29 0 3 24 48Shigella_boydii_Sb227_uid13146 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 139 166 16 0 1 0 1Shigella_dysenteriae_uid13145 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 204 440 6 0 3 0 1Shigella_flexneri_2002017_uid33639 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 2 5 0 0 121 117 8 0 2 1 8Shigella_flexneri_2a_2457T_uid408 0 0 0 0 0 0 0 0 0 0 0 0 0 0 36 0 0 0 0 0 0 37 105 5 0 0 1 5Shigella_flexneri_2a_301_uid310 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 167 116 10 0 3 2 6Shigella_flexneri_5_8401_uid16375 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 126 108 6 0 0 0 4Shigella_sonnei_Ss046_uid13151 0 0 0 0 0 0 0 21 0 0 0 0 0 0 0 0 0 25 0 0 0 188 170 21 0 2 0 1Sideroxydans_lithotrophicus_ES_1_uid33161 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Simkania_negevensis_Z_uid49035 0 0 0 0 0 0 0 1 0 0 0 0 0 0 13 0 0 0 0 0 0 0 0 0 0 1 0 0Sinorhizobium_medicae_WSM419_uid16304 0 0 0 0 0 0 0 6 4 0 0 0 0 0 1 0 0 3 0 0 0 4 0 10 0 9 0 5Sinorhizobium_meliloti_AK83_uid41993 0 0 2 0 0 0 0 14 9 0 5 2 0 2 4 1 2 10 2 0 0 4 0 7 0 22 5 10Sinorhizobium_meliloti_BL225C_uid42477 0 0 0 1 0 0 0 10 7 0 7 5 0 4 0 1 14 2 0 0 0 0 0 0 2 1 8 4Sinorhizobium_meliloti_SM11_uid41117 1 0 1 1 0 0 0 5 1 0 15 4 0 4 0 3 8 17 2 0 0 1 0 1 0 4 11 3Sinorhizobium_meliloti_uid19 0 0 0 0 0 0 0 2 8 0 5 3 0 5 2 0 14 0 2 0 0 9 0 3 0 0 9 2Slackia_heliotrinireducens_DSM_20476_uid20831 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 3 0 0 0 0 3 0 4 0 0 0 0Sodalis_glossinidius_morsitans_uid16309 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 23 0 0Solibacter_usitatus_Ellin6076_uid12638 0 0 0 0 0 0 0 0 4 0 0 0 0 13 0 1 0 0 0 0 0 17 0 19 5 0 0 0Sorangium_cellulosum__So_ce_56__uid28111 1 0 0 0 0 0 0 0 0 0 0 0 0 3 4 0 0 0 0 0 0 4 1 0 0 0 9 10Sphaerobacter_thermophilus_DSM_20745_uid21087 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 19 0 0 0 0 0 0 0Sphingobacterium_21_uid51631 0 0 0 4 0 0 0 0 0 0 0 0 0 0 3 0 0 7 0 0 0 0 0 11 0 0 11 3Sphingobium_SYK_6_uid67115 0 0 0 0 0 0 0 6 0 0 0 0 0 4 3 0 0 0 0 0 0 3 0 0 0 2 0 0

Page 38: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Sphingobium_chlorophenolicum_L_1_uid50015 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 5 0 1 0 0 0 0Sphingobium_japonicum_UT26S_uid19949 0 0 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0Sphingomonas_wittichii_RW1_uid17343 4 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 7 0 0 0 1 0 4 0 12 0 0Sphingopyxis_alaskensis_RB2256_uid13907 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 4 0 0 0 0 0 1 0 8 0 0Spirochaeta_Buddy_uid47953 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Spirochaeta_caldaria_DSM_7334_uid46527 0 0 0 0 0 0 0 0 0 0 0 0 0 0 18 0 2 0 0 0 0 3 0 0 0 0 0 0Spirochaeta_coccoides_DSM_17374_uid48121 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0Spirochaeta_smaragdinae_DSM_11293_uid32637 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 9 0 0 0 0 0 0Spirochaeta_thermophila_DSM_6192_uid39877 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Spirochaeta_thermophila_DSM_6578_uid50823 10 0 8 0 0 0 0 0 6 0 0 0 0 0 9 0 22 0 0 0 0 0 0 0 0 0 0 0Spirosoma_linguale_DSM_74_uid28817 0 0 0 1 0 0 0 8 0 0 0 0 0 0 2 0 0 18 0 0 0 11 0 10 0 6 0 0Stackebrandtia_nassauensis_DSM_44728_uid19713 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Staphylococcus_aureus_04_02981_uid34809 2 0 0 6 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 1 0 1 2 0 0 0Staphylococcus_aureus_COL_uid238 3 0 0 4 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0Staphylococcus_aureus_ECT_R_2_uid52833 2 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 2 0 0 0Staphylococcus_aureus_ED133_uid41277 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 7 0 0 0 0 0 0Staphylococcus_aureus_ED98_uid39547 2 0 0 8 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 1 1 0 0 0Staphylococcus_aureus_JH1_uid15758 1 0 0 8 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 1 0 1 5 0 0 0Staphylococcus_aureus_JH9_uid15757 2 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 5 0 0 0Staphylococcus_aureus_JKD6008_uid29567 3 0 0 3 0 0 4 0 0 0 0 0 0 0 2 0 17 0 0 0 0 0 0 0 7 0 0 0Staphylococcus_aureus_JKD6159_uid50759 8 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0Staphylococcus_aureus_LGA251_uid62883 1 0 0 1 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 3 0 0 0 0 0 0Staphylococcus_aureus_MW2_uid306 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0Staphylococcus_aureus_Mu3_uid18509 2 0 0 10 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 1 0 1 2 0 0 0Staphylococcus_aureus_Mu50_uid263 2 0 0 10 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 1 0 1 6 0 0 0Staphylococcus_aureus_N315_uid264 1 0 0 8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 2 0 0 0Staphylococcus_aureus_NCTC_8325_uid237 5 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Staphylococcus_aureus_Newman_uid18801 5 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Staphylococcus_aureus_RF122_uid63 1 0 0 9 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 2 0 0 0Staphylococcus_aureus_ST398_uid29427 3 0 0 1 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 1 0 0 4 0 0 0Staphylococcus_aureus_T0131_uid65323 5 0 0 4 0 0 0 0 0 0 0 0 0 0 2 0 21 0 0 0 0 0 0 0 3 0 0 0Staphylococcus_aureus_TCH60_uid31539 8 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 2 0 0 1 0 0 0Staphylococcus_aureus_TW20_uid36647 1 0 0 3 0 0 4 0 2 0 0 2 0 0 9 0 8 0 0 0 0 0 0 0 6 0 0 0Staphylococcus_aureus_USA300_FPR3757_uid16313 4 0 0 3 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 1 6 0 0 0Staphylococcus_aureus_USA300_TCH1516_uid19489 3 0 0 3 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 1 5 0 0 0Staphylococcus_aureus_aureus_MRSA252_uid265 9 0 0 0 0 0 0 0 0 0 0 8 0 0 3 0 0 0 0 0 0 6 0 0 2 0 0 0Staphylococcus_aureus_aureus_MSSA476_uid266 2 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Staphylococcus_carnosus_TM300_uid34811 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Staphylococcus_epidermidis_ATCC_12228_uid279 1 0 0 0 0 0 4 0 0 0 0 0 0 0 1 0 0 0 0 0 0 2 0 14 3 0 0 0Staphylococcus_epidermidis_RP62A_uid64 0 0 0 0 0 0 3 0 0 0 0 0 0 0 3 0 5 0 0 0 0 2 0 13 7 0 0 0Staphylococcus_haemolyticus_uid12508 24 0 0 28 0 0 1 0 2 0 0 0 0 0 0 0 14 0 0 0 0 0 0 0 8 0 0 0Staphylococcus_lugdunensis_HKU09_01_uid42395 1 0 0 0 0 0 0 0 0 2 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0Staphylococcus_lugdunensis_N920143_uid67127 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0Staphylococcus_pseudintermedius_ED99_uid62991 1 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 2 0 0 0 0Staphylococcus_pseudintermedius_HKU10_03_uid61283 1 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 1 0 0 0Staphylococcus_saprophyticus_uid15596 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0Staphylothermus_hellenicus_DSM_12710_uid33683 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Staphylothermus_marinus_F1_uid17449 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Starkeya_novella_DSM_506_uid37659 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Stenotrophomonas_maltophilia_K279a_uid30351 0 0 0 1 0 0 0 1 0 0 0 0 0 5 0 0 0 0 0 0 0 14 0 11 0 0 0 0Stenotrophomonas_maltophilia_R551_3_uid17107 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 3 0 8 0 0 0 0Stigmatella_aurantiaca_DW4_3_1_uid52561 6 0 0 0 0 1 0 3 0 0 0 0 1 0 4 0 0 0 1 0 0 1 0 0 0 14 8 20Streptobacillus_moniliformis_DSM_12112_uid29309 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Streptococcus_agalactiae_2603_uid330 0 0 0 0 0 0 0 0 0 0 6 3 0 0 0 0 0 0 0 0 0 7 0 0 0 6 0 0Streptococcus_agalactiae_A909_uid326 5 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 4 0 0 0 6 0 0Streptococcus_agalactiae_NEM316_uid334 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Streptococcus_dysgalactiae_equisimilis_ATCC_12394_uid39783 6 0 0 0 0 0 0 0 0 0 0 21 0 0 0 0 0 0 0 0 1 13 0 4 0 0 0 0Streptococcus_dysgalactiae_equisimilis_GGS_124_uid27849 5 0 0 1 0 0 0 0 0 0 1 23 0 0 0 0 0 0 0 0 9 13 0 0 0 0 0 0Streptococcus_equi_4047_uid30765 0 0 0 4 38 0 2 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0Streptococcus_equi_zooepidemicus_MGCS10565_uid30781 1 0 0 3 4 0 0 0 0 0 4 8 0 0 0 0 0 0 0 0 0 11 0 0 0 0 0 0Streptococcus_equi_zooepidemicus_uid30767 0 0 0 4 5 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 9 0 0 0 0 0 0

Page 39: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Streptococcus_gallolyticus_ATCC_43143_uid62517 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 5 0 0 0 0 0 0Streptococcus_gallolyticus_ATCC_BAA_2069_uid63179 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0 2 0 0 0Streptococcus_gallolyticus_UCN34_uid34729 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0Streptococcus_gordonii_Challis_substr_CH1_uid66 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Streptococcus_mitis_B6_uid16302 4 0 0 4 0 0 2 0 0 0 0 46 0 0 0 0 0 0 0 0 0 1 0 0 0 2 0 0Streptococcus_mutans_NN2025_uid28997 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Streptococcus_mutans_uid333 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 0 0 0 0 0Streptococcus_oralis_Uo5_uid60241 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Streptococcus_parasanguinis_ATCC_15912_uid48315 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Streptococcus_parauberis_KCTC_11537_uid61187 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Streptococcus_pasteurianus_ATCC_43144_uid62519 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 3 0 0 0Streptococcus_pneumoniae_670_6B_uid292 0 0 0 7 0 2 1 6 1 0 0 2 0 0 3 0 0 0 0 0 0 6 0 0 0 6 5 0Streptococcus_pneumoniae_70585_uid28043 1 0 0 3 0 2 1 11 0 0 0 1 0 0 0 0 0 0 0 0 0 6 0 0 0 5 0 0Streptococcus_pneumoniae_AP200_uid32641 0 0 0 6 0 3 1 7 1 0 0 6 0 0 0 0 0 0 0 0 0 4 0 0 0 4 0 0Streptococcus_pneumoniae_ATCC_700669_uid31233 0 0 0 3 0 1 1 5 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0Streptococcus_pneumoniae_CGSP14_uid29179 0 0 0 11 0 4 1 7 1 0 0 2 0 0 2 0 0 0 0 0 0 4 0 0 0 6 0 0Streptococcus_pneumoniae_D39_uid16374 0 0 0 10 0 0 1 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0Streptococcus_pneumoniae_G54_uid29047 0 0 0 12 0 5 1 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 7 0 0Streptococcus_pneumoniae_Hungary19A_6_uid28035 0 0 0 9 0 1 1 6 0 0 0 1 0 0 0 0 0 0 0 0 0 4 0 0 0 4 0 0Streptococcus_pneumoniae_INV104_uid50735 0 0 0 9 0 1 1 5 0 0 0 15 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0Streptococcus_pneumoniae_INV200_uid50731 0 0 0 11 0 2 1 8 2 0 0 2 0 0 1 0 0 0 0 0 0 5 0 0 0 2 0 0Streptococcus_pneumoniae_JJA_uid28039 0 0 0 5 0 1 1 5 0 0 0 3 0 0 0 0 0 0 0 0 0 5 0 0 0 4 0 0Streptococcus_pneumoniae_OXC141_uid50733 0 0 0 9 0 1 1 5 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0Streptococcus_pneumoniae_P1031_uid28041 0 0 0 6 0 5 1 8 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 4 0 0Streptococcus_pneumoniae_R6_uid278 0 0 0 5 0 0 1 8 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 4 0 0Streptococcus_pneumoniae_SPN032672_uid50809 0 0 0 8 0 1 1 5 0 0 0 17 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0Streptococcus_pneumoniae_SPN033038_uid50811 0 0 0 8 0 1 1 5 0 0 0 18 0 0 0 0 0 0 0 0 0 0 0 0 0 7 0 0Streptococcus_pneumoniae_SPN034156_uid50793 0 0 0 7 0 2 1 4 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0Streptococcus_pneumoniae_SPN034183_uid50795 0 0 0 9 0 1 1 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0Streptococcus_pneumoniae_SPN994038_uid50799 0 0 0 7 0 1 1 4 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0Streptococcus_pneumoniae_SPN994039_uid50801 0 0 0 7 0 1 1 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0Streptococcus_pneumoniae_TCH8431_19A_uid34659 0 0 0 5 0 13 1 6 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 4 0 0Streptococcus_pneumoniae_TIGR4_uid277 0 0 0 9 0 11 3 9 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Streptococcus_pneumoniae_Taiwan19F_14_uid28037 0 0 0 15 0 17 1 7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0Streptococcus_pseudopneumoniae_IS7493_uid69547 0 0 0 3 0 1 0 12 1 0 0 1 0 0 0 0 0 0 0 0 0 2 0 1 0 8 0 0Streptococcus_pyogenes_M1_GAS_uid269 1 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0Streptococcus_pyogenes_MGAS10270_uid16364 0 0 0 0 0 0 0 0 0 0 2 0 0 0 1 0 0 0 0 0 0 5 0 0 0 0 0 0Streptococcus_pyogenes_MGAS10394_uid12469 3 0 0 0 0 0 0 0 0 0 1 10 0 0 1 0 0 0 0 0 1 2 0 0 0 0 0 0Streptococcus_pyogenes_MGAS10750_uid16366 6 0 0 0 0 0 0 0 0 0 4 1 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Streptococcus_pyogenes_MGAS2096_uid16365 2 0 0 0 0 0 0 0 0 0 7 2 0 0 0 0 0 0 0 0 3 5 0 0 0 0 0 0Streptococcus_pyogenes_MGAS315_uid311 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Streptococcus_pyogenes_MGAS5005_uid13888 1 0 0 0 0 0 0 0 0 0 9 0 0 0 1 0 0 0 0 0 0 5 0 0 0 0 0 0Streptococcus_pyogenes_MGAS6180_uid13887 0 0 0 0 0 0 0 0 0 0 4 0 0 0 1 0 0 0 0 0 0 5 0 0 0 0 0 0Streptococcus_pyogenes_MGAS8232_uid286 1 0 0 0 0 0 0 0 0 0 0 12 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Streptococcus_pyogenes_MGAS9429_uid16363 2 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 2 5 0 0 0 0 0 0Streptococcus_pyogenes_Manfredo_uid270 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Streptococcus_pyogenes_NZ131_uid20707 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 7 0 0 0 0 0 0Streptococcus_pyogenes_SSI-­‐1_uid301 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0Streptococcus_salivarius_57_I_uid68129 4 0 0 8 0 0 9 0 0 0 0 9 0 0 0 0 1 0 0 0 0 1 0 0 0 0 0 0Streptococcus_salivarius_JIM8777_uid67171 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Streptococcus_salivarius_JIM8780_uid67173 3 0 0 12 0 0 18 0 0 0 0 14 0 0 0 0 1 0 0 0 0 4 0 0 0 0 0 0Streptococcus_sanguinis_SK36_uid13942 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0Streptococcus_suis_05ZYH33_uid17153 0 0 0 1 0 1 2 4 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 7 0 0 1 0Streptococcus_suis_98HAH33_uid17155 0 0 0 1 0 1 2 4 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 6 0 0 1 0Streptococcus_suis_BM407_uid32237 0 0 0 0 0 0 2 4 0 0 0 4 0 0 3 0 0 0 0 0 0 0 0 5 4 0 1 0Streptococcus_suis_GZ1_uid18737 0 0 0 1 0 0 1 4 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 3 0 1 4 0Streptococcus_suis_JS14_uid61797 0 0 0 0 0 0 1 3 0 0 0 4 0 0 0 0 0 0 0 0 0 0 0 10 0 1 1 0Streptococcus_suis_P1_7_uid352 0 0 0 1 0 0 2 4 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 6 0 1 1 0Streptococcus_suis_SC84_uid32239 0 0 0 1 0 1 2 4 0 0 0 5 0 0 1 0 0 0 0 0 0 0 0 6 0 1 1 0Streptococcus_suis_ST3_uid65245 0 0 0 6 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 2 3 0 26 0 0 0 0Streptococcus_thermophilus_CNRZ1066_uid13163 0 0 0 4 0 0 0 0 0 0 0 12 0 0 0 0 7 0 0 0 0 5 0 0 1 0 0 0

Page 40: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Streptococcus_thermophilus_LMD-­‐9_uid13773 0 0 0 18 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 10 0 0 5 0 0 0Streptococcus_thermophilus_LMG_18311_uid13162 0 0 0 11 0 0 0 0 0 0 0 8 0 0 2 0 6 0 0 0 0 8 0 0 1 0 0 0Streptococcus_thermophilus_ND03_uid49149 0 0 0 19 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 10 0 0 2 0 0 0Streptococcus_thermophilus_uid68521 0 0 0 11 0 0 0 0 0 0 0 2 0 0 0 0 8 0 0 0 0 5 0 0 1 3 0 0Streptococcus_uberis_0140J_uid353 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0Streptomyces_SirexAA_E_uid38225 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0Streptomyces_avermitilis_uid189 0 0 0 3 0 0 0 4 0 0 1 0 0 2 3 4 3 0 0 0 0 0 0 1 0 4 2 0Streptomyces_bingchenggensis_BCW_1_uid46847 0 0 0 0 0 0 1 0 0 1 0 0 0 0 5 1 0 0 0 0 0 0 0 3 0 2 2 0Streptomyces_coelicolor_uid242 0 0 0 4 0 0 0 0 0 0 0 0 0 10 0 0 0 0 0 0 0 0 0 6 0 25 0 0Streptomyces_flavogriseus_ATCC_33331_uid33771 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 3 0 0 0 5 0 0Streptomyces_griseus_NBRC_13350_uid20085 0 0 0 0 0 0 0 2 0 0 0 0 0 0 2 0 0 0 0 0 0 2 0 0 0 8 0 0Streptomyces_scabiei_87_22_uid40749 0 0 0 0 0 1 0 6 0 0 0 2 0 0 4 3 1 0 0 0 8 4 0 3 0 18 0 0Streptomyces_violaceusniger_Tu_4113_uid43031 0 0 0 0 0 0 8 5 0 0 0 0 0 0 3 0 9 0 0 0 5 7 0 0 0 14 1 0Streptosporangium_roseum_DSM_43021_uid21083 0 0 0 0 0 0 0 3 0 1 0 0 0 0 5 1 0 0 0 0 0 3 0 0 0 14 0 0Sulfolobus_acidocaldarius_DSM_639_uid13935 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Sulfolobus_islandicus_HVE10_4_uid60481 0 8 0 0 0 0 4 14 3 0 0 0 0 0 0 0 1 0 0 0 0 0 1 9 5 8 0 0Sulfolobus_islandicus_L_D_8_5_uid18803 0 20 0 0 0 0 8 0 0 0 0 0 0 0 0 0 3 0 0 5 0 0 0 15 0 5 0 0Sulfolobus_islandicus_L_S_2_15_uid18987 0 22 0 0 0 0 0 0 1 0 0 0 0 0 0 0 3 0 0 5 0 0 1 19 2 1 0 0Sulfolobus_islandicus_M_14_25_uid18871 0 5 0 0 0 0 0 0 10 0 0 0 0 0 0 0 2 0 0 0 0 0 0 6 0 0 0 0Sulfolobus_islandicus_M_16_27_uid18873 0 11 0 0 0 0 0 2 18 0 0 0 0 0 0 0 4 0 0 0 0 0 2 2 0 1 0 0Sulfolobus_islandicus_M_16_4_uid18807 0 2 0 0 0 0 3 1 3 0 0 0 0 0 0 0 2 0 0 0 0 0 1 1 0 3 0 0Sulfolobus_islandicus_REY15A_uid60485 0 22 0 0 0 0 8 0 3 0 0 0 0 0 0 0 1 0 0 0 0 0 3 5 7 9 0 0Sulfolobus_islandicus_Y_G_57_14_uid19487 0 17 0 0 0 0 0 0 1 0 0 0 0 0 0 0 19 0 0 10 0 0 13 17 0 12 0 0Sulfolobus_islandicus_Y_N_15_51_uid18651 0 21 0 0 0 0 0 0 1 0 0 0 0 0 13 0 0 0 0 18 0 0 15 13 0 32 0 0Sulfolobus_solfataricus_98_2_uid30989 0 25 0 0 0 0 10 18 15 0 0 0 0 0 0 0 3 0 0 9 0 0 3 5 2 35 0 0Sulfolobus_solfataricus_uid108 0 35 0 0 0 0 12 21 10 0 0 0 0 0 0 0 4 0 0 9 0 0 5 19 2 39 0 0Sulfolobus_tokodaii_uid246 0 4 0 0 0 0 3 0 4 0 0 0 0 0 0 0 0 0 0 5 0 0 10 6 5 9 0 0Sulfuricurvum_kujiense_DSM_16994_uid43399 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Sulfurihydrogenibium_YO3AOP1_uid18889 0 0 0 0 0 0 0 0 0 0 0 0 0 0 22 0 21 0 0 0 0 0 0 0 0 0 0 0Sulfurihydrogenibium_azorense_Az_Fu1_uid12529 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 7 0 0 0 0 0 0 0Sulfurimonas_autotrophica_DSM_16294_uid31347 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0Sulfurospirillum_deleyianum_DSM_6946_uid29529 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 8 0 0 0 0 0 0 0 0 0 0Sulfurovum_NBC37-­‐1_uid18965 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 2 0 0Symbiobacterium_thermophilum_IAM14863_uid12994 11 0 1 0 0 0 0 0 0 0 0 0 0 0 17 0 1 4 0 0 0 0 0 0 0 0 0 0Synechococcus_CC9311_uid12530 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechococcus_CC9605_uid13643 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechococcus_CC9902_uid13655 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechococcus_PCC_7002_uid28247 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 6 0Synechococcus_RCC307_uid13654 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechococcus_WH_7803_uid13642 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechococcus_elongatus_PCC_6301_uid13282 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechococcus_elongatus_PCC_7942_uid10645 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechococcus_sp_WH8102_uid230 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Synechocystis_PCC6803_uid60 0 0 0 0 0 0 0 22 0 0 0 0 0 0 0 3 0 0 0 0 0 0 3 0 0 13 10 0Synechocystis_PCC_6803_uid67081 0 0 0 0 0 0 0 20 0 0 0 0 0 0 0 4 0 0 0 0 0 0 3 0 0 6 5 0Synergistetes_bacterium_SGP1_uid45959 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 1 0 0Syntrophobacter_fumaroxidans_MPOB_uid13013 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 3 0 0 0 0Syntrophobotulus_glycolicus_DSM_8271_uid38111 0 0 0 0 0 0 0 2 0 0 0 0 0 0 4 0 4 3 0 0 0 0 0 0 0 0 0 0Syntrophomonas_wolfei_Goettingen_uid13014 1 0 0 0 2 0 0 0 0 0 0 0 0 0 7 0 0 4 0 0 0 3 0 2 0 0 0 2Thauera_MZ1T_uid20091 2 0 0 5 1 6 0 0 0 2 7 0 0 0 0 0 0 12 0 0 0 0 0 0 0 2 0 4Syntrophothermus_lipocalidus_DSM_12680_uid37873 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Syntrophus_aciditrophicus_SB_uid16258 0 0 0 12 0 0 0 3 2 0 0 0 0 0 2 0 7 0 0 0 0 0 0 2 0 1 0 0Taylorella_equigenitalis_MCE9_uid61285 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Tepidanaerobacter_Re1_uid50697 0 0 0 0 4 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 4 0 0 0 0Teredinibacter_turnerae_T7901_uid30839 6 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 6 0 2 0 0 0 0Terriglobus_saanensis_SP1PR4_uid48971 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0Thermaerobacter_marianensis_DSM_12885_uid38025 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0Thermanaerovibrio_acidaminovorans_DSM_6589_uid29531 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermincola_JR_uid41467 13 0 0 0 0 0 0 0 7 0 0 0 0 7 0 0 2 0 0 0 0 0 0 0 0 0 0 0Thermoanaerobacter_X513_uid32613 0 0 0 0 0 1 0 0 2 0 0 3 0 5 0 0 0 0 0 0 0 0 0 7 1 0 0 0Thermoanaerobacter_X514_uid16394 0 0 0 0 0 1 0 0 0 0 0 3 0 5 0 0 0 0 0 0 0 0 0 7 1 0 0 0

Page 41: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Thermoanaerobacter_brockii_finnii_Ako_1_uid32585 0 0 0 0 0 3 0 0 0 0 0 0 0 5 12 0 2 0 0 0 0 0 0 9 1 0 0 0Thermoanaerobacter_italicus_Ab9_uid33157 2 0 0 0 0 0 0 0 0 0 0 0 0 0 16 0 2 0 0 0 0 0 0 4 4 0 0 0Thermoanaerobacter_mathranii_A3_uid33329 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0Thermoanaerobacter_pseudethanolicus_ATCC_33223_uid13901 0 0 0 0 0 3 0 0 0 0 0 0 0 8 12 0 2 0 0 0 0 0 0 12 1 0 0 0Thermoanaerobacter_tengcongensis_uid249 0 0 0 0 0 1 0 0 1 0 0 2 0 0 17 0 0 0 0 5 0 0 0 20 7 0 0 0Thermoanaerobacter_wiegelii_Rt8_B1_uid42251 15 0 0 0 0 0 0 0 0 0 0 10 0 10 8 0 1 0 0 0 0 0 0 11 0 0 0 0Thermoanaerobacterium_thermosaccharolyticum_DSM_571_uid33165 0 0 0 0 0 0 0 0 0 0 0 0 0 2 3 3 0 5 0 0 0 0 0 6 0 0 2 0Thermoanaerobacterium_xylanolyticum_LX_11_uid50295 0 0 0 0 0 1 0 0 0 0 0 0 0 0 5 9 0 0 0 0 0 0 0 9 0 0 10 0Thermobaculum_terrenum_ATCC_BAA_798_uid29523 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermobifida_fusca_YX_uid94 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermobispora_bispora_DSM_43833_uid20737 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermococcus_4557_uid67883 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermococcus_barophilus_MP_uid19379 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermococcus_gammatolerans_EJ3_uid33671 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermococcus_kodakaraensis_KOD1_uid13213 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0Thermococcus_onnurineus_NA1_uid20773 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermococcus_sibiricus_MM_739_uid34531 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0Thermocrinis_albus_DSM_14484_uid37275 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermodesulfatator_indicus_DSM_15286_uid40057 0 0 0 0 0 0 0 0 0 0 0 0 0 0 9 0 1 0 0 0 0 0 0 6 0 5 0 0Thermodesulfobacterium_OPB45_uid67515 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermodesulfobium_narugense_DSM_14796_uid46673 0 0 0 0 0 0 0 0 6 0 0 0 0 0 7 0 0 0 0 0 0 0 0 0 0 0 0 0Thermodesulfovibrio_yellowstonii_DSM_11347_uid30733 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0Thermofilum_pendens_Hrk_5_uid16331 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermomicrobium_roseum_DSM_5159_uid32569 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermomonospora_curvata_DSM_43183_uid20825 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermoplasma_acidophilum_uid110 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0Thermoplasma_volcanium_uid206 0 0 0 0 1 0 1 0 1 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 0 0Thermoproteus_neutrophilus_V24Sta_uid15645 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Thermoproteus_uzoniensis_768_20_uid64455 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermosediminibacter_oceani_DSM_16646_uid30983 29 0 0 0 0 0 0 0 0 0 0 0 0 5 2 0 1 16 0 0 0 2 0 9 0 0 0 0Thermosipho_africanus_TCF52B_uid27767 0 0 0 0 0 0 2 0 0 0 0 0 0 0 18 0 0 0 0 0 0 17 0 2 0 0 0 0Thermosipho_melanesiensis_BI429_uid17249 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermosphaera_aggregans_DSM_11486_uid36571 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermosynechococcus_elongatus_uid308 0 0 0 0 0 0 29 0 0 0 0 0 0 14 0 0 0 0 0 0 0 0 0 0 0 8 1 0Thermotoga_RQ2_uid19543 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermotoga_lettingae_TMO_uid15644 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermotoga_maritima_uid111 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermotoga_naphthophila_RKU_10_uid33663 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermotoga_neapolitana_DSM_4359_uid21023 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermotoga_petrophila_RKU-­‐1_uid17089 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermotoga_thermarum_DSM_5069_uid41517 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 6 0 0 0 0 0 0 0 0 0 0 0Thermovibrio_ammonificans_HB_1_uid49403 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thermus_scotoductus_SA_01_uid46293 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 1 1 0 0 0 0 0 0 0Thermus_thermophilus_HB27_uid10617 0 0 0 0 4 0 0 0 0 0 0 0 0 0 3 2 1 0 0 0 0 1 0 3 0 6 0 0Thermus_thermophilus_HB8_uid13202 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 3 0 7 0 0Thermus_thermophilus_SG0_5JP17_16_uid51619 0 0 0 0 0 0 0 4 1 0 0 0 0 0 19 0 2 0 0 0 0 1 0 0 0 0 0 0Thioalkalimicrobium_cyclicum_ALM1_uid52629 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thioalkalivibrio_HL_EbGR7_uid29177 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 10 2 0Thioalkalivibrio_K90mix_uid30759 0 0 0 3 0 0 0 2 0 0 0 0 0 0 0 0 8 0 0 0 0 7 0 10 0 3 0 0Thiobacillus_denitrificans_ATCC_25259_uid13025 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Thiomicrospira_crunogena_XCL-­‐2_uid13018 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0Thiomicrospira_denitrificans_ATCC_33889_uid13019 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0Thiomonas_3As_uid50687 13 0 1 1 6 1 0 1 0 1 0 0 0 0 0 0 0 15 0 0 0 3 0 0 0 4 0 6Thiomonas_intermedia_K12_uid33641 4 0 0 0 3 0 0 0 0 2 0 0 0 0 0 0 0 8 0 0 0 16 0 0 0 4 0 2Tolumonas_auensis_DSM_9187_uid33873 0 0 0 0 0 0 0 4 0 0 0 0 0 0 0 0 0 2 0 0 0 3 0 0 0 0 0 0Treponema_azotonutricium_ZAS_9_uid30191 7 0 0 0 0 0 0 0 10 0 0 2 0 0 1 0 9 0 0 0 0 4 0 0 0 0 0 0Treponema_brennaborense_DSM_12168_uid53887 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0 0 0 0 0 3 0 0 0 0Treponema_denticola_ATCC_35405_uid4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 0 0 0 0 0 0 0 0Treponema_pallidum_Chicago_uid39981 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Treponema_pallidum_SS14_uid20067 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Treponema_pallidum_uid5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 42: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Treponema_paraluiscuniculi_Cuniculi_A_uid30657 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Treponema_primitia_ZAS_2_uid30189 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 6 0 0 0 0 0 0 0 0 0 0 0Treponema_succinifaciens_DSM_2489_uid50825 0 0 0 0 0 0 0 0 0 0 18 0 0 0 0 0 0 0 0 0 1 10 0 16 0 0 0 0Trichodesmium_erythraeum_IMS101_uid318 0 0 0 0 0 0 0 9 0 0 0 0 0 0 21 0 0 0 0 0 0 0 0 0 0 13 0 0Tropheryma_whipplei_TW08_27_uid354 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Tropheryma_whipplei_Twist_uid95 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Truepera_radiovictrix_DSM_17093_uid38371 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 8 0 0Tsukamurella_paurometabola_DSM_20162_uid29399 0 0 0 0 0 0 0 0 0 0 0 10 0 0 0 0 9 4 0 0 0 23 0 6 0 0 0 0Ureaplasma_parvum_serovar_3_ATCC_27815_uid19087 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ureaplasma_urealyticum_serovar_10_ATCC_33699_uid20247 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Ureaplasma_urealyticum_uid101 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Variovorax_paradoxus_EPS_uid43457 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 12 0 8 0 0 0 0Variovorax_paradoxus_S110_uid30453 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0 0 0 3 0 0 0 0 0 6 0 1 0 0Veillonella_parvula_DSM_2008_uid21091 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Verminephrobacter_eiseniae_EF01-­‐2_uid17187 4 0 0 0 0 0 0 16 0 1 7 0 0 3 0 0 0 0 0 0 0 3 0 18 0 11 5 0Verrucosispora_maris_AB_18_032_uid59863 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 4 0 0 0 0 0 5 0 0 0 2 0 0Vibrio_Ex25_uid40507 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Vibrio_cholerae_LMA3894_4_uid61113 0 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 2 0 0 2 0 0 0 1 0 0Vibrio_cholerae_M66_2_uid32851 0 0 0 0 0 0 4 0 0 0 1 0 0 1 0 0 0 0 0 0 0 6 0 0 0 3 0 0Vibrio_cholerae_MJ_1236_uid33555 0 0 0 0 0 0 4 0 0 0 1 0 0 1 2 0 0 0 1 0 0 4 0 0 0 4 0 0Vibrio_cholerae_O395_uid15667 0 0 0 0 0 0 7 1 0 0 1 0 0 2 2 0 0 0 0 0 0 5 0 0 0 4 0 0Vibrio_cholerae_O395_uid32853 0 0 0 0 0 0 7 1 0 0 1 0 0 2 2 0 0 0 0 0 0 5 0 0 0 4 0 0Vibrio_cholerae_uid36 0 0 0 0 0 0 4 0 0 0 1 0 0 1 0 0 0 0 0 0 0 6 0 0 0 4 0 0Vibrio_fischeri_ES114_uid12986 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Vibrio_fischeri_MJ11_uid19393 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Vibrio_furnissii_NCTC_11218_uid53247 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0Vibrio_harveyi_ATCC_BAA-­‐1116_uid19857 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 3 0 0 0 1 0 0 0 22 0 0Vibrio_parahaemolyticus_uid360 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 4 0 0Vibrio_splendidus_LGP32_uid32815 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 10 0 3 0 0 0 0Vibrio_vulnificus_CMCP6_uid349 0 0 0 0 0 0 0 5 0 0 0 8 0 1 0 0 0 0 0 0 0 0 0 0 0 3 9 0Vibrio_vulnificus_MO6_24_O_uid59881 0 0 0 0 0 0 0 4 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 0Vibrio_vulnificus_YJ016_uid1430 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 1 0 0 0 0 1 0 0 5 9 0Vulcanisaeta_distributa_DSM_14429_uid32589 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Vulcanisaeta_moutnovskia_768_28_uid63027 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Waddlia_chondrophila_WSU_86_1044_uid43761 0 0 0 0 0 0 0 8 0 0 7 0 0 0 1 0 10 2 0 0 1 3 5 0 0 3 0 0Weeksella_virosa_DSM_16922_uid50581 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 0 0Weissella_koreensis_KACC_15510_uid67979 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Wigglesworthia_brevipalpis_uid274 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Wolbachia_endosymbiont_of_Brugia_malayi_TRS_uid12475 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Wolbachia_endosymbiont_of_Culex_quinquefasciatus_Pel_uid30313 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 20 0 0 0 0 0 0 3 0 44 0 0Wolbachia_endosymbiont_of_Drosophila_melanogaster_uid272 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 13 3 0Wolbachia_wRi_uid33273 0 0 0 0 0 0 0 0 0 0 0 0 0 7 47 0 0 0 0 0 0 0 0 1 0 20 2 0Wolinella_succinogenes_uid445 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 13 0 0 0 0 0 0Xanthobacter_autotrophicus_Py2_uid15756 0 0 0 0 0 5 0 0 0 1 0 0 0 0 3 3 0 0 0 0 0 10 0 12 0 0 0 4Xanthomonas_axonopodis_citrumelo_F1_uid62495 0 0 2 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 0 0 17 0 0 0 1 0 0Xanthomonas_campestris_8004_uid15 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 23 0 0 0 33 5 0Xanthomonas_campestris_ATCC_33913_uid296 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 22 0 0 0 31 10 0Xanthomonas_campestris_B100_uid29801 0 0 1 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 27 0 0 0 22 4 0Xanthomonas_campestris_raphani_756C_uid63187 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 17 0 0 0 22 1 0Xanthomonas_campestris_vesicatoria_85-­‐10_uid13649 0 0 2 0 0 0 0 0 0 2 0 0 0 0 0 0 1 0 0 0 0 49 0 0 4 9 0 0Xanthomonas_citri_uid297 0 0 0 0 0 0 0 0 0 3 0 0 0 0 2 0 0 0 0 0 0 35 0 0 0 1 9 0Xanthomonas_oryzae_KACC10331_uid12931 0 0 75 18 0 0 0 32 0 0 0 20 0 0 0 40 8 0 0 0 0 7 0 0 0 171 17 0Xanthomonas_oryzae_MAFF_311018_uid16297 0 0 64 23 0 0 0 7 0 0 0 19 0 0 0 39 8 0 0 0 0 28 0 0 0 154 9 0Xanthomonas_oryzae_PXO99A_uid28127 0 0 82 14 0 0 0 31 0 0 0 24 0 0 0 59 8 0 0 0 0 5 0 0 0 136 11 0Xenorhabdus_bovienii_SS_2004_uid13399 0 0 0 0 0 0 3 35 0 0 6 8 0 0 0 0 0 0 0 0 2 2 0 0 5 12 2 2Xenorhabdus_nematophila_ATCC_19061_uid13400 0 0 0 0 0 0 0 33 0 0 13 0 0 0 1 2 0 0 0 0 0 0 0 9 5 6 3 6Xylanimonas_cellulosilytica_DSM_15894_uid19715 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0Xylella_fastidiosa_GB514_uid49399 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Xylella_fastidiosa_M12_uid17823 0 0 0 0 0 0 0 0 0 0 0 0 0 0 4 0 0 0 0 2 0 0 0 0 0 0 0 0Xylella_fastidiosa_M23_uid18457 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0Xylella_fastidiosa_Temecula1_uid285 0 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 43: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Xylella_fastidiosa_uid271 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0Yersinia_enterocolitica_8081_uid190 2 0 0 0 0 0 3 0 0 0 0 0 0 0 1 0 1 0 0 0 0 17 0 14 0 0 11 0Yersinia_enterocolitica_palearctica_105_5R_r__uid53341 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 12 5 0 0 0 4 1 51 1 0 0 0Yersinia_enterocolitica_palearctica_Y11_uid60483 6 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 6 0 0 0 6 0 56 1 0 0 0Yersinia_pestis_A1122_uid67155 0 0 0 0 0 0 63 1 0 0 0 0 0 0 0 0 22 47 0 0 0 8 1 1 0 0 0 0Yersinia_pestis_Angola_uid16067 0 0 0 0 0 0 97 0 0 0 0 0 0 0 0 0 38 48 0 0 0 11 1 1 1 0 0 0Yersinia_pestis_Antiqua_uid16645 0 0 0 0 0 0 64 0 0 0 0 0 0 0 0 0 22 74 0 0 0 9 1 1 1 0 0 0Yersinia_pestis_CO92_uid34 0 0 0 0 0 0 64 0 0 0 0 0 0 0 0 0 22 48 0 0 0 9 1 1 1 0 0 0Yersinia_pestis_D106004_uid36507 0 0 0 0 0 0 60 0 0 0 0 0 0 0 0 0 22 32 0 0 0 9 1 1 1 0 0 0Yersinia_pestis_D182038_uid36545 0 0 0 0 0 0 62 0 0 0 0 0 0 0 0 0 22 36 0 0 0 9 1 1 1 0 0 0Yersinia_pestis_KIM_10_uid288 0 0 0 0 0 0 51 1 0 0 0 0 0 0 0 0 20 37 0 0 0 10 1 1 0 0 0 0Yersinia_pestis_Nepal516_uid16646 0 0 0 0 0 0 61 1 0 0 0 0 0 0 0 0 24 32 0 0 0 10 1 1 0 0 0 0Yersinia_pestis_Pestoides_F_uid16700 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 22 26 0 0 0 9 1 1 0 0 0 0Yersinia_pestis_Z176003_uid36547 0 0 0 0 0 0 61 0 0 0 0 0 0 0 0 0 22 31 0 0 0 8 1 1 1 0 0 0Yersinia_pestis_biovar_Medievalis_Harbin_35_uid30505 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 22 34 0 0 0 8 1 1 1 0 0 0Yersinia_pestis_biovar_Microtus_91001_uid10638 0 0 0 0 0 0 44 0 0 0 0 0 0 0 0 0 23 35 0 0 0 9 1 1 1 0 0 0Yersinia_pseudotuberculosis_IP32953_uid12950 0 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 7 5 0 0 0 8 1 0 1 0 0 0Yersinia_pseudotuberculosis_IP_31758_uid16070 0 0 0 0 0 0 15 0 0 0 0 0 0 0 0 0 3 0 0 0 0 0 2 0 0 0 0 0Yersinia_pseudotuberculosis_PB1__uid28745 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 8 6 0 0 0 9 1 0 0 0 0 0Yersinia_pseudotuberculosis_YPIII_uid28743 0 0 0 0 0 0 14 0 0 0 0 0 0 0 0 0 5 0 0 0 0 2 1 0 0 0 0 0Zunongwangia_profunda_SM_A87_uid38641 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 2 0 0 0 0 16 0 2 0 0 2 2Zymomonas_mobilis_ATCC_10988_uid30987 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 34 0 0Zymomonas_mobilis_NCIMB_11163_uid34821 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0Zymomonas_mobilis_pomaceae_ATCC_29192_uid34927 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0alpha_proteobacterium_IMCC1322_uid28081 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0butyrate_producing_bacterium_SM4_1_uid45955 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0butyrate_producing_bacterium_SS3_4_uid39155 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0butyrate_producing_bacterium_SSC_2_uid45957 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0cyanobacterium_UCYN_A_uid30917 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0halophilic_archaeon_DL31_uid52855 0 31 0 0 0 0 13 0 5 0 0 0 0 0 0 0 0 0 0 3 0 0 0 0 2 1 4 16marine_bacterium_HP15_uid46089 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0uncultured_Termite_group_1_bacterium_phylotype_Rs_D17_uid20871 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0uncultured_methanogenic_archaeon_RC-­‐I_uid19641 0 0 0 5 6 0 0 1 4 0 0 0 0 0 5 0 0 0 0 0 0 0 0 0 0 0 0 0

Page 44: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Supplemental  Table  4

Family Test  Statistic Control P-­‐valueIS1182 0.238569796 0.282524132 0ISH3 0.223926818 0.283855563 0IS1595 0.249433554 0.282423267 0ISL3 0.243848718 0.282349206 0IS1634 0.296352681 0.282691323 0.918IS1380 0.237200357 0.282375409 0IS200/IS605 0.267753244 0.28268475 0IS630 0.273797685 0.282652734 0.053Tn3 0.218156373 0.281940996 0ISAs1 0.20109524 0.282410971 0IS30 0.236087124 0.282203052 0ISAzo13 0.270793522 0.282552668 0.364IS481 0.246110155 0.282473862 0IS701 0.25997134 0.282887453 0.007IS256 0.267271578 0.282235139 0.001IS21 0.249356118 0.282113021 0IS91 0.198051999 0.282862756 0IS607 0.294306711 0.282858576 0.885IS982 0.261764268 0.282381155 0.012IS3 0.241419414 0.282345855 0IS1 0.196650011 0.281866199 0IS110 0.256266433 0.282405487 0IS6 0.265807761 0.282547972 0.006IS5 0.274152533 0.282473901 0.006IS4 0.244476031 0.282728744 0IS66 0.247286538 0.282069397 0

Page 45: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

Supplemental  Table  5

Coefficient Standard  Error P-­‐valueIS1595 3.1284 0.0676 0IS1380 2.9623 0.068 0ISAs1 2.2423 0.0601 0IS1 2.1932 0.0589 0IS6 2.1668 0.0657 0IS701 2.1416 0.0779 0IS30 2.1075 0.0638 0IS1634 1.9196 0.1306 0IS5 1.8897 0.0598 0Intraspecies 1.8467 0.01 0IS200/IS605 1.8111 0.0601 0IS3 1.6914 0.0587 0ISL3 1.5838 0.0647 0IS256 1.5703 0.0629 0ISH3 1.5003 0.0859 0IS110 1.462 0.0618 0IS1182 1.2534 0.0653 0IS481 1.0648 0.0731 0ISAzo13 1.0254 0.7121 1IS4 0.8917 0.0643 0IS630 0.817 0.0676 0IS982 0.7425 0.1132 0IS21 0.5041 0.0652 0IS607 0.3315 0.1644 1IS66 0.2476 0.0659 0.005Tn3 -­‐0.0327 0.3238 1Distance -­‐0.3834 0.0025 0IS91 -­‐0.9056 0.1412 0Intercept -­‐6.9336 0.0595 0

Page 46: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

0 20 40 60 80 100Tolerance: maximum distance of IS ends from benchmark ends

0.0

0.1

0.2

0.3

0.4

0.5

0.6Sensi

tivit

yOASIS and IScan sensitivity by error tolerance

OASISIScan

Page 47: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,
Page 48: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

OASIS finds multiple-copy IS elements in each genome by identifying con- served regions surrounding transposase genes. First, groups of already-annotated transposase genes that could compose multiple copies of an IS were identified by length and sequence similarity. Those that fit a high similarity threshold were assumed to be in the same group.

The edges of each element were then found by finding a region of conservation around each group of transposases. The windows upstream and downstream of a group of transposases were compared. We are thus given a set of sequences and asked to find the edge at which conservation ends, allowing for mismatches within IS elements, in a more computationally efficient matter than a multiple alignment. The edge of conservation was found via the following maximum likelihood approach.

We assumed that upstream and downstream regions each consist of a contiguous conserved region (within the IS) and an unconserved region (outside the IS). For computational efficiency, it was assumed that there were no gap mutations in the conserved region. Consider the set of upstream or downstream n-windows, in a group that contains m transposases. Let k be the length of the conserved region- the length which the IS element extends past the transposase on this side.

The first k characters of the sequences each come from multinomial distributions that are identical across the m sequences and are biased heavily towards a single nucleotide (the true sequence of the IS element). Let cj be the consensus nucleotide for the jth character of the true sequence, and let xi,j represent the jth character of the ith sequence.

xij ~Multinom(pcj) if i k

Multinom(pb) if i k

where pcj represents the distribution of a conserved nucleotide with consensus value cj, and pb represents a background distribution vector (the distribution of nucleotides in unconserved regions). pc can be any of 4 vectors that are biased towards a consensus nucleotide, with some probability of error. Let c, ε be adjustable parameters specifying the probability of a consensus match or an error in a conserved region.

The value of k is then found that maximizes the likelihood of the set of sequences. The likelihood is found by a dynamic programming algorithm as follows. For each nucleotide N ∈ {A, G, C, T }, let Ni be the count of nucleotides N in x1...m,i. Also let Mi be the count of the most common nucleotide (the MLE estimate of the consensus nucleotide in a conserved region). The log-likelihood function is then

l(k) log(m!

Ai!Ci!Ti!Gi!cM i

i1

k

mM i

) log(m!

Ai!Ci!Ti!Gi!(1

4)m

i k1

n

)

Our estimate of k is thus

Page 49: OASIS: an automated program for global investigation of ... · OASIS: an automated program for global investigation of bacterial and archaeal insertion sequences David G. Robinson1,

ˆ k argmax

kl(k)

OASIS computes this through a dynamic programming algorithm. The putative edge of the IS element is then marked as nucleotides from the edge of the transposase. This process is repeated for the windows upstream and downstream of the repeated transposase, for each group of transposases in the genome.

In some cases a small percentage of the ISs in a group are shorter than the rest (particularly partial elements such as those created by subsequent genome rearrangements). Thus, each transposase 1 to m has its window individually shortened to all possible lengths 1 to n while keeping the rest of the windows at the same length. The above calculation of likelihood is repeated under this assumption, and any changes that are found to improve the likelihood are performed. Since the number of shortened ISs is almost always a small fraction of the group, this tends to find the same solution as an exhaustive search of all nm window lengths, and does so much more efficiently.

Once the edge of conservation has been found, the edges were then checked for inverted repeats using a Smith-Waterman alignment between the regions surrounding either edge of an IS element. If inverted repeats are found that disagree with the putative edges, the edges are adjusted to a limited extent.

Single-copy IS elements were found separately by finding transposases that were not placed in multiple-copy groups, and checking for inverted repeats in the surrounding regions. A Smith-Waterman alignment of the upstream and reverse-complement of the downstream regions was used to recognize significant inverted repeats and thus possible edges, a method first developed by IScan.

Once groups of IS elements were identified, a sample IS element from the set is selected, and blastn (NCBI) was used against the genome sequence to identify missing and partial copies of the IS element.

The family and group of each IS element is identified using blastp on the identified IS element ORFs against the ISFinder database, and classifying based on the family and group of the best match.