Authors:
Oliver Bezuidt
;
Kingdom Mncube
and
Oleg N. Reva
Affiliation:
University of Pretoria, South Africa
Keyword(s):
Genome linguistics, k-Mer statistics, Oligonucleotide usage pattern, Genomic island, Horizontal gene exchange, Pathogenicity.
Related
Ontology
Subjects/Areas/Topics:
Algorithms and Software Tools
;
Artificial Intelligence
;
Bioinformatics
;
Biomedical Engineering
;
Genomics and Proteomics
;
Sequence Analysis
;
Soft Computing
;
Visualization
Abstract:
With more sequences of complete bacterial genomes getting public availability the approaches of genome comparison by frequencies of oligonucleotides (k-mers) known also as the genome linguistics are becoming popular and practical to resolve problems which can not be tackled by the traditional sequence comparison tools. In this work we present several innovative approaches based on k-mer statistics for detection of inserts of genomic islands and tracing down the ontological links and origins of mobile genetic elements. 637 bacterial genomes were analyzed by SeqWord Sniffer program that has detected 2,622 putative genomic islands. These genomic islands were clustered by DNA compositional similarity. A stratigraphic analysis was introduced that allows distinguishing between new and old genomic inserts. A method of reconstruction of donor-recipient relations between micro-organisms was proposed. The strain E. coli TY-2,482 isolated from the latest deadly outbreak of a haemorrhagic infect
ion in Europe in 2011 was used for the case study. It was shown that this strain appeared on an intersection of two independent fluxes of horizontal gene exchange, one of which is a conventional for Enterobacteria stream of vectors generated in marine gamma-Proteobacteria; and the second is a new channel of antibiotic resistance genomic islands originated from environmental beta-Proteobacteria.
(More)