flora which reached pathogenic enterobacteria. An
interference of GIs from this new channel (the cell
E2) with the conventional oscillations of
pathogenicity GIs of Enterobacteria (the cells A2-C6
in Fig. 1 and 2) led to appearance of the new deadly
E. coli EAHEC in Europe in 2011.
4 CONCLUSIONS
Repeated outbreaks of new pathogens revealed our
ignorance on the principles of horizontal gene
exchange and the evolution of pathogenic bacteria.
The strain E. coli TY-2482 from the latest outbreak
has been quickly isolated and sequenced that
allowed the discovery of its closest relatives but it
failed to resolve the origin of this strain and the way
of its evolution that made impossible for us to
predict upcoming outbreaks in future. Inability to
answer these questions showed limitations of the
current methods of comparative and evolutionary
genomics.
In this work several innovative approaches of
genome linguistics based on the analysis of the
biased distribution of tetranucleotide were
introduced. These methods were used for clustering
of GIs generated from the same source, estimation of
the relative time of GI insertions and reconstruction
of donor-recipient relations. The genome linguistic
approaches gain more credibility when used in
parallel with the traditional methods of sequence
similarity comparison.
It was found that the recurrent appearance of new
pathogens may be associated with the regular
oscillations of GI vectors. Pathogens may gain an
increased virulence when they are reached by GIs
from unusual sources. There is a pressing need to
create a system that will allow the monitoring of
distributions of horizontally transferred GIs, to aid
us in being informed and prepared regarding the
emergence of new pathogens.
ACKNOWLEDGEMENTS
This work was funded by the National Research
Foundation (South Africa) grant #71261 for
National Bioinformatics and Functional Genomics
Programme.
REFERENCES
Abe, T., Kanaya, S., Kinouchi, M., Ichiba, Y., Kozuki, T.,
et al., 2003. Informatics for unveiling hidden genome
signatures. Genome Res. 13: 693-702.
Brzuszkiewicz, E., Thürmer, A., Schuldes, J., Leimbach,
A., Liesegang, H., et al., 2011. Genome sequence
analyses of two isolates from the recent Escherichia
coli outbreak in Germany reveal the emergence of a
new pathotype: Entero-Aggregative-Haemorrhagic
Escherichia coli (EAHEC). Arch. Microbiol.,
doi:10.1007/s00203-011-0725-6.
Ganesan, H., Rakitianskaia, A. S., Davenport, C. F.,
Tümmler, B., Reva, O. N. 2008. The SeqWord
Genome Browser: an online tool for the identification
and visualization of atypical regions of bacterial
genomes through oligonucleotide usage. BMC
Bioinformatics 9: 333.
Langille, M. G., Brinkman, F. S., 2009. IslandViewer: an
integrated interface for computational identification
and visualization of genomic islands. Bioinformatics
25: 664-665.
Lawrence, J. G., Ochman, H., 1997. Amelioration of
bacterial genomes: rates of change and exchange. J.
Mol. Evol. 44: 383-397.
Levings, R. S., Partridge, S. R., Djordjevic, S. P., Hall, R.
M., 2007. SGI1-K, a variant of the SGI1 genomic
island carrying a mercury resistance region, in
Salmonella enterica serovar Kentucky. Antimicrob.
Agents Chemother. 51: 317-323.
Manrique, M., Pareja-Tobes, P., Pareja-Tobes, E., Pareja,
E., Tobes, R. 2011. Escherichia coli EHEC Germany
outbreak preliminary functional annotation using BG7
system. Nut. Preceedings, doi:10.1038/npre.2011.
6001.1.
Ramaiah, N., De, J., 2003. Unusual rise in mercury-
resistant bacteria in coastal environs. Microb. Ecol. 45:
444-454.
Reva, O. N., Tümmler, B., 2005. Differentiation of regions
with atypical oligonucleotide composition in bacterial
genomes. BMC Bioinformatics 6: 251.
Van Passel, M. W., Bart, A., Luyf, A. C., van Kampen, A.
H. van der Ende, A., 2006. The reach of the genome
signature in prokaryotes. BMC Evol. Biol. 6: 84.
APPLICATION OF GENOME LINGUISTIC APPROACHES FOR IDENTIFICATION OF GENOMIC ISLAND IN
BACTERIAL GENOMES AND TRACKING DOWN THEIR ORIGINS - Genome Linguistics to Visualize Horizontal
Gene Exchange
123