Search of Possible Insertions in Bacterial Genes

Eugene Korotkov, Yulia Suvorova, Maria Korotkova


Nucleotide sequences are known to be heterogeneous, which gives rise to the task of segmenting a sequence into homogeneous parts separated by points called change points. In this work we investigated a special case of change points in genes: paired change points (PCP). We used a well-known property of coding sequences, triplet periodicity (TP). The sequences of particular interest consist of three successive parts, in which the first and last parts have similar TP and the middle part has a different TP type. We aimed to find genes with PCP and to provide an explanation for the phenomenon. We developed a mathematical method for PCP detection based on a new measure of similarity between TP matrices. Among 66936 genes studied, we found 2700 genes with PCP and 6459 genes with a single change point (SCP). We suppose that PCP could be associated with double fusion or insertion events.
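To make the idea of a TP matrix concrete, the following minimal sketch counts how often each nucleotide occurs at each of the three codon positions, and then builds one such matrix for each of the three parts defined by a pair of candidate change points. It is an illustration under stated assumptions, not the method developed in this work: the function names `tp_matrix` and `segment_tp_matrices`, the 4x3 count layout, and the toy sequence are all hypothetical, and the similarity measure used for PCP detection is not reproduced here.

```python
NUCLEOTIDES = "ACGT"


def tp_matrix(seq, offset=0):
    """Count nucleotides by codon position (hypothetical helper).

    Returns a 4x3 list of lists: rows follow NUCLEOTIDES, columns are
    codon positions 0, 1, 2. `offset` is the position of seq[0] in the
    full gene, so that every part is counted in the same reading frame.
    """
    row_of = {base: i for i, base in enumerate(NUCLEOTIDES)}
    counts = [[0, 0, 0] for _ in NUCLEOTIDES]
    for i, base in enumerate(seq.upper()):
        row = row_of.get(base)
        if row is None:
            continue  # skip ambiguous symbols such as N
        counts[row][(offset + i) % 3] += 1
    return counts


def segment_tp_matrices(seq, x1, x2):
    """Build TP matrices for the three parts split at positions x1 < x2."""
    return (
        tp_matrix(seq[:x1], 0),
        tp_matrix(seq[x1:x2], x1),
        tp_matrix(seq[x2:], x2),
    )


# Toy sequence (an assumption for illustration): the outer parts share
# one periodicity pattern and the middle part has a different one.
toy = "ATG" * 4 + "GTA" * 4 + "ATG" * 4
first, middle, last = segment_tp_matrices(toy, 12, 24)
print(first == last, first == middle)
```

In this toy case the outer matrices are identical and the middle one differs, which is the pattern that a PCP is meant to capture. Real genes would call for a statistical similarity measure rather than exact equality of counts.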


