The minimum free energy of the optimal structure
for the whole RNA sequence is the minimum value
of {min
x
{S
odd
D
(1, x, x + 1, n)}, min
y<x
3
{S
even
D
(1, y, y +
1, x
3
, x
3
)}}.
From the real data, the distance between x
3
and
the end of the sequence is usually bounded by a small
constant, so we assume that the number of different
x
3
values we need to consider is only a small con-
stant. The time complexity of the above algorithm is
O(m
4
). The memory complexity of the algorithm is
also O(m
4
) .
5 CONCLUSIONS
In this paper, we consider a new class of pseudoknots
which include more complicated structures that none
of the existing algorithms can handle. We then pro-
vide an O(m
4
) time algorithm for predicting these
a structure of degree 4 with minimum free energy
which already covers all known secondary structures
of this class in existing databases. We implemented
our algorithm and the running time is reasonable,
which takes about 70sec for a RNA of length about
100 and about 3 times faster than the one in (Rivas
and Eddy, 1999). We will evaluate the accuracy of
the predicted structures once we can locate a set of
appropriate parameters for the energy model. In fact,
there are not many known RNAs with simple non-
standard pseudoknots. One of the reasons may be due
to the limitation of existing computational prediction
tools. With our algorithm, we may be able to pre-
dict more RNAs with such a structure for follow-up
verification. Although there are no other more com-
plicated known pseudoknot structures, there is a high
chance that there exist novelRNAs with more compli-
cated structures, so designing efficient prediction al-
gorithms for more complicated pseudoknot structures
remains an important open problem.
REFERENCES
Akutsu, T. (2000). Dynamic programming algorithms for
rna secondary structure prediction with pseudoknots.
Discrete Applied Mathematics, 104:45–62.
Chen, H., Condon, A., and Jabbari, H. (2009). An o(n
5
)
algorithm for mfe prediction of kissing hairpins and
4- chains in nucleic acids. Journal of Computational
Biology, 16(6):803–815.
Deogun, J., Donis, R., Komina, O., and Ma, F. (2004).
Rna secondary structure prediction with simple pseu-
doknots. In Proceedings of the second confer-
ence on Asia- Pacific Bioinformatics Conference
(APBC2004), pages 239–246.
Dirks, R. and Pierce, N. (2003). A partition function
algorithm for nucleic acid secondary structure in-
cluding pseudoknots. Journal of Comput. Chem,
24(13):1664–1677.
Frank, D. and Pace, N. (1998). Ribonuclease p: unity and
diversity in a trna processing ribozyme. Annu. Rev.
Biochem., 67:153–180.
Gluick, T. and Draper, D. (1994). Thermodynamics of
folding a pseudoknotted mrna fragment. Journal of
Molecular Biology, 241:246–262.
Lyngso, R. and Pedersen, C. (2000). A dynamic program-
ming algorithm for rna structure prediction including
pseudoknots. In Proc. of the Fourth Annual Interna-
tional Conferences on Compututational Molecular Bi-
ology (RECOMB00). ACM Press.
Nguyen, V., Kiss, T., Michels, A., and Bensaude, O. (2001).
7sk small nuclear rna binds to and inhibits the activity
of cdk9/cyclin t complexes. Nature, 414:322–325.
Reeder, J. and Giegerich, R. (2004). Design, implementa-
tion and evaluation of a practical pseudoknot folding
algorithm based on thermodynamics. BMC Bioinfor-
matics, 5:104.
Rivas, E. and Eddy, S. (1999). A dynamic programming
algorithm for rna structure prediction including pseu-
doknots. Journal of Molecular Biology, 285(5):2053–
2068.
Roland, E. (2006). Pseudoknots in rna secondary structures:
representation, enumeration, and prevalence. Journal
of Computational Biology, 13(6):1197–1213.
Uemura, Y., Hasegawa, A., Kobayashi, S., and Yokomori,
T. (1999). Tree adjoining grammars for rna structure
prediction. Theoretical Computer Science, 210:277–
303.
Yang, Z., Zhu, Q., Luo, K., and Zhou, Q. (2001). The 7sk
small nuclear rna inhibits the cdk9/cyclin t1 kinase to
control transcription. Nature, 414:317–322.
BIOINFORMATICS 2010 - International Conference on Bioinformatics
38