in weighted sequence. In International Information
Technology Conference (CUBE), pp. 435-440.
Amir, A., Iliopoulos, C. S., Kapah, O., and Porat, E. (2006).
Approximate matching in weighted sequences. In
Combinatorial Pattern Matching (CPM), pp. 365-376.
Christodoulakis, M., Iliopoulos, C. S., Mouchard, L.,
Perdikuri, K., Tsakalidis, A. K., and Tsichlas, K.
(2006). Computation of repetitions and regularities of
biologically weighted sequences. In Journal of Com-
putational Biology (JCB), Volume 13, pp. 1214-1231.
Culpepper, J. S. and Moffat, A. (2010). Efficient set inter-
section for inverted indexing. In ACM Transactions
on Information Systems (TOIS), Volume 29, Article 1.
du Mouza, C., Litwin, W., Rigaux, P., and Schwarz, T. J. E.
(2009). AS-index: a structure for string search using
n-grams and algebraic signatures. In ACM Conference
on Information and Knowledge Management (CIKM),
pp. 295-304.
Gao, J., Goodman, J., Li, M., and Lee, K.-F. (2002). Toward
a unified approach to statistical language modeling for
Chinese. In ACM Transactions on Asian Language
Information Processing, Volume 1, Number 1, pp. 3-33.
Gusfield, D. (1997). Algorithms on Strings, Trees and Se-
quences: Computer Science and Computational Bio-
logy. Cambridge University Press.
Holub, J. and Smyth, W. F. (2003). Algorithms on indeter-
minate strings. In Australasian Workshop on Combi-
natorial Algorithms.
Holub, J., Smyth, W. F., and Wang, S. (2008). Fast pattern-
matching on indeterminate strings. In Journal of Dis-
crete Algorithms, Volume 6, pp. 37-50.
Iliopoulos, C. S., Makris, C., Panagis, Y., Perdikuri, K.,
Theodoridis, E., and Tsakalidis, A. K. (2006). The
weighted suffix tree: An efficient data structure for
handling molecular weighted sequences and its appli-
cations. In Fundamenta Informaticae (FUIN), Volume
71, pp. 259-277.
Kaporis, A. C., Makris, C., Sioutas, S., Tsakalidis, A. K.,
Tsichlas, K., and Zaroliagis, C. D. (2003). Improved
bounds for finger search on a RAM. In ESA, Volume
2832, pp. 325-336.
Kim, M.-S., Whang, K.-Y., and Lee, J.-G. (2007). n-
gram/2L-approximation: a two-level n-gram inverted
index structure for approximate string matching. In
Computer Systems: Science and Engineering, Volume
22, Number 6.
Kim, M.-S., Whang, K.-Y., Lee, J.-G., and Lee, M.-J.
(2005). n-gram/2L: A space and time efficient two-level
n-gram inverted index structure. In International
Conference on Very Large Databases (VLDB),
pp. 325-336.
Lee, J. H. and Ahn, J. S. (1996). Using n-grams for Korean
text retrieval. In ACM SIGIR, pp. 216-224.
Makris, C. and Theodoridis, E. (2011). Algorithms in
Computational Molecular Biology: Techniques, Ap-
proaches and Applications. Wiley Series in Bioinfor-
matics.
Manning, C. D., Raghavan, P., and Schütze, H. (2008).
Introduction to Information Retrieval. Cambridge
University Press.
Marsan, L. and Sagot, M.-F. (2000). Extracting structured
motifs using a suffix tree - algorithms and application
to promoter consensus identification. In International
Conference on Research in Computational Molecular
Biology (RECOMB), pp. 210-219.
Mayfield, J. and McNamee, P. (2003). Single n-gram stem-
ming. In ACM SIGIR, pp. 415-416.
McCreight, E. M. (1976). A space-economical suffix
tree construction algorithm. In Journal of the ACM
(JACM), Volume 23, pp. 262-272.
Millar, E., Shen, D., Liu, J., and Nicholas, C. K. (2000).
Performance and scalability of a large-scale n-gram
based information retrieval system. In Journal of Dig-
ital Information, Volume 1, Number 5.
Navarro, G. and Baeza-Yates, R. A. (1998). A practical q-
gram index for text retrieval allowing errors. In CLEI
Electronic Journal, Volume 1, Number 2.
Navarro, G., Baeza-Yates, R. A., Sutinen, E., and Tarhio,
J. (2001). Indexing methods for approximate string
matching. In IEEE Data Engineering Bulletin, Volume
24, Number 4, pp. 19-27.
Navarro, G., Sutinen, E., Tanninen, J., and Tarhio, J. (2000).
Indexing text with approximate q-grams. In Combina-
torial Pattern Matching (CPM), pp. 350-363.
Ogawa, Y. and Iwasaki, M. (1995). A new character-
based indexing organization using frequency data for
Japanese documents. In ACM SIGIR, pp. 121-129.
Puglisi, S. J., Smyth, W. F., and Turpin, A. (2006). Inverted
files versus suffix arrays for locating patterns in pri-
mary memory. In String Processing and Information
Retrieval (SPIRE), pp. 122-133.
Sun, Z., Yang, J., and Deogun, J. S. (2004). MISAE: A new
approach for regulatory motif extraction. In Computa-
tional Systems Bioinformatics Conference (CSB), pp.
173-181.
Tang, N., Sidirourgos, L., and Boncz, P. A. (2009). Space-
economical partial gram indices for exact substring
matching. In ACM Conference on Information and
Knowledge Management (CIKM), pp. 285-294.
Yang, S., Zhu, H., Apostoli, A., and Cao, P. (2007). N-
gram statistics in English and Chinese: Similarities and
differences. In International Conference on Semantic
Computing (ICSC), pp. 454-460.
Zhang, H., Guo, Q., and Iliopoulos, C. S. (2010a). An al-
gorithmic framework for motif discovery problems in
weighted sequences. In International Conference on
Algorithms and Complexity (CIAC), pp. 335-346.
Zhang, H., Guo, Q., and Iliopoulos, C. S. (2010b). Varieties
of regularities in weighted sequences. In Algorithmic
Aspects in Information and Management (AAIM), pp.
271-280.
WEBIST 2014 - International Conference on Web Information Systems and Technologies