category) could be tested. Further, since our algo-
rithm only relies on term co-occurrences, its applica-
bility can be easily extended to other languages. In-
stead, to improve the algorithm effectiveness, transfer
learning needs to be strengthened: for this purpose, a
greater number of steps in the Markov chain transition
matrix during the classification phase could help.
REFERENCES
Aue, A. and Gamon, M. (2005). Customizing sentiment
classifiers to new domains: A case study. In Proceed-
ings of recent advances in natural language process-
ing (RANLP), volume 1, pages 2–1.
Blitzer, J., Dredze, M., Pereira, F., et al. (2007). Biogra-
phies, bollywood, boom-boxes and blenders: Domain
adaptation for sentiment classification. In ACL, vol-
ume 7, pages 440–447.
Bollegala, D., Weir, D., and Carroll, J. (2013). Cross-
domain sentiment classification using a sentiment sen-
sitive thesaurus. Knowledge and Data Engineering,
IEEE Transactions on, 25(8):1719–1731.
Cao, G., Nie, J.-Y., and Bai, J. (2007). Using markov chains
to exploit word relationships in information retrieval.
In Large Scale Semantic Access to Content (Text, Im-
age, Video, and Sound), pages 388–402. LE CEN-
TRE DE HAUTES ETUDES INTERNATIONALES
D’INFORMATIQUE DOCUMENTAIRE.
Dai, W., Xue, G.-R., Yang, Q., and Yu, Y. (2007). Co-
clustering based classification for out-of-domain doc-
uments. In Proceedings of the 13th ACM SIGKDD
international conference on Knowledge discovery and
data mining, pages 210–219. ACM.
Dave, K., Lawrence, S., and Pennock, D. M. (2003). Mining
the peanut gallery: Opinion extraction and semantic
classification of product reviews. In Proceedings of
the 12th international conference on World Wide Web,
pages 519–528. ACM.
Deng, Z.-H., Luo, K.-H., and Yu, H.-L. (2014). A study of
supervised term weighting scheme for sentiment anal-
ysis. Expert Systems with Applications, 41(7):3506–
3513.
Domeniconi, G., Masseroli, M., Moro, G., and Pinoli, P.
(2015a). Random perturbations and term weighting of
gene ontology annotations for unknown gene function
discovering. In Fred, A. et al. (eds.) IC3K 2014. CCIS,
volume 553. Springer.
Domeniconi, G., Moro, G., Pasolini, R., and Sartori, C.
(2014). Cross-domain text classification through it-
erative refining of target categories representations. In
Proceedings of the 6th International Conference on
Knowledge Discovery and Information Retrieval.
Domeniconi, G., Moro, G., Pasolini, R., and Sartori, C.
(2015b). Iterative refining of category profiles for
nearest centroid cross-domain text classification. In
Fred, A. et al. (eds.) IC3K 2014. CCIS, volume 553.
Springer.
Domeniconi, G., Moro, G., Pasolini, R., and Sartori, C.
(2015c). A study on term weighting for text catego-
rization: a novel supervised variant of tf.idf. In Pro-
ceedings of the 4th International Conference on Data
Management Technologies and Applications.
Frasconi, P., Soda, G., and Vullo, A. (2002). Hidden
markov models for text categorization in multi-page
documents. Journal of Intelligent Information Sys-
tems, 18(2-3):195–217.
He, Y., Lin, C., and Alani, H. (2011). Automatically ex-
tracting polarity-bearing topics for cross-domain sen-
timent classification. In Proceedings of the 49th An-
nual Meeting of the Association for Computational
Linguistics: Human Language Technologies-Volume
1, pages 123–131. Association for Computational Lin-
guistics.
Jin, W., Ho, H. H., and Srihari, R. K. (2009). Opinionminer:
a novel machine learning system for web opinion min-
ing and extraction. In Proceedings of the 15th ACM
SIGKDD international conference on Knowledge dis-
covery and data mining, pages 1195–1204. ACM.
Jo, Y. and Oh, A. H. (2011). Aspect and sentiment unifica-
tion model for online review analysis. In Proceedings
of the fourth ACM international conference on Web
search and data mining, pages 815–824. ACM.
Li, F. and Dong, T. (2013). Text categorization based on
semantic cluster-hidden markov models. In Advances
in Swarm Intelligence, pages 200–207. Springer.
Li, F., Huang, M., and Zhu, X. (2010). Sentiment analysis
with global topics and local dependency. In AAAI,
volume 10, pages 1371–1376.
Li, L., Jin, X., and Long, M. (2012). Topic correlation anal-
ysis for cross-domain text classification. In AAAI.
Liu, B. (2012). Sentiment analysis and opinion mining.
Synthesis Lectures on Human Language Technologies,
5(1):1–167.
Mei, Q., Ling, X., Wondra, M., Su, H., and Zhai, C. (2007).
Topic sentiment mixture: modeling facets and opin-
ions in weblogs. In Proceedings of the 16th interna-
tional conference on World Wide Web, pages 171–180.
ACM.
Melville, P., Gryc, W., and Lawrence, R. D. (2009). Senti-
ment analysis of blogs by combining lexical knowl-
edge with text classification. In Proceedings of
the 15th ACM SIGKDD international conference on
Knowledge discovery and data mining, pages 1275–
1284. ACM.
Miller, D. R., Leek, T., and Schwartz, R. M. (1999). A hid-
den markov model information retrieval system. In
Proceedings of the 22nd annual international ACM
SIGIR conference on Research and development in in-
formation retrieval, pages 214–221. ACM.
Mittendorf, E. and Sch
¨
auble, P. (1994). Document and pas-
sage retrieval based on hidden markov models. In SI-
GIR94, pages 318–327. Springer.
Nasukawa, T. and Yi, J. (2003). Sentiment analysis: Cap-
turing favorability using natural language processing.
In Proceedings of the 2nd international conference on
Knowledge capture, pages 70–77. ACM.
KDIR 2015 - 7th International Conference on Knowledge Discovery and Information Retrieval
136