Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2019).
BERT: Pre-training of deep bidirectional transformers
for language understanding. In Proceedings of the
2019 Conference of the North American Chapter of
the Association for Computational Linguistics: Human
Language Technologies, Volume 1 (Long and Short
Papers), pages 4171–4186, Minneapolis, Minnesota.
Association for Computational Linguistics.
Gabrilovich, E. and Markovitch, S. (2007). Computing
semantic relatedness using Wikipedia-based explicit
semantic analysis. In Proceedings of the 20th International
Joint Conference on Artificial Intelligence, IJCAI’07,
pages 1606–1611, San Francisco, CA, USA.
Morgan Kaufmann Publishers Inc.
Gysel, C. V., de Rijke, M., and Kanoulas, E. (2018). Neural
vector spaces for unsupervised information retrieval.
ACM Trans. Inf. Syst., 36(4).
Haj-Yahia, Z., Sieg, A., and Deleris, L. A. (2019). Towards
unsupervised text classification leveraging experts and
word embeddings. In Proceedings of the 57th Annual
Meeting of the Association for Computational Linguis-
tics, pages 371–379, Florence, Italy. Association for
Computational Linguistics.
Ko, Y. and Seo, J. (2000). Automatic text categorization
by unsupervised learning. In Proceedings of the 18th
Conference on Computational Linguistics - Volume 1,
COLING ’00, pages 453–459, USA. Association for
Computational Linguistics.
Lau, J. H. and Baldwin, T. (2016). An empirical evaluation of
doc2vec with practical insights into document embed-
ding generation. In Proceedings of the 1st Workshop on
Representation Learning for NLP, pages 78–86, Berlin,
Germany. Association for Computational Linguistics.
Le, Q. and Mikolov, T. (2014). Distributed representations of
sentences and documents. In Xing, E. P. and Jebara, T.,
editors, Proceedings of the 31st International Confer-
ence on Machine Learning, volume 32 of Proceedings
of Machine Learning Research, pages 1188–1196,
Beijing, China. PMLR.
Liu, B., Li, X., Lee, W. S., and Yu, P. S. (2004). Text clas-
sification by labeling words. In McGuinness, D. L.
and Ferguson, G., editors, Proceedings of the Nine-
teenth National Conference on Artificial Intelligence,
Sixteenth Conference on Innovative Applications of
Artificial Intelligence, July 25-29, 2004, San Jose, Cal-
ifornia, USA, pages 425–430. AAAI Press / The MIT
Press.
Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013).
Efficient estimation of word representations in vector
space.
Rao, D., P, D., and Khemani, D. (2006). Corpus based
unsupervised labeling of documents. In Sutcliffe, G.
and Goebel, R., editors, Proceedings of the Nineteenth
International Florida Artificial Intelligence Research
Society Conference, Melbourne Beach, Florida, USA,
May 11-13, 2006, pages 321–326. AAAI Press.
Sievert, C. and Shirley, K. (2014). LDAvis: A method for
visualizing and interpreting topics. In Proceedings of
the Workshop on Interactive Language Learning, Vi-
sualization, and Interfaces, pages 63–70, Baltimore,
Maryland, USA. Association for Computational Lin-
guistics.
Song, Y. and Roth, D. (2014). On dataless hierarchical text
classification. In Proceedings of the Twenty-Eighth
AAAI Conference on Artificial Intelligence, pages 1579–
1585.
Yin, W., Hay, J., and Roth, D. (2019). Benchmarking zero-
shot text classification: Datasets, evaluation and en-
tailment approach. In Proceedings of the 2019 Con-
ference on Empirical Methods in Natural Language
Processing and the 9th International Joint Conference
on Natural Language Processing (EMNLP-IJCNLP),
pages 3914–3923, Hong Kong, China. Association for
Computational Linguistics.
Zhang, X., Zhao, J., and LeCun, Y. (2015). Character-
level convolutional networks for text classification. In
Proceedings of the 28th International Conference on
Neural Information Processing Systems - Volume 1,
NIPS’15, pages 649–657, Cambridge, MA, USA. MIT
Press.
Zhang, Y., Meng, Y., Huang, J., Xu, F., Wang, X., and Han,
J. (2020). Minimally supervised categorization of text
with metadata. In Proceedings of the 43rd International
ACM SIGIR Conference on Research and Development
in Information Retrieval, pages 1231–1240.
WEBIST 2021 - 17th International Conference on Web Information Systems and Technologies