Breiman, L. (2001). Random forests. Machine learning, 45,
5-32.
Cao, K., Chen, C., Baltes, S., Treude, C., & Chen, X. (2021,
May). Automated query reformulation for efficient
search based on query logs from stack overflow. In
2021 IEEE/ACM 43rd International Conference on
Software Engineering (ICSE) (pp. 1273-1285). IEEE.
Chen, T., & Guestrin, C. (2016, August). Xgboost: A
scalable tree boosting system. In Proceedings of the
22nd acm sigkdd international conference on
knowledge discovery and data mining (pp. 785-794).
Feng, Z., Guo, D., Tang, D., Duan, N., Feng, X., Gong, M.,
... & Zhou, M. (2020). Codebert: A pre-trained model
for programming and natural languages. arXiv preprint
arXiv:2002.08155.
Gilda, S. (2017, July). Source code classification using
Neural Networks. In 2017 14th international joint
conference on computer science and software
engineering (JCSSE) (pp. 1-6). IEEE.
Jain, V., & Lodhavia, J. (2020, June). Automatic Question
Tagging using k-Nearest Neighbors and Random
Forest. In 2020 International Conference on Intelligent
Systems and Computer Vision (ISCV) (pp. 1-4). IEEE.
Kavuk, E. M., & Tosun, A. (2020, June). Predicting Stack
Overflow question tags: a multi-class, multi-label
classification. In Proceedings of the IEEE/ACM 42nd
International Conference on Software Engineering
Workshops (pp. 489-493).
Klein, D., Murray, K., & Weber, S. (2011). Algorithmic
programming language identification. arXiv preprint
arXiv:1106.4064.
Khasnabish, J. N., Sodhi, M., Deshmukh, J., &
Srinivasaraghavan, G. (2014, July). Detecting
programming language from source code using
bayesian learning techniques. In International
Workshop on Machine Learning and Data Mining in
Pattern Recognition (pp. 513-522). Springer, Cham.
Kuo, D. (2011). On word prediction methods. Technical
report, Technical report, EECS Department.
McHugh, M. L. (2013). The chi-square test of
independence. Biochemia medica, 23(2), 143-149.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013).
Efficient estimation of word representations in vector
space. arXiv preprint arXiv:1301.3781.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V.,
Thirion, B., Grisel, O., ... & Duchesnay, E. (2011).
Scikit-learn: Machine learning in Python. the Journal of
machine Learning research, 12, 2825-2830.
Pennington, J., Socher, R., & Manning, C. D. (2014,
October). Glove: Global vectors for word
representation. In Proceedings of the 2014 conference
on empirical methods in natural language processing
(EMNLP) (pp. 1532-1543).
Programming language identification tool, 2018. Available:
https://www.algorithmia.com [Online].
Rekha, V. S., Divya, N., & Bagavathi, P. S. (2014,
October). A hybrid auto-tagging system for
stackoverflow forum questions. In Proceedings of the
2014 International Conference on Interdisciplinary
Advances in Applied Computing (pp. 1-5).
Saha, A. K., Saha, R. K., & Schneider, K. A. (2013, May).
A discriminative model approach for suggesting tags
automatically for stack overflow questions. In 2013
10th Working Conference on Mining Software
Repositories (MSR) (pp. 73-76). IEEE.
Saini, T., & Tripathi, S. (2018, March). Predicting tags for
stack overflow questions using different classifiers. In
2018 4th International Conference on Recent Advances
in Information Technology (RAIT) (pp. 1-5). IEEE.
Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019).
DistilBERT, a distilled version of BERT: smaller,
faster, cheaper and lighter. arXiv preprint
arXiv:1910.01108.
Stanley, C., & Byrne, M. D. (2013, July). Predicting tags
for stackoverflow posts. In Proceedings of ICCM (Vol.
2013).
Swaraj, A. and Kumar, S. A Methodology for Detecting
Programming Languages in Stack Overflow Questions.
DOI: 10.5220/0011310400003266. In Proceedings of
the 17th International Conference on Software
Technologies (ICSOFT 2022)
Van Dam, J. K., & Zaytsev, V. (2016, March). Software
language identification with natural language
classifiers. In 2016 IEEE 23rd international conference
on software analysis, evolution, and reengineering
(SANER) (Vol. 1, pp. 624-628). IEEE.
Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C.,
Moi, A., ... & Rush, A. M. (2019). Huggingface's
transformers: State-of-the-art natural language
processing. arXiv preprint arXiv:1910.03771.