Breiman, L. (2001). Random forests. Machine Learning, 45,
Cao, K., Chen, C., Baltes, S., Treude, C., & Chen, X. (2021,
May). Automated query reformulation for efficient
search based on query logs from Stack Overflow. In
2021 IEEE/ACM 43rd International Conference on
Software Engineering (ICSE) (pp. 1273-1285). IEEE.
Chen, T., & Guestrin, C. (2016, August). XGBoost: A
scalable tree boosting system. In Proceedings of the
22nd ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining (pp. 785-794).
Feng, Z., Guo, D., Tang, D., Duan, N., Feng, X., Gong, M.,
... & Zhou, M. (2020). CodeBERT: A pre-trained model
for programming and natural languages. arXiv preprint
Gilda, S. (2017, July). Source code classification using
neural networks. In 2017 14th International Joint
Conference on Computer Science and Software
Engineering (JCSSE) (pp. 1-6). IEEE.
Jain, V., & Lodhavia, J. (2020, June). Automatic question
tagging using k-nearest neighbors and random
forest. In 2020 International Conference on Intelligent
Systems and Computer Vision (ISCV) (pp. 1-4). IEEE.
Kavuk, E. M., & Tosun, A. (2020, June). Predicting Stack
Overflow question tags: a multi-class, multi-label
classification. In Proceedings of the IEEE/ACM 42nd
International Conference on Software Engineering
Workshops (pp. 489-493).
Khasnabish, J. N., Sodhi, M., Deshmukh, J., &
Srinivasaraghavan, G. (2014, July). Detecting
programming language from source code using
Bayesian learning techniques. In International
Workshop on Machine Learning and Data Mining in
Pattern Recognition (pp. 513-522). Springer, Cham.
Klein, D., Murray, K., & Weber, S. (2011). Algorithmic
programming language identification. arXiv preprint
Kuo, D. (2011). On word prediction methods. Technical
report, EECS Department.
McHugh, M. L. (2013). The chi-square test of
independence. Biochemia Medica, 23(2), 143-149.
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013).
Efficient estimation of word representations in vector
space. arXiv preprint arXiv:1301.3781.
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V.,
Thirion, B., Grisel, O., ... & Duchesnay, E. (2011).
Scikit-learn: Machine learning in Python. Journal of
Machine Learning Research, 12, 2825-2830.
Pennington, J., Socher, R., & Manning, C. D. (2014,
October). GloVe: Global vectors for word
representation. In Proceedings of the 2014 Conference
on Empirical Methods in Natural Language Processing
(EMNLP) (pp. 1532-1543).
Programming language identification tool, 2018. Available: [Online].
Rekha, V. S., Divya, N., & Bagavathi, P. S. (2014,
October). A hybrid auto-tagging system for
stackoverflow forum questions. In Proceedings of the
2014 International Conference on Interdisciplinary
Advances in Applied Computing (pp. 1-5).
Saha, A. K., Saha, R. K., & Schneider, K. A. (2013, May).
A discriminative model approach for suggesting tags
automatically for Stack Overflow questions. In 2013
10th Working Conference on Mining Software
Repositories (MSR) (pp. 73-76). IEEE.
Saini, T., & Tripathi, S. (2018, March). Predicting tags for
Stack Overflow questions using different classifiers. In
2018 4th International Conference on Recent Advances
in Information Technology (RAIT) (pp. 1-5). IEEE.
Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019).
DistilBERT, a distilled version of BERT: smaller,
faster, cheaper and lighter. arXiv preprint
Stanley, C., & Byrne, M. D. (2013, July). Predicting tags
for stackoverflow posts. In Proceedings of ICCM (Vol.
Swaraj, A., & Kumar, S. (2022). A methodology for detecting
programming languages in Stack Overflow questions.
In Proceedings of the 17th International Conference on
Software Technologies (ICSOFT 2022).
DOI: 10.5220/0011310400003266.
Van Dam, J. K., & Zaytsev, V. (2016, March). Software
language identification with natural language
classifiers. In 2016 IEEE 23rd International Conference
on Software Analysis, Evolution, and Reengineering
(SANER) (Vol. 1, pp. 624-628). IEEE.
Wolf, T., Debut, L., Sanh, V., Chaumond, J., Delangue, C.,
Moi, A., ... & Rush, A. M. (2019). HuggingFace's
Transformers: State-of-the-art natural language
processing. arXiv preprint arXiv:1910.03771.