Huang, M., Cao, Y., and Dong, C. (2016). Modeling rich
contexts for sentiment classification with LSTM. CoRR,
abs/1605.01478. http://arxiv.org/abs/1605.01478.
Huh, M., Agrawal, P., and Efros, A. A. (2016). What
makes ImageNet good for transfer learning? CoRR,
abs/1608.08614. http://arxiv.org/abs/1608.08614.
Impermium (2013). Dataset for detecting insults in social
commentary.
https://www.kaggle.com/c/detecting-insults-in-social-commentary/data.
Joshi, G. and Chowdhary, G. (2018). Cross-domain transfer
in reinforcement learning using target apprentice. CoRR,
abs/1801.06920. http://arxiv.org/abs/1801.06920.
Kim, J.-K., Kim, Y.-B., Sarikaya, R., and Fosler-Lussier,
E. (2017). Cross-lingual transfer learning for POS
tagging without cross-lingual resources. In Empirical
Methods in Natural Language Processing (EMNLP),
pages 2832–2838, Copenhagen, Denmark. Associa-
tion for Computational Linguistics.
Kim, Y. (2014). Convolutional neural networks for sentence
classification. In Proceedings of the 2014 Conference
on Empirical Methods in Natural Language Process-
ing (EMNLP), pages 1746–1751, Doha, Qatar. Asso-
ciation for Computational Linguistics.
Kingma, D. P. and Ba, J. (2014). Adam: A method for
stochastic optimization. CoRR, abs/1412.6980.
http://arxiv.org/abs/1412.6980.
Kolhatkar, V. and Taboada, M. (2017). Constructive lan-
guage in news comments. In Proceedings of the First
Workshop on Abusive Language Online, pages 11–17,
Vancouver, BC, Canada.
Krishnamoorthy, P., MacQueen, R., and Schuster,
S. (2017). Detecting insults in online comments.
Technical report, Stanford University.
http://www.rorymacqueen.org/wp-content/uploads/2017/07/cs224u report4.pdf.
Kunze, J., Kirsch, L., Kurenkov, I., Krug, A., Johannsmeier,
J., and Stober, S. (2017). Transfer learning for speech
recognition on a budget. In Proceedings of the 2nd
Workshop on Representation Learning for NLP, pages
168–177, Vancouver, Canada. Association for Com-
putational Linguistics.
Kwok, I. and Wang, Y. (2013). Locate the hate: Detecting
tweets against blacks. In Conference on Artificial In-
telligence (AAAI), pages 1621–1622, Bellevue, Wash-
ington. AAAI Press, Palo Alto, California.
Lee, J. Y. and Dernoncourt, F. (2016). Sequential short-
text classification with recurrent and convolutional
neural networks. In Human Language Technologies:
North American Chapter of the Association for Com-
putational Linguistics (NAACL), pages 515–520, San
Diego, California.
Liu, P., Qiu, X., and Huang, X. (2017). Adversarial multi-
task learning for text classification. In Proceedings of
the 55th Annual Meeting of the Association for Com-
putational Linguistics (ACL), pages 1–10, Vancouver,
Canada. Association for Computational Linguistics.
Mou, L., Meng, Z., Yan, R., Li, G., Xu, Y., Zhang, L., and
Jin, Z. (2016). How transferable are neural networks
in NLP applications? In Proceedings of the Conference
on Empirical Methods in Natural Language Process-
ing (EMNLP), pages 479–489, Austin, Texas. Associ-
ation for Computational Linguistics.
Oquab, M., Bottou, L., Laptev, I., and Sivic, J. (2014).
Learning and transferring mid-level image represen-
tations using convolutional neural networks. In Pro-
ceedings of IEEE Conference on Computer Vision
and Pattern Recognition (CVPR), pages 1717–1724,
Washington, DC, USA. IEEE Computer Society.
Pan, S. J. and Yang, Q. (2010). A survey on transfer learn-
ing. IEEE Transactions on Knowledge and Data En-
gineering, 22(10):1345–1359.
Pavlopoulos, J., Malakasiotis, P., and Androutsopoulos, I.
(2017). Deep learning for user comment moderation.
In Proceedings of the First Workshop on Abusive Lan-
guage Online, pages 25–35, Vancouver, Canada.
Pennington, J., Socher, R., and Manning, C. D. (2014).
GloVe: Global vectors for word representation. In
Empirical Methods in Natural Language Processing
(EMNLP), pages 1532–1543, Doha, Qatar.
Pew (2017). The future of free speech, trolls, anonymity
and fake news online. Technical report, Pew Research
Center.
www.pewinternet.org/2017/03/29/the-future-of-free-speech-trolls-anonymity-and-fake-news-online/.
Poland, B. (2016). Haters: Harassment, Abuse, and Vio-
lence Online. Potomac Books, Lincoln, Nebraska.
Prates De Pelle, R. and Moreira, V. P. (2017). Offensive
comments in the Brazilian web: a dataset and baseline
results. In Brazilian Workshop on Social Network
Analysis and Mining (BRASNAM), pages 510–519,
São Paulo, Brazil.
Qian, Q., Huang, M., Lei, J., and Zhu, X. (2017).
Linguistically regularized LSTM for sentiment
classification. In Proceedings of the 55th Annual Meeting of
the Association for Computational Linguistics (ACL),
pages 1679–1689, Vancouver, Canada. Association
for Computational Linguistics.
R Core Team (2013). R: A Language and Environment for
Statistical Computing. R Foundation for Statistical
Computing, Vienna, Austria. http://www.R-project.org/.
Ross, B., Rist, M., Carbonell, G., Cabrera, B., Kurowsky,
N., and Wojatzki, M. (2016). Measuring the reliability
of hate speech annotations: The case of the European
refugee crisis. In 3rd Workshop on Natural Language
Processing for Computer-Mediated Communication,
pages 6–10, Bochum, Germany.
Ruder, S. and Plank, B. (2017). Learning to select data
for transfer learning with Bayesian optimization. In
Empirical Methods in Natural Language Processing
(EMNLP), pages 372–382, Copenhagen, Denmark.
Association for Computational Linguistics.
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh,
S., Ma, S., Huang, Z., Karpathy, A., Khosla, A.,
Bernstein, M., Berg, A. C., and Fei-Fei, L. (2015).
ImageNet Large Scale Visual Recognition Challenge.
International Journal of Computer Vision (IJCV),
115(3):211–252.
LSTM Neural Networks for Transfer Learning in Online Moderation of Abuse Context