increase the performance. In a practical system which
values recall over precision, if any of the models pre-
dict that a post should be rejected, then this should
be flagged to the human moderators. There are also a
number of other hyper-parameters and potential fea-
tures which could be explored. For example, we have
noticed that posts made by boys have a higher rejec-
tion rate. Consequently, future work could explore
whether and how to incorporate extra-linguistic fea-
tures of the posts including gender and age of user.
REFERENCES
Cheng, J., Danescu-Niculescu-Mizil, C., and Leskovec, J.
(2015). Antisocial behavior in online discussion com-
munities. In Ninth International AAAI Conference on
Web and Social Media.
Devlin, J., Chang, M., Lee, K., and Toutanova, K. (2018).
BERT: pre-training of deep bidirectional transformers
for language understanding. CoRR, abs/1810.04805.
Duggan, M. (2017). Online harrassment. Technical report,
Pew Research Center.
Fellbaum, C. (1998). WordNet: An Electronic Lexical
Database. Bradford Books.
Gagliardone, I., Gal, D., Alves, T., and Martinez, G. (2015).
Countering online hate speech. Unesco Publishing.
Harris, Z. (1954). Distributional structure. Word,
10(23):146 – 162.
Hochreiter, S., Bengio, Y., Frasconi, P., Schmidhuber, J.,
et al. (2001). Gradient flow in recurrent nets: the dif-
ficulty of learning long-term dependencies.
Howard, J. and Ruder, S. (2018). Universal language model
fine-tuning for text classification. arXiv preprint.
Kobayashi, S. (2018). Contextual augmentation: Data
augmentation by words with paradigmatic relations.
CoRR, abs/1805.06201.
McCarthy, D., Koeling, R., Weeds, J., and Carroll, J.
(2004). Finding predominant word senses in untagged
text. In Proceedings of the 42nd Meeting of the Asso-
ciation for Computational Linguistics (ACL’04), Main
Volume, pages 279–286, Barcelona, Spain.
McManus, S., Bebbington, P., Jenkins, R., and Brugha,
T. (2016). Mental health and wellbeing in england:
Adult psychiatric morbidity survey 2014.
McManus, S., Meltzer, H., Brugha, T. S., Bebbington, P. E.,
and Jenkins, R. (2009). Adult psychiatric morbidity
in england, 2007: results of a household survey.
Mikolov, T., Yih, W. T., and Zweig, G. (2013). Linguis-
tic regularities in continuous space word representa-
tions. In Proceedings of the 2013 Conference of the
North American Chapter of the Association for Com-
putational Linguistics: Human Language Technolo-
gies, pages 746–751.
Park, D. S., Chan, W., Zhang, Y., Chiu, C., Zoph, B., Cubuk,
E. D., and Le, Q. V. (2019). Specaugment: A sim-
ple data augmentation method for automatic speech
recognition. CoRR, abs/1904.08779.
Pavlopoulos, J., Malakasiotis, P., and Androutsopoulos, I.
(2017). Deep learning for user comment moderation.
CoRR, abs/1705.09993.
Pennington, J., S. R. . M. C. (2014). Glove: Global vectors
for word representation. In Proceedings of the 2014
conference on empirical methods in natural language
processing (EMNLP), pages 1532–1543.
Peters, M. E., Neumann, M., Iyyer, M., Gardner, M.,
Clark, C., Lee, K., and Zettlemoyer, L. (2018).
Deep contextualized word representations. CoRR,
abs/1802.05365.
Pieschl, S., Kuhlmann, C., and Porsch, T. (2015). Beware
of publicity! perceived distress of negative cyber in-
cidents and implications for defining cyberbullying.
Journal of School Violence, 14(1):111–132.
Radford, A., Narasimhan, K., Salimans, T., and Sutskever,
I. (2013). Improving language understanding by gen-
erative pre-training.
Saleem, H. M., Dillon, K. P., Benesch, S., and Ruths, D.
(2017). A web of hate: Tackling hateful speech in
online social spaces. CoRR, abs/1709.10159.
Shrivastava, A., Pfister, T., Tuzel, O., Susskind, J., Wang,
W., and Webb, R. (2017). Learning from simulated
and unsupervised images through adversarial training.
In 2017 IEEE Conference on Computer Vision and
Pattern Recognition (CVPR), pages 2242–2251.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones,
L., Gomez A. N., ., and Polosukhin, I. (2017). Atten-
tion is all you need. In Advances in neural information
processing systems, pages 5998–6008.
Weeds, J., Weir, D., and McCarthy, D. (2004). Character-
ising measures of lexical distributional similarity. In
Proceedings of the International Conference on Com-
putational Linguistics (COLING), pages 1015–1021,
Geneva, Switzerland.
Wei, J. W. and Zou, K. (2019). EDA: easy data augmenta-
tion techniques for boosting performance on text clas-
sification tasks. CoRR, abs/1901.11196.
Wulczyn, E., Thain, N., and Dixon, L. (2017). Ex machina:
Personal attacks seen at scale. International World
Wide Web Conferences Steering Committee., pages
1391–1399.
Yu, A. W., Dohan, D., Luong, M., Zhao, R., Chen, K.,
Norouzi, M., and Le, Q. V. (2018). Qanet: Combining
local convolution with global self-attention for read-
ing comprehension. CoRR, abs/1804.09541.
Improving Mental Health using Machine Learning to Assist Humans in the Moderation of Forum Posts
197