fisheries domain and more research is needed con-
cerning the generalizability into other domains and
other forms of text, such as scientific literature and
other technical language. Furtermore, research is
needed to explain why accuracy varies between verb
scores and why collocation statistics work better in
some cases. Finally, research is also necessary when
scoring the verbs against a non-related text corpus to
see which types or genres of non-related domain cor-
pora affects the domain under study.
ACKNOWLEDGEMENTS
This research was funded by the project SAF21 - So-
cial science aspects of fisheries for the 21
st
Century.
SAF21 is a project financed under the EU Horizon
2020 Marie Skłodowska-Curie (MSC) ITN - ETN
programme (project 642080).
REFERENCES
Curran, J. R., Murphy, T., and Scholz, B. (2007). Minimis-
ing semantic drift with mutual exclusion bootstrap-
ping. In In Proceedings of the 10th Conference of
the Pacific Association for Computational Linguistics,
pages 172–180.
Dunning, T. (1993). Accurate methods for the statistics of
surprise and coincidence. Computational Linguistics,
19(1):61–74.
Igo, S. P. and Riloff, E. (2009). Corpus-based semantic lex-
icon induction with web-based corroboration. In Pro-
ceedings of the Workshop on Unsupervised and Mini-
mally Supervised Learning of Lexical Semantics, UM-
SLLS ’09, pages 18–26, Stroudsburg, PA, USA. As-
sociation for Computational Linguistics.
Lenat, D. B. (1995). Cyc: A large-scale investment in
knowledge infrastructure. Commun. ACM, 38(11):33–
38.
Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., and
Miller, K. J. (1990). Introduction to WordNet: an on-
line lexical database. International Journal of Lexi-
cography, 3(4):235–244.
Pantel, P. and Ravichandran, D. (2004). Automatically la-
beling semantic classes. In HLT-NAACL, pages 321–
328.
Phillips, W. and Riloff, E. (2002). Exploiting strong syn-
tactic heuristics and co-training to learn semantic lex-
icons. In Proceedings of the ACL-02 Conference on
Empirical Methods in Natural Language Processing -
Volume 10, EMNLP ’02, pages 125–132, Stroudsburg,
PA, USA. Association for Computational Linguistics.
Porter, M. F. (1980). An algorithm for suffix stripping. Pro-
gram, 14(3):130–137.
Qadir, A., Mendes, P. N., Gruhl, D., and Lewis, N. (2015).
Semantic lexicon induction from twitter with pattern
relatedness and flexible term length. In Proceedings
of the Twenty-Ninth AAAI Conference on Artificial In-
telligence, AAAI’15, pages 2432–2439. AAAI Press.
Qadir, A. and Riloff, E. (2012). Ensemble-based seman-
tic lexicon induction for semantic tagging. In Pro-
ceedings of the First Joint Conference on Lexical and
Computational Semantics - Volume 1: Proceedings of
the Main Conference and the Shared Task, and Volume
2: Proceedings of the Sixth International Workshop
on Semantic Evaluation, SemEval ’12, pages 199–
208, Stroudsburg, PA, USA. Association for Compu-
tational Linguistics.
Riloff, E. (1996). Automatically generating extraction pat-
terns from untagged text. In Proceedings of the Thir-
teenth National Conference on Artificial Intelligence -
Volume 2, AAAI’96, pages 1044–1049. AAAI Press.
Riloff, E. and Shepherd, J. (1997). A corpus-based ap-
proach for building semantic lexicons. In In Proceed-
ings of the Second Conference on Empirical Methods
in Natural Language Processing, pages 117–124.
Roark, B. and Charniak, E. (1998). Noun-phrase co-
occurrence statistics for semiautomatic semantic lexi-
con construction. In Proceedings of the 17th Interna-
tional Conference on Computational Linguistics - Vol-
ume 2, COLING ’98, pages 1110–1116, Stroudsburg,
PA, USA. Association for Computational Linguistics.
Thelen, M. and Riloff, E. (2002). A bootstrapping method
for learning semantic lexicons using extraction pattern
contexts. In Proceedings of the ACL-02 Conference on
Empirical Methods in Natural Language Processing -
Volume 10, EMNLP ’02, pages 214–221, Stroudsburg,
PA, USA. Association for Computational Linguistics.
Widdows, D. and Dorow, B. (2002). A graph model for un-
supervised lexical acquisition. In Proceedings of the
19th International Conference on Computational Lin-
guistics - Volume 1, COLING ’02, pages 1–7, Strouds-
burg, PA, USA. Association for Computational Lin-
guistics.
Ziering, P., van der Plas, L., and Sch
¨
utze, H. (2013a). Boot-
strapping semantic lexicons for technical domains. In
IJCNLP, pages 1321–1329. Asian Federation of Nat-
ural Language Processing / ACL.
Ziering, P., van der Plas, L., and Sch
¨
utze, H. (2013b).
Multilingual lexicon bootstrapping - improving a lex-
icon induction system using a parallel corpus. In
IJCNLP, pages 844–848. Asian Federation of Natural
Language Processing / ACL.
KDIR 2016 - 8th International Conference on Knowledge Discovery and Information Retrieval
196