knowledge domain of protein families. Bioinformatics,
14:600–607.
Clark, K. and Gale, W. (1995). Inverse document frequency
(idf): A measure of deviation from poisson. In Third
Workshop on Very Large Corpora, pages 121–130.
Ercan, G. and Cicekli, I. (2007). Using lexical chains for
keyword extraction. Inf. Process. Manage., 43(6):1705–
1714.
Frank, E., Paynter, G. W., Witten, I. H., Gutwin, C.,
and Nevill-Manning, C. G. (1999). Domain-specific
keyphrase extraction. In Dean, T., editor, IJCAI’99,
pages 668–673. Morgan Kaufmann.
HaCohen-Kerner, Y. (2003). Automatic extraction of key-
words from abstracts. In Palade, V., Howlett, R. J., and
Jain, L. C., editors, KES 2003, volume 2773 of Lecture
Notes in Computer Science, pages 843–849. Springer.
HaCohen-Kerner, Y., Gross, Z., and Masa, A. (2005). Auto-
matic extraction and learning of keyphrases from scien-
tific articles. In Gelbukh, A. F., editor, CICLing 2005,
volume 3406 of Lecture Notes in Computer Science,
pages 657–669. Springer.
Hulth, A. (2003). Improved automatic keyword extraction
given more linguistic knowledge. In Conference on Em-
pirical Methods in Natural Language Processing, pages
216–223.
Hulth, A. (2004). Enhancing linguistically oriented auto-
matic keyword extraction. In North American Human
language technology conference.
Hulth, A., Karlgren, J., Jonsson, A., Bostr¨om, H., and
Asker, L. (2001). Automatic keyword extraction us-
ing domain knowledge. In Gelbukh, A. F., editor, CI-
CLing’01, volume 2004 of Lecture Notes in Computer
Science, pages 472–482. Springer.
Kim, S., Medelyan, O., Kan, M., and Baldwin, T. (2010).
Semeval-2010 task 5: Automatic keyphrase extraction
from scientific articles. In Proceedings of the 5th Inter-
national Workshop on Semantic Evaluation, ACL 2010,
pages 21–26.
Matsuo, Y. and Ishizuka, M. (2003). Keyword extraction
from a single document using word co-occurrence statis-
tical information. In Russell, I. and Haller, S. M., editors,
FLAIRS Conference, pages 392–396. AAAI Press.
Nguyen, T. D. and Kan, M.-Y. (2007). Keyphrase extraction
in scientific publications. In Goh, D. H.-L., Cao, T. H.,
Sølvberg, I., and Rasmussen, E. M., editors, ICADL, vol-
ume 4822 of Lecture Notes in Computer Science, pages
317–326. Springer.
Ohsawa, Y., Benson, N. E., and Yachida, M. (1998).
Keygraph: Automatic indexing by co-occurrence graph
based on building construction metaphor. In ADL’98,
pages 12–18. IEEE Computer Society.
Page, L., Brin, S., Motwani, R., and Winograd, T. (1998).
The pagerank citation ranking: Bringing order to the
web. Technical report, Stanford.
Rennie, J. D. M. and Jaakkola, T. (2005). Using term infor-
mativeness for named entity detection. In Baeza-Yates,
R. A., Ziviani, N., Marchionini, G., Moffat, A., and Tait,
J., editors, SIGIR’05, pages 353–360. ACM.
Timonen, M. (2012). Categorization of very short docu-
ments. In In-press KDIR’12. SciTePress Digital Library.
Timonen, M., Silvonen, P., and Kasari, M. (2011a). Classi-
fication of short documents to categorize consumer opin-
ions. In ADMA’11. Online proceedings.
Timonen, M., Silvonen, P., and Kasari, M. (2011b). Mod-
elling a query space using associations. Frontiers in Ar-
tificial Intelligence and Applications, 255:77–96.
Tomokiyo, T. and Hurst, M. (2003). A language model ap-
proach to keyphrase extraction. In Proceedings of ACL
Workshop on Multiword Expressions.
Turney, P. D. (2000). Learning algorithms for keyphrase
extraction. Inf. Retr., 2(4):303–336.
Turney, P. D. (2003). Coherent keyphrase extraction via
web mining. In Gottlob, G. and Walsh, T., editors, IJ-
CAI’03, pages 434–442. Morgan Kaufmann.
Wan, X. and Xiao, J. (2008). Collabrank: Towards a collab-
orative approach to single-document keyphrase extrac-
tion. In Scott, D. and Uszkoreit, H., editors, COLING’08,
pages 969–976.
Witten, I. H., Paynter, G. W., Frank, E., Gutwin, C., and
Nevill-Manning, C. G. (1999). Kea: Practical automatic
keyphrase extraction. CoRR, cs.DL/9902007.
Yih, W., Goodman, J., and Carvalho, V. R. (2006). Finding
advertising keywords on web pages. In Carr, L., Roure,
D. D., Iyengar, A., Goble, C. A., and Dahlin, M., editors,
WWW’06, pages 213–222. ACM.
Informativeness-basedKeywordExtractionfromShortDocuments
421