ACKNOWLEDGEMENTS
The authors would like to thank the referees for their
comments on the earlier version of this paper. This
work was partially supported by the Telecommunica-
tions Advancement Foundation.
REFERENCES
Bar-Hillel, A., Hertz, T., Shental, N., and Weinshall, D.
(2003). Learning Distance Functions using Equiva-
lence Relations. In Proc. of the 20th International
Conference on Machine Learning, pages 11–18.
Bilenko, M., Basu, S., and Mooney, R. J. (2004). Integrating
Constraints and Metric Learning in Semi-Supervised
Clustering. In Proc. of the 21th International Confer-
ence on Machine Learning, pages 81–88.
Bouraev, B., Briscoe, E. J., Carroll, J., Carter, D.,
and Grover, C. (1987). The Derivation of a
Grammatically-Indexed Lexicon from the Longman
Dictionary of Contemporary English. In Proc. of the
25th Annual Meeting of the Association for Computa-
tional Linguistics, pages 193–200.
Brew, C. and Walde, S. S. (2002). Spectral Clustering
for German Verbs. In Proc. of 2002 Conference on
Empirical Methods in Natural Language Processing,
pages 117–123.
Briscoe, E. J. and Carroll, J. (1997). Automatic Extraction
of Subcategorization from Corpora. In Proc. of 5th
ACL Conference on Applied Natural Language Pro-
cessing, pages 356–363.
Briscoe, E. J. and Carroll, J. (2002). Robust Accurate Sta-
tistical Annotaion of General Text. In Proc. of 3rd
International Conference on Language Resources and
Evaluation, pages 1499–1504.
Dagan, I., Lee, L., and Pereira, F. C. N. (1999). Similarity-
based Models of Word Cooccurrence Probabilities.
Machine Learning, 34(1-3):43–69.
Grishman, R., Macleod, C., and Meyers, A. (1994). Com-
plex Syntax: Building a Computational Lexicon. In
Proc. of International Conference on Computational
Linguistics, pages 268–272.
Hindle, D. (1990). Noun Classification from Predicate-
Argument Structures. In Proc. of 28th Annual Meet-
ing of the Association for Computational Linguistics,
pages 268–275.
Hughes, J. (1994). Automatically Acquiring Classification
of Words. Ph.D. thesis University of Leeds.
Kermanidis, K., Maragoudakis, M., Fakotakis, N., and
Kokkinakis, G. K. (2008). Learning Verb Com-
plements for Modern Greek: Balancing the Noisy
Dataset. Natural Language Engineering, 14(1):71–
100.
Kirkpatrick, S., Jr., C. D. G., and Vecchi, M. P. (1983).
Optimization by Simulated Annealing. Science,
220(4598):671–680.
Korhonen, A. (2002). Subcategorization Acquisition. Ph.D.
thesis University of Cambridge.
Korhonen, A., Krymolowski, Y., and Briscoe, T. (2006).
A Large Subcategorization Lexicon for Natural Lan-
guage Processing Applications. In Proc. of the 5th
International Conference on Language Resources and
Evaluation.
Korhonen, A., Krymolowski, Y., and Marx, Z. (2003). Clus-
tering Polysemic Subcategorization Frame Distribu-
tions Semantically. In Proc. of the 41st Annual Meet-
ing of the Association for Computational Linguistics,
pages 64–71.
Lee, L. (1999). Measures of Distributional Similarity. In
Proc. of the 37th Annual Meeting of the Association
for Computational Linguistics, pages 25–32.
Leech, G. (1992). 100 Million Words of English:
The British National Corpus. Language Research,
28(1):1–13.
Levin, B. (1993.). English Verb Classes and Alternations.
Chicago University Press.
Lin, D. (1998). Automatic Retrieval and Clustering of Sim-
ilar Words. In Proc. of 36th Annual Meeting of the
Association for Computational Linguistics and 17th
International Conference on Computational Linguis-
tics, pages 768–773.
Matsuo, Y., Sakaki, T., Uchiyama, K., and Ishizuka, M.
(2006). Graph-based Word Clustering using a Web
Search Engine. In Proc. of 2006 Conference on
Empirical Methods in Natural Language Processing
(EMNLP2006), pages 542–550.
Navigli, R. (2008). A Structural Approach to the Automatic
Adjudication of Word Sense Disagreements. Natural
Language Engineering, 14(4):547–573.
Navigli, R. (2009). Word Sense Disambiguation: A Survey.
ACM Computing Surveys, 41(2):1–69.
Ng, A. Y., Jordan, M. I., and Weiss, Y. (2002.). On Spectral
Clustering: Analysis and an Algorithm. MIT Press.
Pereira, F., Tishby, N., and Lee, L. (1993). Distributional
Clustering of English Words. In Proc. of the 31st
Annual Meeting of the Association for Computational
Linguistics, pages 183–190.
Reichardt, J. and Bornholdt, S. (2004). Detecting Fuzzy
Community Structure in Complex Networks with a
Potts Model. PHYSICAL REVIEW LETTERS, 93(21).
Reichardt, J. and Bornholdt, S. (2006). Statistical Mechan-
ics of Community Detection. PHYSICAL REVIEW E,
74.
Reiter, E. and Dale, R. (2000.). Building Natural Language
Generation Systems. Cambridge University Press.
Rooth, M. (1998). Two-Dimensional Clusters in Grammat-
ical Relations. In Inducing Lexicons with the EM Al-
gorithm, AIMS Report, 4(3).
Rooth, M., Riezler, S., Prescher, D., Carroll, G., and Beil,
F. (1999). Inducing a Semantically Annotated Lex-
icon via EM-Based Clustering. In Proc. of the 37th
Annual Meeting of the Association for Computational
Linguistics.
Schulte im Walde, S. (2000). Clustering Verbs Seman-
tically according to their Alternation Behaviour. In
Proc. of the 18th International Conference on Com-
putational Linguistics, pages 747–753.
SEMANTIC CLASSIFICATION OF UNKNOWN WORDS BASED ON GRAPH-BASED SEMI-SUPERVISED
CLUSTERING
45