results quantitatively.
In the context of link analysis, the graph-based
ranking method has been widely and successfully
used in NLP and its applications, such as WSD
(Agirre and Soroa, 2009; Navigli and Lapata, 2010),
text semantic similarity (Ramage et al., 2009), query
expansion in IR (Lafferty and Zhai, 2001), and doc-
ument summarization (Mihalcea and Tarau, 2005).
The basic idea is that of “voting” or “recommenda-
tions” between nodes. The model first constructs a
directed or undirected graph to reflect the relation-
ships between the nodes and then applies the graph-
based ranking algorithm to compute the rank scores
for the nodes. The nodes with large rank scores are
chosen as important nodes. Our methodology is the
first aimed at applying a graph-based ranking method
to identify the predominant sense depending on each
domain/category.
5 CONCLUSIONS
We presented a method for identifying predom-
inant sense of WordNet depending on each do-
main/category defined in the Reuters corpus. The av-
erage precision was 0.661, and recall against the Sub-
ject Field Code resources was 0.686. Moreover, the
results applying text classification significantly im-
proved classification accuracy. Future work will in-
clude: (i) applying the method to other part-of-speech
words, (ii) comparing the method with existing other
automated method, and (iii) extending our approach
to find domain-specific senses with unknown words
(Ciaramita and Johnson, 2003).
REFERENCES
Agirre, E. and Lacalle, O. L. (2009). Supervised Domain
Adaption for WSD. In Proc. of the EACL, pages 42–
50.
Agirre, E. and Soroa, A. (2009). Personalizing PageR-
ank for Word Sense Disambiguation. In Proc. of the
EACL, pages 33–41.
Bremaud, P. (1999). Markov Chains: Gibbs Fields, Monte
Carlo Simulation, and Queues. Springer-Verlag.
Brin, S. and Page, L. (1998). The Anatomy of a Large-
scale Hypertextual Web Search Engine. Computer
Networks, 30(1-7):107–117.
Buitelaar, P. and Sacaleanu, B. (2001). Ranking and Select-
ing Synsets by Domain Relevance. In Proc. of Word-
Net and Other Lexical Resources: Applications, Ex-
tensions and Customization, pages 119–124.
Chan, Y. S. and Ng, H. T. (2007). Domain Adaptation with
Active Learning for Word Sense Disambiguation. In
Proc. of the ACL, pages 49–56.
Ciaramita, M. and Johnson, M. (2003). Supersense Tag-
ging of Unknown Nouns in WordNet. In Proc. of the
EMNLP2003, pages 168–175.
Cotton, S., Edmonds, P., Kilgarriff, A., and
Palmer, M. (1998). SENSEVAL-2,
http://www.sle.sharp.co.uk/senseval2/.
Lafferty, J. and Zhai, C. (2001). Document Language Mod-
eling, Query Models, and Risk Minimization for In-
formation Retrieval. In Proc. of the SIGIR2001, pages
111–119.
Magnini, B. and Cavaglia, G. (2000). Integrating Subject
Field Codes into WordNet. In In Proc. of LREC-2000.
McCarthy, D., Koeling, R., Weeds, J., and Carroll, J.
(2004). Finding Predominant Senses in Untagged
Text. In Proc. of the ACL, pages 280–287.
Mihalcea, R. and Tarau, P. (2005). Language Independent
Extractive Summarization. In In Proc. of the ACL,
pages 49–52.
Miller, G. A., Leacock, C., Tengi, R., and Bunker, R. T.
(1993). A Semantic Concordance. In Proc. of the
ARPA Workshop on HLT, pages 303–308.
Navigli, R. and Lapata, M. (2010). An Experimental Study
of Graph Connectivity for Unsupervised Word Sense
Disambiguation. IEEE Trans. Pattern Anal. Mach. In-
tell., 32(4):678–692.
Netlib (2007). http://www.netlib.org/scalapack/index.html.
In Netlib Repository at UTK and ORNL.
Nigam, K., McCallum, A. K., Thrun, S., and Mitchell, T.
(2000). Text Classification from Labeled and Un-
labeled Documents using EM. Machine Learning,
39(2):103–134.
Ramage, D., Rafferty, A. N., and Manning, C. D. (2009).
Random Walks for Text Semantic Similarity. In Proc.
of the 4th TextGraphs Workshop on Graph-based Al-
gorithms in NLP, pages 23–31.
Rose, T. G., Stevenson, M., and Whitehead, M. (2002). The
Reuters Corpus Volume 1 - from yesterday’s news to
tomorrow’s language resources. In Proc. of LREC-
2002.
Schmid, H. (1995). Improvements in Part-of-Speech Tag-
ging with an Application to German. In Proc. of the
EACL SIGDAT Workshop.
KEOD 2010 - International Conference on Knowledge Engineering and Ontology Development
268