Integrating Internet Directories by Estimating Category Correspondences

Yoshimi Suzuki, Fumiyo Fukumoto

Abstract

This paper proposes a method for integrating two existing category hierarchies into one. Integration is based on semantically related categories, which are extracted by using text categorization. We extract semantically related category pairs by estimating category correspondences. Some categories within the hierarchies are merged based on the extracted category pairs. We assign the remaining categories to the newly constructed hierarchy. To evaluate the method, we applied the new hierarchy to a text categorization task. The results showed that the method was effective for categorization.
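
The pipeline summarized in the abstract (estimate category correspondences with a text classifier, merge the paired categories, carry over the rest) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the abstract does not state which classifier, features, or threshold the authors use, so the word-overlap classifier, the `threshold` parameter, the function names, and the toy directories below are hypothetical stand-ins. The sketch also assumes that category names are distinct across the two directories.

```python
from collections import Counter

def train_centroids(docs_by_cat):
    """Build one bag-of-words Counter per category (a stand-in classifier)."""
    return {cat: Counter(w for d in docs for w in d.lower().split())
            for cat, docs in docs_by_cat.items()}

def classify(doc, centroids):
    """Return the category with the largest word overlap, or None if no overlap."""
    words = doc.lower().split()
    scores = {c: sum(cnt[w] for w in words) for c, cnt in centroids.items()}
    best = max(sorted(scores), key=scores.get)
    return best if scores[best] > 0 else None

def estimate_pairs(dir_a, dir_b, threshold=0.5):
    """Pair each dir_a category with the dir_b category that receives
    more than `threshold` of its documents (hypothetical criterion)."""
    centroids = train_centroids(dir_b)
    pairs = {}
    for cat_a, docs in dir_a.items():
        if not docs:
            continue
        votes = Counter(classify(d, centroids) for d in docs)
        cat_b, n = votes.most_common(1)[0]
        if cat_b is not None and n / len(docs) > threshold:
            pairs[cat_a] = cat_b
    return pairs

def integrate(dir_a, dir_b, pairs):
    """Merge paired categories; copy unpaired categories into the new hierarchy."""
    merged = {}
    used_b = set(pairs.values())
    for cat_a, docs in dir_a.items():
        if cat_a in pairs:
            merged[f"{cat_a}+{pairs[cat_a]}"] = docs + dir_b[pairs[cat_a]]
        else:
            merged[cat_a] = list(docs)
    for cat_b, docs in dir_b.items():
        if cat_b not in used_b:
            merged[cat_b] = list(docs)
    return merged

# Toy directories (invented data, flat rather than hierarchical for brevity).
dir_a = {
    "Sports": ["football match score goal", "tennis match player serve"],
    "Cooking": ["recipe oven bake bread"],
}
dir_b = {
    "Athletics": ["match player goal team", "score team league match"],
    "Finance": ["stock market bank rate"],
}

pairs = estimate_pairs(dir_a, dir_b)
merged = integrate(dir_a, dir_b, pairs)
print(pairs)
print(sorted(merged))
```

In this toy run only "Sports" and "Athletics" are paired, because both "Sports" documents are classified into "Athletics", while "Cooking" shares no vocabulary with either category of the second directory and is carried over unmerged. A real implementation would need to handle nested categories and several categories mapping to the same target, which this sketch does not attempt.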
