Patrícia F. Castro, Geraldo B. Xexéo


Conventional information retrieval systems have proven ineffective in dealing with information overload. One possible solution is to incorporate some features that allow users of these systems to custom handle this information. In order to enable systems of this kind, some of the characteristics of present-day systems should be reviewed. Among other features, all documents are described with the same level of detail. We believe that the redrafting of document models is the starting point for reform of these systems. The paradigm of granular computing has proven to be very suitable for the treatment of complex problems and can produce significant results in large-scale environments such as the Web. This paper explores the granulation process of words with a view to its application in the subsequent improvement in document representation. We use fuzzy relations and spectral clustering in this process and present some result.


  1. Chakrabarti, S. 2003. Mining the Web: Discovering Knowledge from Hypertext Data. s.l. : Morgan Kaufmann, 2003.
  2. Doan, S, Ha, S and Horiguchi, S. 2005. A Fuzzy-Based Approach for text Representation in Text Categorization. 14th IEEE International Conference on Fuzzy Systems. 05 2005, pp. 1008-1013. ISBN: 0- 7803-9159-4.
  3. Dumais, S and Landauer, T. 1997. A Solution to Platós Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. Psychological Review. 1997. Vols. 104, No.2, pp. 211-240.
  4. Fishbein, J. 2008. Integrating Structure and Meaning Using Holographic Reduced Representation to Improve Automatic Text Classification. Master Thesis. University of Waterloo : s.n., 2008.
  5. Ganter, B, Stumme, G and Wille, R. 2005. Formal Concept Analysis: Foudations and Applications. Lecture Notes in Artificial Intelligence. s.l. : Springer-Verlag, 2005. ISBN 3-540-27891-5.
  6. Haruechaiyasak, C, Shyu, M and Chen, ML. 2002. Web Classification Based on Fuzzy Association. Proceedings of the 25th Annual International Computer Software and Applications Conference (COMPSAC02). 2002.
  7. Hobbs, J. 1985. Granularity. Proceeding of the Nineth International Joint Conference on Artificial Intelligence. 1985, pp. 423-435.
  8. Ingersen, P, Skov, B and Larsen, B. 2008. Inter and Intra-document Context Applied in Polyrepresentation for Best Match IR. Information Processing and Management: an International Journal. 2008, Vol. 44, pp. 1673-1683.
  9. Khalled, S. 2006. A Semantic Graph Model for Text Representation and Matching in Document Mining. PhD Thesis. University of Waterloo. Canadá : s.n., 2006.
  10. Kozima, T. 1993. Similarity Between Words Computed by Spreading Activation on an English Dictionary. Proceedings of the 6th Conference of the European Chapter of the ACL. 1993, pp. 232-239.
  11. Lin, T. 2007. Granular Computing and Modeling the Human Thoughts in Web Documents. Lecture Notes in Artificial Intelligence. Proceedings of the 12th International Fuzzy Systems Association World Congress on Foubdations of Fuzzy Logig and Soft Computing. Cancun, Mexico : s.n., 2007. pp. 263- 270. ISBN: 978-3-540-72917-4.
  12. Liu, G. 1994. The Semantic Vector Space Model (SVSM): A Text Representation and Searching Techmique System Sciences. Information Systems: Collaboration Technology Organizational Systems and Technology. Proceedings of the Twenty-Seventh Hawaii International Conference. 1994, Vol. IV, pp. 928-937.
  13. Ng, A and Jordan, M. 2001. On Spectral Clustering: Analysis and an Algorithm. Advances in Neural Information Processing Systems. 2001, Vol. 14.
  14. Predycz, W. 2005. Knowledge-Based Clustering. From Data to Information Granules. Hoboken, New Jersey : John Wiley & Sons, 2005.
  15. Rapp, R. 2002. The Computation of Word Associations: Comparing Syntagmatic and Paradigmatic Approaches. Proceedings of COLING-02. 2002.
  16. Steyvers, M and Griffiths, T. 2007. Probabilistic Topic Models. [book auth.] T Landauer, et al. Latent Semantic Analysis: A Road to Meaning. s.l. : Laurence Erlbaum, 2007.
  17. von Luxburg, U. 2006. A tutorial on Spectral Clustering. Technical Report 149 : Max Planck Institute for Biological Cybernetics, 2006.
  18. Yao, Y. 2007a. A Ten-year Review of Granular Computing. Proceedings of IEEE International Conference on Granular Computing. 2007a, pp. 734-739.
  19. Yao, Y and Zhong, Y. 2002. Granular Computing using Information Tables. [book auth.] T Y Lin, Y Yao and L A Zadeh. Data Mining, Rough Sets and Granular Computing. Heidelberg : Physica, 2002, pp. 102-124.
  20. Yao, Y. 2002. Information Retrieval Support Systems. The 2002 IEEE World Congress on Computational Intelligence, Honolulu, Hawai, USA. 2002, pp. 773- 778.
  21. -. 2007. The Art of Granular Computing. LNAI 4585. s.l. : Springer, 2007, pp. 101-102.
  22. Zadeh, L. 1993. Fuzzy Sets. Readings in Fuzzy Sets for Intelligent Systems. 1993.
  23. Zhong, N, et al. 2008. Towards Granular Reasoning on the Web. Proceedings of the 2008 Workshop on New Forms of Reasoning for Semantic Web: Scalable, Tolerant and Dynamic (NEFORD 2008), the 3rd Asian Semantic Web Conference (ASWC2008). 2008.

Paper Citation

in Harvard Style

F. Castro P. and B. Xexéo G. (2012). GRANULES OF WORDS FROM FUZZY RELATIONS AND SPECTRAL CLUSTERING . In Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8565-08-2, pages 683-688. DOI: 10.5220/0003934706830688

in Bibtex Style

author={Patrícia F. Castro and Geraldo B. Xexéo},
booktitle={Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},

in EndNote Style

JO - Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
SN - 978-989-8565-08-2
AU - F. Castro P.
AU - B. Xexéo G.
PY - 2012
SP - 683
EP - 688
DO - 10.5220/0003934706830688