Concept Profiles for Filtering Parliamentary Documents
Francisco J. Ribadas, Luis M. de Campos, Juan M. Fernández-Luna, Juan F. Huete
2015
Abstract
Content-based recommender/filtering systems help distribute information to the individuals or organizations likely to find it of interest. In this paper we describe a filtering system that addresses the problem of assigning documents to the members of parliament potentially interested in them. The proposed approach exploits subjects taken from a conceptual thesaurus to create the user profiles and to describe the documents to be filtered. The assignment of subjects to documents is modeled as a multilabel classification problem. Experiments with a real parliamentary corpus are reported, evaluating several methods to assign conceptual subjects to documents and to match those sets of subjects with user profiles.
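The matching step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which matching functions were evaluated, so the cosine similarity used here is an assumption, and every name (`build_profile`, `cosine_match`, `recommend`, the `threshold` value, the subject labels) is hypothetical. The sketch also assumes the subjects of each document have already been assigned, for instance by a multilabel classifier, which is not shown.

```python
from collections import Counter
from math import sqrt


def build_profile(subject_sets):
    """Aggregate the subject sets of a user's past documents into a
    weighted profile: subject -> number of documents carrying it."""
    return Counter(s for subjects in subject_sets for s in subjects)


def cosine_match(profile, doc_subjects):
    """Cosine similarity between a weighted profile and a document
    described by a set of subjects (treated as a binary vector).
    Cosine is an illustrative choice, not the paper's stated method."""
    if not profile or not doc_subjects:
        return 0.0
    dot = sum(profile.get(s, 0) for s in doc_subjects)
    norm_profile = sqrt(sum(w * w for w in profile.values()))
    norm_doc = sqrt(len(doc_subjects))
    return dot / (norm_profile * norm_doc)


def recommend(profiles, doc_subjects, threshold=0.3):
    """Return the users whose profile matches the document at or above
    the (hypothetical) threshold, best match first."""
    scores = {u: cosine_match(p, doc_subjects) for u, p in profiles.items()}
    selected = [u for u, s in scores.items() if s >= threshold]
    return sorted(selected, key=lambda u: -scores[u])


# Toy data: subject labels stand in for thesaurus descriptors.
profiles = {
    "mp_a": build_profile([{"health", "budget"}, {"health"}]),
    "mp_b": build_profile([{"agriculture"}]),
}
new_document = {"health"}
interested = recommend(profiles, new_document)
```

In this toy setup only `mp_a` shares a subject with the new document, so only that profile passes the threshold. Counting subject occurrences is one simple way to weight a profile; other weightings or similarity measures would slot into the same structure.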
Paper Citation
in Harvard Style
Ribadas F., de Campos L., Fernández-Luna J. and Huete J. (2015). Concept Profiles for Filtering Parliamentary Documents. In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015) ISBN 978-989-758-158-8, pages 409-416. DOI: 10.5220/0005616104090416
in Bibtex Style
@conference{kdir15,
author={Francisco J. Ribadas and Luis M. de Campos and Juan M. Fernández-Luna and Juan F. Huete},
title={Concept Profiles for Filtering Parliamentary Documents},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)},
year={2015},
pages={409-416},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005616104090416},
isbn={978-989-758-158-8},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)
TI - Concept Profiles for Filtering Parliamentary Documents
SN - 978-989-758-158-8
AU - Ribadas F.
AU - de Campos L.
AU - Fernández-Luna J.
AU - Huete J.
PY - 2015
SP - 409
EP - 416
DO - 10.5220/0005616104090416