loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Nuo Zhang ; Daisuke Matsuzaki ; Toshinori Watanabe and Hisashi Koga

Affiliation: Graduate School of Information Systems, The University of Electro-Communications, Japan

Keyword(s): Document analysis, PRDC, Topic extraction, Relation analysis, Clustering, Data compression.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Biomedical Engineering ; Biomedical Signal Processing ; Data Manipulation ; Data Mining ; Databases and Information Systems Integration ; Enterprise Information Systems ; Health Engineering and Technology Applications ; Human-Computer Interaction ; Knowledge Representation and Reasoning ; Methodologies and Methods ; Neurocomputing ; Neurotechnology, Electronics and Informatics ; Pattern Recognition ; Physiological Computing Systems ; Sensor Networks ; Signal Processing ; Soft Computing ; Symbolic Systems

Abstract: Nowadays, there are a great deal of e-documents can be easily accessed. It will be beneficial if a method can evaluate documents and abstract significant content. Similarity analysis and topic extraction are widely used as document relation analysis techniques. Most of the methods are based on dictionary-base morphological analysis. They cannot meet the requirement when the Internet grows fast and new terms appear but dictionary cannot be automatically updated fast enough. In this study, we propose a novel document relation analysis (topic extraction) method based on a compressibility vector. Our proposal does not require morphological analysis, and it can automatically evaluate input documents. We will examine the proposal with using model document and Reuters-21578 dataset, for relation analysis and topic extraction. The effectiveness of the proposed method will be shown in simulations.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.117.91.153

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Zhang, N.; Matsuzaki, D.; Watanabe, T. and Koga, H. (2009). DOCUMENT RELATION ANALYSIS BASED ON COMPRESSIBILITY VECTOR. In Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART; ISBN 978-989-8111-66-1; ISSN 2184-433X, SciTePress, pages 255-260. DOI: 10.5220/0001660202550260

@conference{icaart09,
author={Nuo Zhang. and Daisuke Matsuzaki. and Toshinori Watanabe. and Hisashi Koga.},
title={DOCUMENT RELATION ANALYSIS BASED ON COMPRESSIBILITY VECTOR},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART},
year={2009},
pages={255-260},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001660202550260},
isbn={978-989-8111-66-1},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the International Conference on Agents and Artificial Intelligence - ICAART
TI - DOCUMENT RELATION ANALYSIS BASED ON COMPRESSIBILITY VECTOR
SN - 978-989-8111-66-1
IS - 2184-433X
AU - Zhang, N.
AU - Matsuzaki, D.
AU - Watanabe, T.
AU - Koga, H.
PY - 2009
SP - 255
EP - 260
DO - 10.5220/0001660202550260
PB - SciTePress