NEW METHOD USING DECLINABLE WORDS AND CONCURRENT WORDS TO CREATE A LARGE NUMBER OF FA WORDS

El-Sayed Atlam, K. Morita, M. Fuketa, Jun-ichi Aoe

Abstract

The Readers can know the subject of many document fields by reading only some specific Field Association (FA) words. Document fields can be decided efficiently if there are many rank 1 FA words (words that direct connect to terminal fields) and if the frequency rate is high. This paper proposes a new method for increasing rank 1 FA words using declinable words and concurrent words which relate to narrow association categories and eliminate FA word ambiguity. Concurrent words become Concurrent Field Association Words (CFA words) if there is a little field overlap. Usually, efficient CFA words are difficult to extract using only frequency, so this paper proposes weighting according to degree of importance of concurrent words. The new weighting method causes Precision and Recall to be higher than by using frequency alone. Moreover, combining CFA words with FA words allow easy search of fields which can not be searched by using only FA words.

References

  1. Aoe, J., Morita K., and Mochizuki H. 1989. An Efficient Retrieval Algorithm of Collocate Information Using Tree Structure. Transaction of The IPSJ, 39 (9), pp.2563-2571.
  2. Atlam, E.-S., Morita K., and Aoe, J. 2002. A New Method For Selecting English Compound Terms and its Knowledge Representation. Information P. & Management Journal, Vol. 38, No. 6, pp. 807-821.
  3. Atlam, E.-S., Morita, K., Fuketa M., and Aoe, J. 2006. Automatic Building of New Field Association Word Candidates Using Search Engineā€, Information Processing & Management Journal, Vol.42, No. 4, pp.951-962.
  4. Breiman, L., Friedman, J.H., Olshen R. A. and Stone C.J. 1994. Classification and Regression Trees. Chapman Hall.
  5. Callen, J. P. 1994. Passage and level evidence in document retrieval. In Proc. of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.302- 310
  6. Dozawa, T. 1999. Innovative Multi Information Dictionary Imidas'99, Annual Series, Zueisha Publication Co., Japan ( In Japanese).
  7. Fuhr, N. 1989. Models for retrieval with probabilistic indexing, Information Processing and Retrieval, 25 (1), 55-72.
  8. Elmarhomy, G., Atlam, E.-S., Morita, K., Fuketa, M. and Aoe, J. 2006. Automatic Deletion of Unnecessary Field Association Word Using Morphological Analysis, International Journal of Computer and Mathematics, Vol. 83, 3, pp 247-262.
  9. Fukumoto, F., Suzuki, Y. 1996. Automatic Clustering of Articles using Dictionary definitions. Proceeding of the 16th International Conference on Computional Linguistic (COLING'96), 406-411.
  10. Iwayama, M. and Tokunaga, T., 1999. Probabilistic Passage Categorization and Its Application. Journal of Natural language Processing. Vol. 6 No. 3, pp. 181- 198.
  11. Melucii, M. 1998. Passage Retrieval and a Probabilistic technique. Information Processing and Management, 34(1), 43-68.
  12. Salton G., and McGill M.J., 1983. Introduction of Modern Information Retrieval. New York: McGraw-Hill.
  13. G. Salton. 1988, Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley.
  14. Tsuji, T., Nigazawa, H., Okada, M. and Aoe, J. 1999. Early Field Recognition by Using Field Association Words. In Conference on Computer Processing of Oriental Language, Vol. 2, pp. 301-304.
Download


Paper Citation


in Harvard Style

Atlam E., Morita K., Fuketa M. and Aoe J. (2010). NEW METHOD USING DECLINABLE WORDS AND CONCURRENT WORDS TO CREATE A LARGE NUMBER OF FA WORDS . In Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-674-021-4, pages 527-531. DOI: 10.5220/0002709705270531


in Bibtex Style

@conference{icaart10,
author={El-Sayed Atlam and K. Morita and M. Fuketa and Jun-ichi Aoe},
title={NEW METHOD USING DECLINABLE WORDS AND CONCURRENT WORDS TO CREATE A LARGE NUMBER OF FA WORDS},
booktitle={Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2010},
pages={527-531},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002709705270531},
isbn={978-989-674-021-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - NEW METHOD USING DECLINABLE WORDS AND CONCURRENT WORDS TO CREATE A LARGE NUMBER OF FA WORDS
SN - 978-989-674-021-4
AU - Atlam E.
AU - Morita K.
AU - Fuketa M.
AU - Aoe J.
PY - 2010
SP - 527
EP - 531
DO - 10.5220/0002709705270531