loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Takumi Sonoda and Takao Miura

Affiliation: HOSEI University, Japan

Keyword(s): Collocation, Co-occurrences, Feature Selection, Natural Language Processing.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence and Decision Support Systems ; Enterprise Information Systems ; Human Factors ; Human-Computer Interaction ; Interface Design ; Natural Language Interfaces to Intelligent Systems ; Physiological Computing Systems

Abstract: In this investigation, we discuss a computational approach to extract collocation based on both data mining and statistical techniques. We extend n-grams consisting of independent words and that we take frequencies on them after filtering on colligation. Then we apply statistical filters for the candidates, and compare these feature selection methods in statistical learning with each other. Five methods are evaluated, including term frequency (TF), Pairwise Mutual Information (PMI), Dice Coefficient(DC), T-Score (TS) and Pairwise Log-Likelihood ratio (PLL).We found PMI, MC and TS the most effective in our experiments. Using these we got 88 percent accuracy to extract collocation.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.220.160.216

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Sonoda, T. and Miura, T. (2013). Mining Japanese Collocation by Statistical Indicators. In Proceedings of the 15th International Conference on Enterprise Information Systems - Volume 3: ICEIS; ISBN 978-989-8565-59-4; ISSN 2184-4992, SciTePress, pages 381-388. DOI: 10.5220/0004397503810388

@conference{iceis13,
author={Takumi Sonoda. and Takao Miura.},
title={Mining Japanese Collocation by Statistical Indicators},
booktitle={Proceedings of the 15th International Conference on Enterprise Information Systems - Volume 3: ICEIS},
year={2013},
pages={381-388},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004397503810388},
isbn={978-989-8565-59-4},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the 15th International Conference on Enterprise Information Systems - Volume 3: ICEIS
TI - Mining Japanese Collocation by Statistical Indicators
SN - 978-989-8565-59-4
IS - 2184-4992
AU - Sonoda, T.
AU - Miura, T.
PY - 2013
SP - 381
EP - 388
DO - 10.5220/0004397503810388
PB - SciTePress