Pre-indexing Techniques in Arabic Information Retrieval

Souheila Ben Guirat; Ibrahim Bounhas; Yahia Slimani

Topics: Machine Learning; Natural Language Processing

In Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, 237-246, 2019, Prague, Czech Republic

Authors: Souheila Ben Guirat ¹ ; Ibrahim Bounhas ² and Yahia Slimani ³

Affiliations: ¹ Computer Sciences Department, Prince Sattam Bin Abdulaziz University, K.S.A., Laboratory of Computer Science for Industrial Systems, Carthage University, Tunisia, JARIR: Joint Group for Artificial Reasoning and Information Retrieval, Tunisia ; ² Laboratory of Computer Science for Industrial Systems, Carthage University, Tunisia, JARIR: Joint Group for Artificial Reasoning and Information Retrieval, Tunisia ; ³ Laboratory of Computer Science for Industrial Systems, Carthage University, Tunisia, Higher Institute of Multimedia Arts of Manouba (ISAMM), La Manouba University, Tunisia, JARIR: Joint Group for Artificial Reasoning and Information Retrieval, Tunisia

Keyword(s): Arabic Information Retrieval, Hybrid Index, Statistical Modeling, Smoothing.

Related Ontology Subjects/Areas/Topics: Applications ; Artificial Intelligence ; Computational Intelligence ; Evolutionary Computing ; Knowledge Discovery and Information Retrieval ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Machine Learning ; Natural Language Processing ; Pattern Recognition ; Soft Computing ; Symbolic Systems

Abstract: Arabic document indexing remains challenging given the morphological specificities of the language. Although much effort has been devoted to the field, developing more efficient indexing approaches is increasingly demanding. One of the most important issues concerns the choice of indexing units (e.g. stems, roots, lemmas), which should both enhance retrieval efficiency and optimize the indexing process. The question is how to process Arabic texts to retrieve the basic forms that best reflect the meaning of words and documents. In the literature, several indexing units have been compared, and combining multiple indexes seems promising. In our previous work, we showed that hybrid indexes based on stems, patterns and roots enhance results. However, we need to find the optimal weight of each indexing unit. Therefore, this paper aims to contribute to optimizing hybrid indexing. We compare and evaluate four pre-indexing methods.

CC BY-NC-ND 4.0

Paper citation in several formats:

Ben Guirat, S., Bounhas, I. and Slimani, Y. (2019). Pre-indexing Techniques in Arabic Information Retrieval. In Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART; ISBN 978-989-758-350-6; ISSN 2184-433X, SciTePress, pages 237-246. DOI: 10.5220/0007393402370246

@conference{icaart19,
author={Souheila {Ben Guirat} and Ibrahim Bounhas and Yahia Slimani},
title={Pre-indexing Techniques in Arabic Information Retrieval},
booktitle={Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2019},
pages={237-246},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007393402370246},
isbn={978-989-758-350-6},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 11th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - Pre-indexing Techniques in Arabic Information Retrieval
SN - 978-989-758-350-6
IS - 2184-433X
AU - Ben Guirat, S.
AU - Bounhas, I.
AU - Slimani, Y.
PY - 2019
SP - 237
EP - 246
DO - 10.5220/0007393402370246
PB - SciTePress