From the experiments carried out to evaluate
various IR models, we see that Jelinek-Mercer
model outperformed the other models on the
2AIRTC corpus. Regarding the extracted features
from the Indri index, there is a high correlation
between Jelinek-Mercer and Dirichlet query-
document features, but generally low correlations
between the different query-IDF features, with few
of them strongly pairwise-correlated. Based on our
results, it is clear that some queries perform better
than others. Most of the query terms are more of
specific terms and are able to discriminate retrieval
results. Regarding the extracted features from the
Terrier index, most of them have high correlations
between the different features (except for three of
them). An interesting point of discussion is that both
query-document and document Letor features have
the same correlation matrix as well as the same data
distribution for each feature. This is because we
nearly obtained the same results for both query-
document and document Letor features. Regarding
the computed evaluation metrics from both Terrier
and Indri (for both Dirichlet and Jelinek-Mercer
smoothing) indexes, we can see very similar data
distributions and correlations for the precision,
NDCG, MAP, recall, relative precision, set-based,
success and number-based metrics. The correlation
matrices between the query-IDF predictors and the
evaluation measures show very low PCC values.
6 CONCLUSION
QPP is the task of estimating the retrieval quality of
an IR system for a given query. In this study, we
investigated the retrieval effectiveness of various IR
models, examined the quality of some query
performance predictors in IR task, and explored the
correlations between some predictors. We carried
out a range of experiments to analyze the effect of
QPP methods on 2AIRT based on different IR
models and IR metrics. The performances of IR
models for queries were tested on 2AIRTC test
collection. Our findings reveal Jelinek-Mercer
model outperformed over the other models and more
correlated to Dirichlet than BM25. Strong
correlation is observed between TF, inb2, dhl3, dl
and dfic features. Future work needs to focus on
developing large test collections for generalizing the
quality of a retrieval model on a given query and for
investigating the consistence of each query
performance predictors across test collections and
query types. Further investigation can be made on
the quality of more post-retrieval strategies and
automatic query expansion on all queries and
selective queries using QPP methods.
REFERENCES
Abate, M. and Assabie, Y. (2014). Development of
Amharic morphological analyzer using memory based
learning. In Proc. of the 9th Int. Conf. on Natural
Language Processing, Warsaw, pp. 1-13.
Akdere, M., Çetintemel, U., Riondato, M., Upfal, E.,
Zdonik, S. (2012). Learning-based query performance
modelling and prediction. In Proceedings of 2012
IEEE 28th International Conference on Data
Engineering Arlington, VA, USA, pp. 390-401.
Alemayehu, N. and Willett, P. (2002). Stemming of
Amharic words for information retrieval. Literary and
Linguistic Computing, 17(1), pp.1–17.
Argaw, A.A and Asker, L. (2006). Amharic-English
information retrieval. In: Peters, C., et al. Evaluation
of Multilingual and Multi-modal Information
Retrieval. CLEF 2006. Lecture Notes in Computer
Science, vol 4730. Springer, Berlin, Heidelberg.
Carmel, D. and Yom-Tov, E. (2010). Estimating the query
difficulty for information retrieval. Synthesis Lectures
Inf. Concepts Retrieval Serv, 2(1).
Cronen-Townsend, S., Zhou, Y. and Croft, W. B. (2002).
Predicting query performance. In: J¨arvelin, K.,
Beaulieu, M., Baeza-Yates, R.A., Myaeng, S. (eds.)
SIGIR 2002: In Proceedings of the 25th Annual
International ACM SIGIR Conference on Research
and Development in Information Retrieval, Tampere,
Finland, pp. 299-306.
Datta,S., Ganguly,D., Mitra,M. and Greene, D. (2022). A
relative information gain-based query performance
prediction framework with generated query variants.
ACM Transactions on Information Systems, 1(1).
Eshetu, A., Teshome, G. and Abebe, T. (2020). Learning
word and sub-word vectors for Amharic (Less
Resourced Language). Int. J. Adv. Eng. Res. Sci.
(IJAERS), 7(8), pp. 358-366.
Gambäck, B. (2012). Tagging and verifying an Amharic
news corpus. Workshop on Language Technology for
Normalisation of Less-Resourced Languages
(SALTMIL8/AfLaT2012), Istanbul, Turkey,pp. 79-84.
Ganguly, D., Datta, S., Mitra, M. and Greene, D. (2022).
An analysis of variations in the effectiveness of query
performance prediction. In 44th European Conference
on Information Retrieval (ECIR 2022), Stavanger,
Norway, pp. 215-229.
Gasser, M., (2011). Hornmorpho: A system for
morphological processing of Amharic, Afaan Oromo,
and Tigrinya. In: conference on Human language
technology for development, Alexandria, Egypt, pp.
94-99.
He, B. and Ounis, I. (2006). Query performance
prediction. Information Systems, 31(7), pp. 585–594.
Mothe, J., Tanguy, L. (2005). Linguistic features to predict
query difficulty. In ACM Conference on research and