Heterogeneous Ensemble Learning for Modelling Species Distribution: A Case Study of Redstarts Habitat Suitability

Omar El Alaoui, Ali Idri, Ali Idri

2023

Abstract

Habitat protection is a critical aspect of species conservation, as restoring a habitat to its former state after it has been destroyed can be difficult. Species Distribution Models (SDMs), also known as habitat suitability models, are commonly used to address this issue. It finds ecological and evolutionary insights by linking species occurrences records to environmental data. Machine learning (ML) algorithms have been recently used to predict the distribution of species. Yet, a single ML algorithm may not always yield accurate predictions for a given dataset, making it challenging to develop a highly accurate model using a single algorithm. Therefore, this study proposes a novel approach to assess habitat suitability of three redstarts species based on ensemble learning techniques. Initially, eight machine learning algorithms, including MultiLayer Perceptron (MLP), Support Vector Machine (SVM), K-nearest neighbors (KNN), Decision Trees (DT), Gradient Boosting Classifier (GB), Random Forest (RF), AdaBoost (AB), and Quadratic Discriminant Analysis (QDA), were trained as base-learners. Subsequently, based on the performance of these base-learners, seven heterogeneous ensembles of two up to eight models, were constructed for each species dataset. The performance of the proposed approach was evaluated using five performance criteria (accuracy, sensitivity, specificity, AUC, and Kappa), Scott Knott (SK) test to statistically compare the performance of the presented models, and the Borda Count voting method to rank the best performing models based on multiple performance criteria. The findings revealed that the heterogeneous ensembles outperformed their singles in all three species datasets, underscoring the efficacy of the proposed approach in modelling species distribution.

Download


Paper Citation


in Harvard Style

El Alaoui O. and Idri A. (2023). Heterogeneous Ensemble Learning for Modelling Species Distribution: A Case Study of Redstarts Habitat Suitability. In Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA; ISBN 978-989-758-664-4, SciTePress, pages 105-114. DOI: 10.5220/0012118100003541


in Bibtex Style

@conference{data23,
author={Omar El Alaoui and Ali Idri},
title={Heterogeneous Ensemble Learning for Modelling Species Distribution: A Case Study of Redstarts Habitat Suitability},
booktitle={Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2023},
pages={105-114},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012118100003541},
isbn={978-989-758-664-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 12th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Heterogeneous Ensemble Learning for Modelling Species Distribution: A Case Study of Redstarts Habitat Suitability
SN - 978-989-758-664-4
AU - El Alaoui O.
AU - Idri A.
PY - 2023
SP - 105
EP - 114
DO - 10.5220/0012118100003541
PB - SciTePress