Assessment of explainable tree-based ensemble algorithms for the enhancement of Copernicus digital elevation model in agricultural lands
Loading...
Date
Authors
Okolie, Chukwuma
Adeleke, Adedayo
Mills, Jon
Smit, Julian
Maduako, Ikechukwu
Bagheri, Hossein
Komar, Tom
Wang, Shidong
Journal Title
Journal ISSN
Volume Title
Publisher
Taylor and Francis
Abstract
There has been a rapid evolution of tree-based ensemble algorithms
which have outperformed deep learning in several studies,
thus emerging as a competitive solution for many applications. In
this study, ten tree-based ensemble algorithms (random forest,
bagging meta-estimator, adaptive boosting (AdaBoost), gradient
boosting machine (GBM), extreme gradient boosting (XGBoost),
light gradient boosting (LightGBM), histogram-based GBM, categorical
boosting (CatBoost), natural gradient boosting (NGBoost), and
the regularised greedy forest (RGF)) were comparatively evaluated
for the enhancement of Copernicus digital elevation model (DEM)
in an agricultural landscape. The enhancement methodology combines
elevation and terrain parameters alignment, with featurelevel
fusion into a DEM enhancement workflow. The training dataset
is comprised of eight DEM-derived predictor variables, and the
target variable (elevation error). In terms of root mean square error
(RMSE) reduction, the best enhancements were achieved by GBM,
random forest and the regularised greedy forest at the first, second
and third implementation sites respectively. The computational
time for training LightGBM was nearly five-hundred times faster
than NGBoost, and the speed of LightGBM was closely matched by
the histogram-based GBM. Our results provide a knowledge base
for other researchers to focus their optimisation strategies on the
most promising algorithms.
Description
CODE AVAILABILITY STATEMENT : Code written in support of this publication is publicly available at https://github.com/mrjohnokolie/ dem-enhancement.
DATA AVAILABILITY STATEMENT : On reasonable request, the corresponding author will provide data that support the findings of this study.
DATA AVAILABILITY STATEMENT : On reasonable request, the corresponding author will provide data that support the findings of this study.
Keywords
Copernicus, Global digital elevation model, Machine learning, Tree-based ensembles, Bagging, Boosting, Gradient boosting, Explainability, Partial dependence, Light detection and ranging (LiDAR), SDG-09: Industry, innovation and infrastructure
Sustainable Development Goals
SDG-09: Industry, innovation and infrastructure
Citation
Chukwuma Okolie, Adedayo Adeleke, Jon Mills, Julian Smit, Ikechukwu
Maduako, Hossein Bagheri, Tom Komar & Shidong Wang (2024) Assessment of explainable tree-based ensemble algorithms for the enhancement of Copernicus digital elevation model in agricultural lands, International Journal of Image and Data Fusion, 15:4, 430-460, DOI:10.1080/19479832.2024.2329563.