
Data-centric Explainable Machine Learning for Land Cover Classification: A Practical Guide in R
Data-centric Explainable Machine Learning for Land Cover Classification: A Practical Guide in R
Land cover classification remains challenging due to satellite sensor calibration issues, lack of quality training data, spatial heterogeneity, and spectral confusion, to name but a few. Remote sensing researchers and analysts have successfully used supervised machine learning models for land cover classification. However, machine learning models do not clearly explain how and why they make predictions. Hence, there is a need to gain insights into or understand how machine learning models make predictive decisions. The insights will also make machine learning models more trustworthy and reliable.
Supervised machine learning algorithms require quality training data to perform land cover classification. In general, quality training data influences the effectiveness of supervised machine learning models and the accuracy of land cover maps. While training data preparation is time-consuming and expensive, remote sensing analysts and researchers should prepare quality training data rather than focus only on refining and tuning machine learning algorithms.
This book will improve land cover mapping using quality training data and seasonal optical and synthetic-aperture radar (SAR) data. It will also introduce concepts and techniques for starting data-centric explainable machine learning for land cover classification.
With this book, you will learn to:
- import satellite imagery and training sample points in R;
- prepare training data;
- perform exploratory data analysis;
- tune and train machine learning models;
- explore strategies to minimize class imbalance;
- perform explainable machine learning using the iml, DALEX, and iBreakdown packages and
- perform data-centric, explainable machine learning
Table of contents
Chapter 1. Introduction
Chapter 2. Explainable machine learning using the iml package
Chapter 3. Explainable machine learning using the DALEX package
Chapter 4. Exploring data-balancing strategies for land cover classification
Chapter 5. Towards data-centric explainable machine learning for land cover classification
Chapter 6. Concluding remarks
Appendix. Resources
Who should use this book?
This book does not introduce remote sensing image classification, R programming, or machine learning concepts. To learn more from this book, you must have some foundational knowledge and experience with remote sensing, R, and machine learning. The book aims to introduce data-centric explainable machine learning for land cover classification. I hope you will go through all the modules and understand how to improve the land cover classification.
Some of the potential readers of this book include:
- remote sensing professionals who already use machine learning for land cover classification but need to gain more insights from their models;
- remote sensing researchers and analysts, and data scientists who want an introduction to data-centric explainable machine learning tools and techniques and
- graduate students who wish to improve land cover classification using data-centric explainable machine learning tools and techniques.