English
 
Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Cross-Validation Strategy Impacts the Performance and Interpretation of Machine Learning Models

Sweet, L.-b., Müller, C., Anand, M., Zscheischler, J. (2023): Cross-Validation Strategy Impacts the Performance and Interpretation of Machine Learning Models. - Artificial Intelligence for the Earth Systems, 2, 4, e230026.
https://doi.org/10.1175/AIES-D-23-0026.1

Item is

Files

show Files
hide Files
:
Crossvalidation_accepted_230704.pdf (Preprint), 2MB
 
File Permalink:
-
Name:
Crossvalidation_accepted_230704.pdf
Description:
-
Visibility:
Private
MIME-Type / Checksum:
application/pdf
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-
:
28570oa.pdf (Publisher version), 6MB
Name:
28570oa.pdf
Description:
-
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-
License:
-

Locators

show

Creators

show
hide
 Creators:
Sweet, Lily-belle1, Author
Müller, Christoph2, Author              
Anand, Mohit1, Author
Zscheischler, Jakob1, Author
Affiliations:
1External Organizations, ou_persistent22              
2Potsdam Institute for Climate Impact Research, ou_persistent13              

Content

show
hide
Free keywords: -
 Abstract: Machine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.

Details

show
hide
Language(s): eng - English
 Dates: 2023-03-312023-07-032023-07-102023-10
 Publication Status: Finally published
 Pages: 14
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: Organisational keyword: RD2 - Climate Resilience
PIKDOMAIN: RD2 - Climate Resilience
Working Group: Land Use and Resilience
MDB-ID: No data to archive
Model / method: LPJmL
Model / method: Machine Learning
Regional keyword: Global
Research topic keyword: Food & Agriculture
OATYPE: Green Open Access
DOI: 10.1175/AIES-D-23-0026.1
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Artificial Intelligence for the Earth Systems
Source Genre: Journal, other, oa
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 2 (4) Sequence Number: e230026 Start / End Page: - Identifier: CoNE: https://publications.pik-potsdam.de/cone/journals/resource/2769-7525
Publisher: American Meteorological Society (AMS)