Deep reinforcement learning in World-Earth system models to discover 
sustainable management strategies

Strnad, Felix; Barfuss, Wolfram; Donges, Jonathan Friedemann; Heitzig, Jobst

doi:10.1063/1.5124673

Local TagsRelease HistoryDetailsSummary

Deep reinforcement learning in World-Earth system models to discover sustainable management strategies

Strnad, F., Barfuss, W., Donges, J. F., Heitzig, J. (2019): Deep reinforcement learning in World-Earth system models to discover sustainable management strategies. - Chaos, 29, 12, 123122.
https://doi.org/10.1063/1.5124673

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://publications.pik-potsdam.de/pubman/item/item_23422 Version Permalink: https://publications.pik-potsdam.de/pubman/item/item_23422_4

Genre: Journal Article

Files

show Files

hide Files

:

8655.pdf (Publisher version), 3MB

File Permalink:
-

Name:
8655.pdf

Description:
-

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Strnad, Felix¹, Author
Barfuss, Wolfram¹, Author
Donges, Jonathan Friedemann¹, Author
Heitzig, Jobst¹, Author

Affiliations:
1Potsdam Institute for Climate Impact Research, ou_persistent13

Content

show

hide

Free keywords: -

Abstract: Increasingly complex nonlinear World-Earth system models are used for describing the dynamics of the biophysical Earth system and the socioeconomic and sociocultural World of human societies and their interactions. Identifying pathways toward a sustainable future in these models for informing policymakers and the wider public, e.g., pathways leading to robust mitigation of dangerous anthropogenic climate change, is a challenging and widely investigated task in the field of climate research and broader Earth system science. This problem is particularly difficult when constraints on avoiding transgressions of planetary boundaries and social foundations need to be taken into account. In this work, we propose to combine recently developed machine learning techniques, namely, deep reinforcement learning (DRL), with classical analysis of trajectories in the World-Earth system. Based on the concept of the agent-environment interface, we develop an agent that is generally able to act and learn in variable manageable environment models of the Earth system. We demonstrate the potential of our framework by applying DRL algorithms to two stylized World-Earth system models. Conceptually, we explore thereby the feasibility of finding novel global governance policies leading into a safe and just operating space constrained by certain planetary and socioeconomic boundaries. The artificially intelligent agent learns that the timing of a specific mix of taxing carbon emissions and subsidies on renewables is of crucial relevance for finding World-Earth system trajectories that are sustainable in the long term. We propose a framework for using deep reinforcement learning (DRL) as an approach to extend the field of Earth system analysis by a new method. We build our framework upon the agent-environment interface concept. The agent can apply management options to models of the Earth system as the environment of interest and learn by rewards provided by the environment. We train our agent with a deep Q-neural network extended by current state-of-the-art algorithms. We find that the agent is able to learn novel, previously undiscovered policies that navigate the system into sustainable regions in two exemplary conceptual models of the World-Earth system.

Details

show

hide

Language(s):

Dates: Finally published : 2019

Publication Status: Finally published

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: Peer

Identifiers: DOI: 10.1063/1.5124673
PIKDOMAIN: RD4 - Complexity Science
PIKDOMAIN: RD1 - Earth System Analysis
eDoc: 8655
Research topic keyword: Climate Policy
Research topic keyword: Planetary Boundaries
Research topic keyword: Sustainable Development
Model / method: Agent-based Models
Model / method: Machine Learning
Regional keyword: Global
Organisational keyword: FutureLab - Game Theory & Networks of Interacting Agents
Organisational keyword: FutureLab - Earth Resilience in the Anthropocene
Organisational keyword: RD1 - Earth System Analysis
Organisational keyword: RD4 - Complexity Science
Working Group: Whole Earth System Analysis
Working Group: Network- and machine-learning-based prediction of extreme events

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: Chaos

Source Genre: Journal, SCI, Scopus, p3

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 29 (12) Sequence Number: 123122 Start / End Page: - Identifier: CoNE: https://publications.pik-potsdam.de/cone/journals/resource/180808