Formula-E race strategy development using distributed policy gradient reinforcement learning

Date

2021-01-20

Publisher

Elsevier

Type

Article

ISSN

0950-7051

Citation

Liu X, Fotouhi A, Auger DJ. (2021) Formula-E race strategy development using distributed policy gradient reinforcement learning. Knowledge-Based Systems, Volume 216, March 2021, Article number 106781

Abstract

Energy and thermal management is a crucial element of Formula-E race strategy development. In this study, race-level strategy development is formulated as a Markov decision process (MDP) problem featuring a hybrid-type action space. Deep Deterministic Policy Gradient (DDPG) reinforcement learning is implemented under the distributed Ape-X architecture and integrated with prioritized experience replay and reward shaping to optimize a hybrid-type set of actions with both continuous and discrete components. Soft boundary violation penalties in reward shaping significantly improve the performance of DDPG and enable it to generate faster race-finishing solutions. The proposed method shows superior performance compared with Monte Carlo Tree Search (MCTS) with policy gradient reinforcement learning, which solves this problem in a fully discrete action space as presented in the literature. The advantages are a faster race finishing time and better handling of ambient temperature rise.
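The abstract names two techniques: soft boundary violation penalties in reward shaping, and a hybrid action with continuous and discrete components. A minimal sketch of how such pieces could look follows; the function names, the choice of battery temperature as the constrained variable, and the penalty weight are illustrative assumptions, not details taken from the paper.

```python
def shaped_reward(base_reward, temperature, temp_limit, penalty_weight=10.0):
    # Soft boundary violation penalty (assumed form): rather than terminating
    # the episode when a thermal limit is exceeded, subtract a penalty that
    # grows with the size of the violation, keeping the learning signal smooth.
    violation = max(0.0, temperature - temp_limit)
    return base_reward - penalty_weight * violation

def split_hybrid_action(raw_action):
    # Hybrid action space (assumed mapping): the first component is used
    # directly as a continuous control, while the second is thresholded
    # into a discrete on/off decision.
    continuous = min(1.0, max(0.0, raw_action[0]))
    discrete = int(raw_action[1] > 0.0)
    return continuous, discrete
```

For example, a state within the limit leaves the reward untouched, `shaped_reward(1.0, 50.0, 60.0)` returns `1.0`, while a violation of 2 degrees yields `1.0 - 10.0 * 2.0 = -19.0`.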

Keywords

Energy management, Formula-E race strategy, Deep deterministic policy gradient, Reinforcement learning

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International
