Formula-E race strategy development using distributed policy gradient reinforcement learning

Liu, Xuze; Fotouhi, Abbas; Auger, Daniel J.

Formula-E race strategy development using distributed policy gradient reinforcement learning

dc.contributor.author	Liu, Xuze
dc.contributor.author	Fotouhi, Abbas
dc.contributor.author	Auger, Daniel J.
dc.date.accessioned	2021-01-25T17:57:04Z
dc.date.available	2021-01-25T17:57:04Z
dc.date.issued	2021-01-20
dc.description.abstract	Energy and thermal management is a crucial element in Formula-E race strategy development. In this study, the race-level strategy development is formulated into a Markov decision process (MDP) problem featuring a hybrid-type action space. Deep Deterministic Policy Gradient (DDPG) reinforcement learning is implemented under distributed architecture Ape-X and integrated with the prioritized experience replay and reward shaping techniques to optimize a hybrid-type set of actions of both continuous and discrete components. Soft boundary violation penalties in reward shaping, significantly improves the performance of DDPG and makes it capable of generating faster race finishing solutions. The new proposed method has shown superior performance in comparison to the Monte Carlo Tree Search (MCTS) with policy gradient reinforcement learning, which solves this problem in a fully discrete action space as presented in the literature. The advantages are faster race finishing time and better handling of ambient temperature rise.	en_UK
dc.identifier.citation	Liu X, Fotouhi A, Auger DJ. (2021) Formula-E race strategy development using distributed policy gradient reinforcement learning. Knowledge-Based Systems, Volume 216, March 2021, Article number 106781	en_UK
dc.identifier.issn	0950-7051
dc.identifier.uri	https://doi.org/10.1016/j.knosys.2021.106781
dc.identifier.uri	https://dspace.lib.cranfield.ac.uk/handle/1826/16246
dc.language.iso	en	en_UK
dc.publisher	Elsevier	en_UK
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Energy management	en_UK
dc.subject	Formula-E race strategy	en_UK
dc.subject	Deep deterministic policy gradient	en_UK
dc.subject	Reinforcement leaning	en_UK
dc.title	Formula-E race strategy development using distributed policy gradient reinforcement learning	en_UK
dc.type	Article	en_UK

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Formula-E_race_strategy_development-2021.pdf
Size:: 2.75 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.63 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Staff publications (SATM)