Multi-agent deep reinforcement learning for solving large-scale air traffic flow management problem: a time-step sequential decision approach

Tang, Yifan; Xu, Yan

dc.contributor.author	Tang, Yifan
dc.contributor.author	Xu, Yan
dc.date.accessioned	2023-01-05T19:49:23Z
dc.date.available	2023-01-05T19:49:23Z
dc.date.issued	2022-11-15
dc.description.abstract	In this paper, we address the demand-capacity balancing (DCB) problem in air traffic flow management, which we treat as a fully cooperative multi-agent learning task. First, a rule-based time-step environment is designed to mimic the DCB process. In this environment, each agent (a flight) decides its action at valid time steps. Three different rules, based on the remaining capacity and the number of cooperative flights in each sector, are defined to ease the learning process. Second, a multi-agent reinforcement learning framework built on proximal policy optimization (MAPPO) is proposed, using a parameter-sharing mechanism and the mean-field approximation method, in which an inherent feature of all other agents is extracted to address the credit assignment problem. Moreover, a supervisor-integrated MAPPO framework is proposed, in which a supervisor generates supervised actions to further improve learning performance. In the experiments, two performance indices, Search Capability and Generalization Capability, are considered. Both indices are assessed on two toy cases and a real-world case study. Results suggest that the supervisor-integrated MAPPO with supervised actions achieves the best performance across the different cases; the other proposed methods also show promising Search Capability, but demonstrate acceptable Generalization Capability only in cases simpler than the training cases.	en_UK
dc.identifier.citation	Tang Y, Xu Y. (2021) Multi-agent deep reinforcement learning for solving large-scale air traffic flow management problem: a time-step sequential decision approach. In: 2021 AIAA/IEEE 40th Digital Avionics Systems Conference (DASC), 3-7 October 2021, San Antonio, USA	en_UK
dc.identifier.eisbn	978-1-6654-3420-1
dc.identifier.isbn	978-1-6654-3421-8
dc.identifier.uri	https://doi.org/10.1109/DASC52595.2021.9594329
dc.identifier.uri	https://dspace.lib.cranfield.ac.uk/handle/1826/18882
dc.language.iso	en	en_UK
dc.publisher	IEEE	en_UK
dc.rights	Attribution-NonCommercial 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc/4.0/	*
dc.subject	air traffic flow management	en_UK
dc.subject	demand-capacity balance	en_UK
dc.subject	multi-agent reinforcement learning	en_UK
dc.subject	proximal policy optimization	en_UK
dc.title	Multi-agent deep reinforcement learning for solving large-scale air traffic flow management problem: a time-step sequential decision approach	en_UK
dc.type	Conference paper	en_UK
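
The abstract refers to the remaining capacity of each sector at each time step. The record does not describe the environment in detail, so the following is only a minimal illustrative sketch of the demand-capacity balancing idea, not the authors' environment. The data layout, the function names (`remaining_capacity`, `hotspots`), and the choice to count sector entries per time step are all assumptions made for this example.

```python
from collections import Counter

def remaining_capacity(flights, capacity, delays=None):
    """Return {(sector, time_step): capacity - demand} for every occupied cell.

    flights  : {flight_id: [(sector, entry_time_step), ...]}  (hypothetical layout)
    capacity : {sector: max entries allowed per time step}    (assumed definition)
    delays   : {flight_id: ground delay in time steps}; missing flights are not delayed
    """
    delays = delays or {}
    demand = Counter()
    for flight_id, plan in flights.items():
        shift = delays.get(flight_id, 0)
        for sector, step in plan:
            # A ground delay shifts every sector entry of the flight by the same amount.
            demand[(sector, step + shift)] += 1
    return {cell: capacity[cell[0]] - n for cell, n in demand.items()}

def hotspots(flights, capacity, delays=None):
    """Cells where demand exceeds capacity (negative remaining capacity)."""
    remaining = remaining_capacity(flights, capacity, delays)
    return sorted(cell for cell, r in remaining.items() if r < 0)

# Toy data invented for illustration only.
flights = {
    "F1": [("A", 0), ("B", 1)],
    "F2": [("A", 0), ("B", 2)],
    "F3": [("A", 0)],
}
capacity = {"A": 2, "B": 1}

print(hotspots(flights, capacity))             # overloaded cells before any delay
print(hotspots(flights, capacity, {"F3": 1}))  # after holding F3 for one time step
```

In this toy case three flights enter sector A at step 0 against a capacity of two, and delaying one of them by a single step removes the overload. A learning agent in a DCB setting would have to discover such delay decisions itself.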

Files

Original bundle

Name: solving_large-scale_air_traffic_flow_management_problem-2022.pdf
Size: 777.38 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.63 KB
Format: Item-specific license agreed upon to submission

Collections

Staff publications (SATM)