Strategic conflict management for performance-based urban air mobility operations with multi-agent reinforcement learning

Huang, Cheng; Petrunin, Ivan; Tsourdos, Antonios

Strategic conflict management for performance-based urban air mobility operations with multi-agent reinforcement learning

Files

Strategic_conflict_management-2022.pdf (3.1 MB)

Date

2022-07-26

Authors

Huang, Cheng
Petrunin, Ivan
Tsourdos, Antonios

Publisher

IEEE

Type

Conference paper

ISSN

2373-6720

URI

https://doi.org/10.1109/ICUAS54217.2022.9836139
https://dspace.lib.cranfield.ac.uk/handle/1826/18274

Citation

Huang C, Petrunin I, Tsourdos A. (2022) Strategic conflict management for performance-based urban air mobility operations with multi-agent reinforcement learning. In: 2022 International Conference on Unmanned Aircraft Systems (ICUAS), 21-24 June 2022, Dubrovnik, Croatia

Abstract

With the urban air mobility (UAM) quickly evolving, the great demand for public airborne transit and deliveries, besides creating a big market, will result in a series of technical, operational, and safety problems. This paper addresses the strategic conflict issue in low-altitude UAM operations with multi-agent reinforcement learning (MARL). Considering the difference in flight characteristics, the aircraft performance is fully integrated into the design process of strategic deconfliction components. With this concept, the multi-resolution structure for the low-altitude airspace organization, Gaussian Mixture Model (GMM) for the speed profile generation, and dynamic separation minima enable efficient UAM operations. To resolve the demand and capacity balancing (DCB) issue and the separation conflict at the strategic stage, the multi-agent asynchronous advantage actor-critic (MAA3C) framework is built with mask recurrent neural networks (RNNs). Meanwhile, variable agent number, dynamic environments, heterogeneous aircraft performance, and action selection between speed adjustment and ground delay can be well handled. Experiments conducted on a developed prototype and various scenarios indicate the obvious advantages of the constructed MAA3C in minimizing the delay cost and refining speed profiles. And the effectiveness, scalability, and stabilization of the MARL solution are ultimately demonstrated.

Keywords

Recurrent neural networks, Scalability, Refining, Weather forecasting, Reinforcement learning, Organizations, Delays

Rights

Attribution-NonCommercial 4.0 International

http://creativecommons.org/licenses/by-nc/4.0/

Collections

Staff publications (SATM)

Full item page

Strategic conflict management for performance-based urban air mobility operations with multi-agent reinforcement learning

Files

Date

Authors

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

Department

Type

ISSN

Format

Free to read from

URI

Citation

Abstract

Description

Software Description

Software Language

Github

Keywords

DOI

Rights

Relationships

Relationships

Supplements

Funder/s

Collections