Uncovering reward goals in distributed drone swarms using physics-informed multiagent inverse reinforcement learning

Date published

2025-01-01

Free to read from

2025-02-27

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE

Department

Type

Article

ISSN

2168-2267

Format

Citation

Perrusquía A, Guo W. (2025) Uncovering reward goals in distributed drone swarms using physics-informed multiagent inverse reinforcement learning. IEEE Transactions on Cybernetics, Volume 55, Issue 1, January 2025, pp. 14-23

Abstract

The cooperative nature of drone swarms poses risks in the smooth operation of services and the security of national facilities. The control objective of the swarm is, in most cases, occluded due to the complex behaviors observed in each drone. It is paramount to understand which is the control objective of the swarm, whilst understanding better how they communicate with each other to achieve the desired task. To solve these issues, this article proposes a physics-informed multiagent inverse reinforcement learning (PI-MAIRL) that: 1) infers the control objective function or reward function from observational data and 2) uncover the network topology by exploiting a physics-informed model of the dynamics of each drone. The combined contribution enables to understand better the behavior of the swarm, whilst enabling the inference of its objective for experience inference and imitation learning. A physically uncoupled swarm scenario is considered in this study. The incorporation of the physics-informed element allows to obtain an algorithm that is computationally more efficient than model-free IRL algorithms. Convergence of the proposed approach is verified using Lyapunov recursions on a global Riccati equation. Simulation studies are carried out to show the benefits and challenges of the approach.

Description

Software Description

Software Language

Github

Keywords

Drone swarms, imitation learning, multiagent inverse reinforcement learning (IRL), network topology, physics-informed, reward function, 46 Information and Computing Sciences, 4602 Artificial Intelligence, Behavioral and Social Science, Basic Behavioral and Social Science

DOI

Rights

Attribution 4.0 International

Relationships

Relationships

Resources

Funder/s