Safe online learning for nonlinear dynamical systems using control contraction metrics

dc.contributor.advisorShin, Hyo-Sang
dc.contributor.advisorIgnatyev, Dmitry
dc.contributor.advisorTsourdos, Antonios
dc.contributor.authorOsborne, Matthew
dc.date.accessioned2024-09-05T12:49:18Z
dc.date.available2024-09-05T12:49:18Z
dc.date.freetoread2024-09-05
dc.date.issued2023-03
dc.description.abstractThis thesis aims to develop an online learning framework for a military fixed-wing aircraft that can adapt a control policy to unforeseen changes in the airframe’s flight dynamics. This is an active area of research and a significant challenge for high dimensional non linear systems due to the inherent safety risks and computational challenges of solving exponential time algorithms. The research achieves this aim by providing an extensive survey of safe online learn ing approaches for nonlinear systems by assessing each technique with the aid of key performance metrics. Two critical performance metrics are the reliability and time com plexity of the approach taken. To support the survey a benchmarking study of salient techniques provides further evidence to support the findings of the literature survey to identify promising avenues of research. A generic safe learning process is defined and a convex optimisation learning pipeline is developed to handle nonlinear system identification and online controller synthesis via control contraction metrics. The developed pipeline is applied to a longitudinal simulation of an F-16 aircraft using high-fidelity wind tunnel data. The gap in knowledge around the application of control contraction metrics to aircraft designed to meet flying qualities requirements based on linear time invariant theory is bridged. A novel cascaded two loop algorithm is developed to explicitly place the eigenvalues of the inner and outer loop of a differential feedback controller. Further a parameterisa tion of a robust controller is shown to better optimise the performance relative to flying qualities specifications. Conditions for a hybrid linear sum of controllers is shown to provide a stability guarantee for a mixed controller that enables a performance trade-off of each approach. The performance of the developed controllers is demonstrated on six damaged aircraft profiles to assess the robustness and transient characteristics for each method based on a forty second flight trajectory. The variation in nonlinear damage profiles illustrates the limitations of a linear ap proximating function for three nonlinear deviations. We show that the robust quadratic regulator controller generates a smoother transient response compared to the exponential contraction controllers. The two-loop contraction metric controller improves the rise-time performance compared to a single loop but is less robust to damage variations. The out come of the research is a greater understanding of the application of contraction-based controllers and the effect of tuning parameters for a robust controller with potential for a reinforcement learning algorithm. Further a method to hybridise control policies is proposed and a loop shaping method using contraction based linear matrix inequalities developed with potential application to cascaded systems.
dc.description.coursenamePhD in Manufacturing
dc.description.sponsorshipEngineering and Physical Sciences Research Council (EPSRC)
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/22898
dc.language.isoen
dc.publisherCranfield University
dc.publisher.departmentSATM
dc.rights© Cranfield University, 2023. All rights reserved. No part of this publication may be reproduced without the written permission of the copyright holder.
dc.subjectSafe Learning
dc.subjectModel Based Reinforcement Learning
dc.subjectAdaptive Control
dc.subjectNonlinear Sys tems
dc.subjectControl Contraction Metrics
dc.subjectFixed-Wing Control
dc.subjectConvex Optimisation.
dc.titleSafe online learning for nonlinear dynamical systems using control contraction metrics
dc.typeThesis
dc.type.qualificationlevelDoctoral
dc.type.qualificationnamePhD

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Osborne_M_2023.pdf
Size:
44.07 MB
Format:
Adobe Portable Document Format
Description:
Standard 6 month embargo.
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed upon to submission
Description: