Cost inference of discrete-time linear quadratic control policies using human-behaviour learning

Perrusquía, Adolfo; Guo, Weisi

dc.contributor.author	Perrusquía, Adolfo
dc.contributor.author	Guo, Weisi
dc.date.accessioned	2022-07-06T10:47:14Z
dc.date.available	2022-07-06T10:47:14Z
dc.date.issued	2022-06-30
dc.description.abstract	In this paper, a cost inference algorithm for discrete-time systems using human-behaviour learning is proposed. The approach is inspired by the complementary learning exhibited by the neocortex, hippocampus, and striatum learning systems to achieve complex decision making. The main objective is to infer the hidden cost function from the expert's data associated with the hippocampus (off-policy data) and transfer it to the neocortex for policy generalization (on-policy data) in different systems and environments. The neocortex is modelled by a Q-learning algorithm and a least-squares identification algorithm for on-policy learning and system identification, respectively. The cost inference is obtained using a one-step gradient descent rule and an inverse optimal control algorithm. Convergence of the cost inference algorithm is discussed using Lyapunov recursions. Simulations verify the effectiveness of the approach.	en_UK
dc.identifier.citation	Perrusquía A, Guo W. (2022) Cost inference of discrete-time linear quadratic control policies using human-behaviour learning. In: CoDIT 2022: 8th International Conference on Control, Decision and Information Technologies, 17-20 May 2022, Istanbul, Turkey, pp. 165-170	en_UK
dc.identifier.eisbn	978-1-6654-9607-0
dc.identifier.eissn	2576-3555
dc.identifier.isbn	978-1-6654-9608-7
dc.identifier.issn	2576-3547
dc.identifier.uri	https://doi.org/10.1109/CoDIT55151.2022.9804118
dc.identifier.uri	https://dspace.lib.cranfield.ac.uk/handle/1826/18152
dc.language.iso	en	en_UK
dc.publisher	IEEE	en_UK
dc.rights	Attribution-NonCommercial 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc/4.0/	*
dc.subject	Learning systems	en_UK
dc.subject	Costs	en_UK
dc.subject	Q-learning	en_UK
dc.subject	Decision making	en_UK
dc.subject	Optimal control	en_UK
dc.subject	Cost function	en_UK
dc.subject	Inference algorithms	en_UK
dc.title	Cost inference of discrete-time linear quadratic control policies using human-behaviour learning	en_UK
dc.type	Conference paper	en_UK
dcterms.dateAccepted	2022-03-11
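
The abstract names a least-squares identification step and a discrete-time linear quadratic setting. The sketch below illustrates only those two standard building blocks under stated assumptions; it is not the authors' algorithm. The system matrices, the weights Q and R, and the function names identify_linear_system and lqr_gain are all hypothetical choices made for illustration, and the paper's Q-learning, gradient descent, and inverse optimal control steps are not reproduced.

```python
import numpy as np

def identify_linear_system(X, U, X_next):
    """Least-squares estimate of (A, B) in x_{k+1} = A x_k + B u_k.

    X: (N, n) states, U: (N, m) inputs, X_next: (N, n) successor states.
    """
    Z = np.hstack([X, U])                               # regressors [x_k, u_k]
    Theta, *_ = np.linalg.lstsq(Z, X_next, rcond=None)  # X_next ~ Z @ Theta
    n = X.shape[1]
    return Theta[:n].T, Theta[n:].T                     # A_hat, B_hat

def lqr_gain(A, B, Q, R, iters=1000):
    """Discrete-time LQR gain via fixed-point iteration of the Riccati equation."""
    P = Q.copy()
    for _ in range(iters):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
    return K, P

# Hypothetical example system (a sampled double integrator), assumed for illustration.
A_true = np.array([[1.0, 0.1],
                   [0.0, 1.0]])
B_true = np.array([[0.0],
                   [0.1]])

# Noise-free synthetic data: random states and inputs, one step ahead.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 2))
U = rng.standard_normal((200, 1))
X_next = X @ A_true.T + U @ B_true.T

A_hat, B_hat = identify_linear_system(X, U, X_next)

# Assumed cost weights; in the paper's setting these would be the unknowns to infer.
Q = np.eye(2)
R = np.eye(1)
K, P = lqr_gain(A_hat, B_hat, Q, R)
```

Under these assumptions, the control law u_k = -K x_k is the policy an expert with cost weights (Q, R) would apply; a cost inference method works in the opposite direction, starting from observed behaviour and recovering weights consistent with it.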

Files

Original bundle

Name: Discrete-time_linear_quadratic_control_policies-2022.pdf
Size: 1.09 MB
Format: Adobe Portable Document Format

Collections

Staff publications (SATM)