A complementary learning approach for expertise transference of human-optimized controllers
dc.contributor.author | Perrusquía, Adolfo | |
dc.date.accessioned | 2021-10-26T10:40:21Z | |
dc.date.available | 2021-10-26T10:40:21Z | |
dc.date.issued | 2021-10-21 | |
dc.description.abstract | In this paper, a complementary learning scheme for experience transference of unknown continuous-time linear systems is proposed. The algorithm is inspired in the complementary learning properties that exhibit the hippocampus and neocortex learning systems via the striatum. The hippocampus is modelled as pattern-separated data of a human optimized controller. The neocortex is modelled as a Q-reinforcement learning algorithm which improves the hippocampus control policy. The complementary learning (striatum) is designed as an inverse reinforcement learning algorithm which relates the hippocampus and neocortex learning models to seek and transfer the weights of the hidden expert’s utility function. Convergence of the proposed approach is analysed using Lyapunov recursions. Simulations are given to verify the proposed approach. | en_UK |
dc.identifier.citation | Perrusquia A. (2022) A complementary learning approach for expertise transference of human-optimized controllers. Neural Networks, Volume 145, January 2022, pp. 33-41 | en_UK |
dc.identifier.issn | 0893-6080 | |
dc.identifier.uri | https://doi.org/10.1016/j.neunet.2021.10.009 | |
dc.identifier.uri | https://dspace.lib.cranfield.ac.uk/handle/1826/17204 | |
dc.language.iso | en | en_UK |
dc.publisher | Elsevier | en_UK |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Complementary learning | en_UK |
dc.subject | Hippocampus and neocortex learning systemsQ-learning | en_UK |
dc.subject | Inverse reinforcement learning | en_UK |
dc.subject | Batch least squares | en_UK |
dc.subject | Gradient-descent rule | en_UK |
dc.title | A complementary learning approach for expertise transference of human-optimized controllers | en_UK |
dc.type | Article | en_UK |