Data-driven model reference control of MIMO vertical tank systems with model-free VRFT and Q-Learning
- PMID:29325777
- DOI: 10.1016/j.isatra.2018.01.014
Data-driven model reference control of MIMO vertical tank systems with model-free VRFT and Q-Learning
Abstract
This paper proposes a combined Virtual Reference Feedback Tuning-Q-learning model-free control approach, which tunes nonlinear static state feedback controllers to achieve output model reference tracking in an optimal control framework. The novel iterative Batch Fitted Q-learning strategy uses two neural networks to represent the value function (critic) and the controller (actor), and it is referred to as a mixed Virtual Reference Feedback Tuning-Batch Fitted Q-learning approach. Learning convergence of the Q-learning schemes generally depends, among other settings, on the efficient exploration of the state-action space. Handcrafting test signals for efficient exploration is difficult even for input-output stable unknown processes. Virtual Reference Feedback Tuning can ensure an initial stabilizing controller to be learned from few input-output data and it can be next used to collect substantially more input-state data in a controlled mode, in a constrained environment, by compensating the process dynamics. This data is used to learn significantly superior nonlinear state feedback neural networks controllers for model reference tracking, using the proposed Batch Fitted Q-learning iterative tuning strategy, motivating the original combination of the two techniques. The mixed Virtual Reference Feedback Tuning-Batch Fitted Q-learning approach is experimentally validated for water level control of a multi input-multi output nonlinear constrained coupled two-tank system. Discussions on the observed control behavior are offered.
Keywords: Batch fitted Q-learning; Model reference tracking; Model-free optimal control; Multi input-multi output systems; Neural networks; Vertical tank systems; Virtual reference feedback tuning.
Copyright © 2018 ISA. Published by Elsevier Ltd. All rights reserved.
Similar articles
- Model-Free Primitive-Based Iterative Learning Control Approach to Trajectory Tracking of MIMO Systems With Experimental Validation.Radac MB, Precup RE, Petriu EM.Radac MB, et al.IEEE Trans Neural Netw Learn Syst. 2015 Nov;26(11):2925-38. doi: 10.1109/TNNLS.2015.2460258. Epub 2015 Aug 13.IEEE Trans Neural Netw Learn Syst. 2015.PMID:26285221
- Data-based virtual unmodeled dynamics driven multivariable nonlinear adaptive switching control.Chai T, Zhang Y, Wang H, Su CY, Sun J.Chai T, et al.IEEE Trans Neural Netw. 2011 Dec;22(12):2154-72. doi: 10.1109/TNN.2011.2167685. Epub 2011 Nov 16.IEEE Trans Neural Netw. 2011.PMID:22106143
- Trajectory Tracking within a Hierarchical Primitive-Based Learning Approach.Radac MB.Radac MB.Entropy (Basel). 2022 Jun 28;24(7):889. doi: 10.3390/e24070889.Entropy (Basel). 2022.PMID:35885112Free PMC article.
- Modeling and control of non-square MIMO system using relay feedback.Kalpana D, Thyagarajan T, Gokulraj N.Kalpana D, et al.ISA Trans. 2015 Nov;59:408-17. doi: 10.1016/j.isatra.2015.09.012. Epub 2015 Oct 6.ISA Trans. 2015.PMID:26453020Review.
- Long-latency reflexes for inter-effector coordination reflect a continuous state feedback controller.Crevecoeur F, Kurtzer I.Crevecoeur F, et al.J Neurophysiol. 2018 Nov 1;120(5):2466-2483. doi: 10.1152/jn.00205.2018. Epub 2018 Aug 22.J Neurophysiol. 2018.PMID:30133376Review.
Cited by
- Data-based bipartite formation control for multi-agent systems with communication constraints.Wang J, Zhao H, Yu H, Yang R, Li J.Wang J, et al.Sci Prog. 2024 Jan-Mar;107(1):368504241227620. doi: 10.1177/00368504241227620.Sci Prog. 2024.PMID:38361488Free PMC article.
- Distributed Model-Free Bipartite Consensus Tracking for Unknown Heterogeneous Multi-Agent Systems with Switching Topology.Zhao H, Peng L, Yu H.Zhao H, et al.Sensors (Basel). 2020 Jul 27;20(15):4164. doi: 10.3390/s20154164.Sensors (Basel). 2020.PMID:32726953Free PMC article.
LinkOut - more resources
Full Text Sources
Other Literature Sources