Acta Automatica Sinica (自动化学报), 2013

Optimal tracking control for a class of unknown discrete-time systems with actuator saturation based on a data-based ADP algorithm

DOI: 10.3724/SP.J.1004.2013.01413, PP. 1413-1420

Song Ruizhuo (宋睿卓), Xiao Wendong (肖文栋), Sun Changyin (孙长银)

Keywords: adaptive dynamic programming, iterative algorithm, optimal tracking control, data-based, identifier

Abstract:

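This paper proposes a new optimal tracking control scheme for a class of discrete-time nonlinear systems with unknown dynamics and actuator saturation. The scheme is based on an iterative adaptive dynamic programming (ADP) algorithm. To realize the optimal control, a data-based identifier of the unknown system dynamics is first established. By introducing an M network, an exact expression for the steady-state control is obtained. To eliminate the effect of actuator saturation, a nonquadratic performance index function is proposed. An iterative ADP algorithm is then developed to solve for the optimal tracking control, and a convergence analysis is given. To implement the optimal control scheme, neural networks are used to build the data-based identifier, compute the performance index function, approximate the optimal control policy, and solve for the steady-state control. Simulation results verify the effectiveness of the proposed optimal tracking control method.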
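
The abstract does not give the form of the nonquadratic performance index. As a rough illustration only, the sketch below evaluates one form often used in the ADP literature for bounded controls, W(u) = 2r ∫_0^u u_bar · atanh(s / u_bar) ds, for a scalar control. The function names, the scalar setting, and the parameters `u_bar` (saturation bound) and `r` (control weight) are assumptions for this example and are not taken from the paper.

```python
import math


def nonquadratic_cost(u, u_bar=1.0, r=1.0):
    """Closed form of W(u) = 2*r * integral_0^u u_bar*atanh(s/u_bar) ds.

    Assumed illustrative form; valid only for |u| < u_bar.
    """
    if abs(u) >= u_bar:
        raise ValueError("control must lie strictly inside the saturation bound")
    x = u / u_bar
    # Antiderivative of u_bar*atanh(s/u_bar) is
    # u_bar*s*atanh(s/u_bar) + (u_bar**2 / 2) * ln(1 - (s/u_bar)**2).
    return 2.0 * r * u_bar * u * math.atanh(x) + r * u_bar ** 2 * math.log1p(-x * x)


def nonquadratic_cost_numeric(u, u_bar=1.0, r=1.0, steps=10000):
    """Midpoint-rule evaluation of the same integral, used as a cross-check."""
    h = u / steps
    total = sum(u_bar * math.atanh((i + 0.5) * h / u_bar) for i in range(steps))
    return 2.0 * r * total * h


if __name__ == "__main__":
    for u in (0.01, 0.5, 0.9, 0.99):
        print(u, nonquadratic_cost(u), r * u * u if (r := 1.0) else None)
```

For small controls this penalty behaves like the quadratic term r·u², while it grows much faster than r·u² as |u| approaches `u_bar`, which is the usual motivation for choosing a cost of this kind when the actuator saturates.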

