Holenda B, Domokos E, Rédey A, Fazakas J. Dissolved oxygen control of the activated sludge wastewater treatment process using model predictive control. Computers and Chemical Engineering, 2008, 32(6): 1270-1278
[2]
Zhang H G, Cui L L, Zhang X, Luo Y H. Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method. IEEE Transactions on Neural Networks, 2011, 22(12): 2226-2236
[3]
Wang F Y, Zhang H G, Liu D R. Adaptive dynamic programming: an introduction. IEEE Computational Intelligence Magazine, 2009, 4(2): 39-47
[4]
Wei Qing-Lai, Zhang Hua-Guang, Liu De-Rong, Zhao Yan. An optimal control scheme for a class of discrete-time nonlinear systems with time delays using adaptive dynamic programming. Acta Automatica Sinica, 2010, 36(1): 121-129 (魏庆来, 张化光, 刘德荣, 赵琰. 基于自适应动态规划的一类带有时滞的离散时间非线性系统的最优控制策略. 自动化学报, 2010, 36(1): 121-129)
[5]
Zhao Dong-Bin, Liu De-Rong, Yi Jian-Qiang. An overview on the adaptive dynamic programming based urban city traffic signal optimal control. Acta Automatica Sinica, 2009, 35(6): 676-681 (赵冬斌, 刘德荣, 易建强. 基于自适应动态规划的城市交通信号优化控制方法综述. 自动化学报, 2009, 35(6): 676-681)
[6]
Jaeger H. The "echo state" approach to analysing and training recurrent neural networks. GMD Report, German National Research Center for Information Technology, 2001, 12(8): 1-43
[7]
IWA Taskgroup on Benchmarking of Control Stategies for WWTPs. Benchmark simulation model No.1 (BSM1) [Online], available: http://www.iwapublishing.com, April 2008
[8]
Shi Xiong-Wei, Qiao Jun-Fei, Yuan Ming-Zhe. Optimal control for wastewater treatment process based on improved particle optimization algorithm. Information and Control, 2011, 40(5): 698-703 (史雄伟, 乔俊飞, 苑明哲. 基于改进粒子群优化算法的污水处理过程优化控制. 信息与控制, 2011, 40(5): 698-703)
[9]
Dellana S A, West D. Predictive modeling for wastewater applications: linear and nonlinear approaches. Environmental Modelling & Software, 2009, 24(1): 96-106
[10]
Lewis F L, Vamvoudakis K G. Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data. IEEE Transactions on Systems, Man, and Cybernetics — Part B: Cybernetics, 2011, 41(1): 14-25
[11]
Wei Qing-Lai, Zhang Hua-Guang, Cui Li-Li. Data-based optimal control for discrete-time zero-sum games of 2-D systems using adaptive critic designs. Acta Automatica Sinica, 2009, 35(6): 682-692(魏庆来, 张化光, 崔黎黎. 基于数据自适应评判的离散2-D系统零和博弈最优控制. 自动化学报, 2009, 35(6): 682-692)
[12]
Fu J, He H B, Zhou X M. Adaptive learning and control for MIMO system based on adaptive dynamic programming. IEEE Transactions on Neural Networks, 2011, 22(7): 1133-1148
[13]
White D A, Sofge D A. Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches. New York: Van Nostrand Reinhold Press, 1992
[14]
Busoniu L, Babuska R, De Schutter B. Reinforcement Learning and Dynamic Programming Using Function Approximators. Boca Raton: CRC Press, 2010