Kamioka T,Uchibe E,et al.Multiobjective reinforcement learning based on multiple value functions[J].IEIC Technical Report (Institute of Electronics,Information and Communication Engineers),2006,105(658):127-132.
[2]
Mariano C,Morales E.A new distributed reinforcement learning algorithm for multiple objective optimization problems[M]//Berlin:Springer Berlin Heidelberg,2000:290-299.
[3]
Mariano C E,Morales E F.Distributed reinforcement learning for multiple objective optimization problems[C]//Proceedings of the 2000 Congress on Evolutionary Computation.La Jolla,CA:IEEE,2000:188-195.
[4]
Zhao Yun.Research on several issues in reinforcement learning[D].Nanjing:Nanjing University of Science and Technology,2009(in Chinese).
[5]
Zitzler E,Deb K,Thiele L.Comparison of multiobjective evolutionary algorithms:empirical results[J].Evolutionary Computation,2000,8(2):173-195.
[6]
Das I,Dennis J E.Normal-boundary intersection: a new method for generating the Pareto surface in nonlinear multicriteria optimization problems[J].SIAM Journal on Optimization,1998,8(3):631-657.
[7]
Roman C,Rosehart W.Evenly distributed Pareto points in multi-objective optimal power flow[J].IEEE Transactions on Power Systems,2006,21(2):1011-1012.
[8]
Xiong Ning,Cheng Haozhong,Ma Zeliang,et al.The determination of substitute degree between generation cost and loading margin based on NBI method[J].Automation of Electric Power Systems,2010,34(5):34-37(in Chinese).
[9]
Guo Qinglai,Sun Hongbin,Zhang Boming,et al.Study on coordinated secondary voltage control[J].Automation of Electric Power Systems,2005,29(23):19-24(in Chinese).
[10]
Zhang Anan,Yang Honggeng.A new ε-domination based fuzzy multi-objective reactive power optimization approach[J].Automation of Electric Power Systems,2009,33(5):34-39(in Chinese).
[11]
Konak A,Coit D W,Smith A E.Multi-objective optimization using genetic algorithms:a tutorial[J].Reliability Engineering & System Safety,2006,91(9):992-1007.
[12]
Bui L T. Multi-objective optimization in computational intelligence: theory and practice[M].Information Science Reference,2008.
[13]
Zhang Q,Li H.MOEA/D:A multiobjective evolutionary algorithm based on decomposition[J].IEEE Transactions on Evolutionary Computation,2007,11(6):712-731.
[14]
Liao H L,Wu Q H,Jiang L.Multi-objective optimization by reinforcement learning for power system dispatch and voltage stability[C]//Innovative Smart Grid Technologies Conference Europe (ISGT Europe).Gothenburg:IEEE,2010:1-8.
[15]
Barraclough D J,Conroy M L,Lee D.Prefrontal cortex and decision making in a mixed-strategy game[J].Nature Neuroscience,2004,7(4):404-410.
[16]
Fu W T,Anderson J R.From recurrent choice to skill learning:a reinforcement-learning model[J].Journal of Experimental Psychology:General,2006,135(2):184-206.
[17]
Sutton R S,Barto A G.Reinforcement learning:an introduction[M].Cambridge:MIT Press,1998.
[18]
Nouri M A,Hesami A,Seifi A.Reactive power planning in distribution systems using a reinforcement learning method[C]//IEEE International Conference on Intelligent and Advanced Systems.Kuala Lumpur:IEEE,2007:157-161.