|
- 2017
基于MDP的战机对抗导弹措施优化方法
|
Abstract:
摘要 对战机对抗导弹的措施优化问题进行了研究,将该问题归结为不确定条件下的序贯决策问题,提出了一种基于Markov决策过程(MDP)的导弹对抗措施优化方法。首先,该方法根据作战过程将作战阶段分为不同的状态,将电子对抗措施(ECM)和战术机动作为可供战机选择的行动;然后,通过不同战术行动下状态之间的转移概率来反映行动的不确定性,以雷达搜索状态和导弹命中状态的期望值来表征不同策略下的飞机生存力;最后,通过建立Markov决策模型得到飞机生存力最大时各个状态对应的最佳行动。仿真结果表明:飞机生存力随着时间推移而逐渐降低;不同策略下飞机的生存力不同,基于MDP的策略可以有效提高飞机的生存力;单步优化措施不能提高战机的生存力,必须考虑长期状态转移的影响。
Abstract:Missile countermeasure optimization problem for fighter aircraft was studied. The missile countermeasure optimization was modeled as a sequential decision-making problem under uncertainty, and a Markov decision process (MDP) based approach was proposed. First, the engagement process was divided into several phases, and these phrases were marked by states. Electronic counter measures (ECM) and strategic maneuvers were treated as actions. Then, the state transition probability was used to reflect the uncertainty of each action, and average occupancy of "hit" state and "search" state was used to evaluate the aircraft survivability under different policies. Finally, the policy iteration algorithm was used to get the optimal policy, which maps the optimal action to be taken in each state. Simulation indicates that the aircraft survivability decreases as time goes on; the proposed MDP-based approach can effectively improve the aircraft survivability; one-step optimization is useless for the aircraft survivability and the influence of the state transition should be considered from the view of long-time horizon.
[1] | MARTINO A D.现代电子战系统导论[M].姜道安,等,译.北京:电子工业出版社,2014:5-18.MARTINO A D.Introduction to modern EW systems[M].JIANG D A,et al,translated.Beijing:Publishing House of Electronic Industry,2014:5-18(in Chinese). |
[2] | SIGAUD O,BUFFET O.Markov decision processes in artificial intelligence:MDPs,beyond MDPs and applications[M].New York:John Wiley & Sons,2010:1-63. |
[3] | 傅莉,谢福怀,孟光磊,等.基于滚动时域的无人机空战决策专家系统[J].北京航空航天大学学报,2015,41(11):1994-1999.FU L,XIE F H,MENG G L,et al.An UAV air-combat decision expert system based on receding horizon control[J].Journal of Beijing University of Aeronautics and Astronautics,2015,41(11):1994-1999(in Chinese). |
[4] | ERLANDSSON T,NIKLASSON L. An air-to-ground combat survivability model[J].Journal of Defense Modeling and Simulation, Application,Methodology,Technology,2013,12(3):1-15. |
[5] | RANDLEFF L R.Decision support system for fighter pilots[D].Kongens Lyngby:Technical University of Denmark,2007. |
[6] | VIRTANEN K,KARELAHTI J,RAIVIO T.Modeling air combat by a moving horizon influence diagram game[J].Journal of Guidance,and Dynamics,2006,29(5):1080-1091. |
[7] | ERLANDSSON T,NIKLASSON L.Automatic evaluation of air mission routes with respect to combat survival[J].Information Fusion,2014,20(1):88-98. |
[8] | HASAN S M,GUITOUNI A.Variable dwell time task scheduling for multifunction radar[J].IEEE Transactions on Automation Science and Engineering,2014,11(2):463-472. |
[9] | RUSSELL S J,NORVIG P.Artificial intelligence:A modern approach[M].3rd ed.Beijing:Tsinghua University Press,2011:645-684. |
[10] | VISNEVSKI N,KRISHNAMURTHY V,WANG A,et al.Syntactic modeling and signal processing of multifunction radars:A stochastic context-free grammar approach[J].Proceeding of IEEE,2007,95(5):1000-1025. |
[11] | ZARCHAN P.Tactical and strategic missile guidance[M].6th ed.Reston:AIAA,2012:33-58. |
[12] | VIRTANEN K,RAIVIO T.Decision theoretical approach to pilot simulation[J].Journal of Aircraft,1999,36(4):18-27. |
[13] | YOMCHINDA T.A study of autonomous evasive planar-maneuver against proportional-navigation guidance missiles for unmanned aircraft[C]//2015 Asian Conference on Defense Technology.Piscataway,NJ:IEEE Press,2015. |
[14] | BACHMANN D J,EVANS R J,MORAN B.Game theoretic analysis of adaptive radar jamming[J].IEEE Transactions on Aerospace and Electronic Systems,2011,47(2):1081-1100. |
[15] | BROOKS R R,PANG J E,GRIFFIN C.Game and information theory analysis of electronic countermeasures in pursuit-evasive games[J].IEEE Transactions on Systems,Man and Cybernetics- Part A:Systems and Humans,2008,38(6):1281-1294. |
[16] | 沈阳,陈永光,李修和.基于0-1规划的雷达干扰资源优化分配研究[J].兵工学报,2007,28(5):528-532.SHEN Y,CHEN Y G,LI X H.Research on optimal distribution of radar jamming resource based on zero-one programming[J].Acta Armamentarii,2007,28(5):528-532(in Chinese). |
[17] | MOORE F W.A methodology for missile countermeasures optimization under uncertainty[J].Evolutionary Computation,2002,10(2):129-149. |
[18] | BALL R E.The fundamentals of aircraft combat survivability analysis and design[M].2nd ed.Reston:AIAA,2003:445-602. |
[19] | 杨哲,李曙林,周莉.机载自卫压制干扰和箔条干扰下飞机生存力研究[J].北京理工大学学报,2013,33(4):375-379.YANG Z,LI S L,ZHOU L.Study of aircraft survivability under the conditions of self-defense jamming and chaff jamming[J].Transactions on Beijing Institute of Technology,2013,33(4):375-379(in Chinese). |
[20] | 宋海方,肖明清,吴华,等.不同机载电子干扰条件下的飞机敏感性模型[J].航空学报,2015,36(11):3630-3639.SONG H F,XIAO M Q,WU H,et al.Genetic model of aircraft susceptibility to different airborne electronic countermeasures[J].Acta Aeronautica et Astroautica Sinica,2015,36(11):3630-3639(in Chinese). |
[21] | 娄寿春.面空导弹武器系统分析[M].北京:国防工业出版社,2013:50-132.LOU S C.Surface-to-air missile weapon system analysis[M].Beijing:National Defense Industry Press,2013:50-132(in Chinese). |
[22] | BANDIERA F,FARINA A,ORLANDO D,et al.Detection algorithms to discriminate between radar targets and ECM signals[J].IEEE Transactions on Signal Processing,2010,58(12):5984-5993. |