|
Temporal Difference based Tuning of Fuzzy Logic Controller through Reinforcement Learning to Control an Inverted PendulumKeywords: Reinforcement Learning , Q-learning , Inverted Pendulum , Fuzzy logic control , Temporal Difference Abstract: This paper presents a self-tuning method of fuzzy logic controllers. The consequence part of the fuzzy logic controller is self-tuned through the Q-learning algorithm of reinforcement learning. The off policy temporal difference algorithm is used for tuning which directly approximate the action value function which gives the maximum reward. In this way, the Q-learning algorithm is used for the continuous time environment. The approach considered is having the advantage of fuzzy logic controller in a way that it is robust under the environmental uncertainties and no expert knowledge is required to design the rule base of the fuzzy logic controller.
|