Journal of Xidian University ›› 2019, Vol. 46 ›› Issue (1): 46-50.doi: 10.19665/j.issn1001-2400.2019.01.008

Previous Articles     Next Articles

Method for robot obstacle avoidance based on the improved dueling network

ZHOU Yi1,2,CHEN Bo1,2   

  1. 1. National Key Lab. of Radar Signal Processing, Xidian Univ., Xi’an 710071, China;
    2. Collaborative Innovation Center of Information Sensing and Understanding, Xidian Univ., Xi’an 710071, China;
  • Received:2018-04-17 Online:2019-02-20 Published:2019-03-05

Abstract:

In view of the disadvantages of traditional reinforcement learning methods in motion planning, especially the problem of robot obstacle avoidance, it is easy to have overestimation and difficult to adapt to complex environment. A new model based on deep reinforcement learning is proposed to improve the obstacle avoidance performance of robots. The model combines dueling networks with Q-learning which is the traditional reinforcement learning method, and using two independent trained dueling networks to deal with environmental data and predict the action value. In the output layer, the state value and the action advantage are output respectively, with both values combined as the final action value. The model can process high dimension data to adapt to complex and changeable environment, and output advantageous actions for robot selection to get a higher accumulative reward. It can effectively improve the obstacle avoidance performance of a robot.

Key words: robot obstacle avoidance, deep reinforcement learning, dueling networks, independent trained

CLC Number: 

  • TP242.6

Baidu
map