HAN Zhihao, WANG Yibing, ZHANG Yu, HAO Yongzhi. Automatic Ship Route Planning Based on Deep Reinforcement Learning[J]. Navigation of China, 2021, 44(1): 100-105.
    Citation: HAN Zhihao, WANG Yibing, ZHANG Yu, HAO Yongzhi. Automatic Ship Route Planning Based on Deep Reinforcement Learning[J]. Navigation of China, 2021, 44(1): 100-105.

    Automatic Ship Route Planning Based on Deep Reinforcement Learning

    • The DQN(Deep Q Network) algorithm is introduced into automatic ship route planning to improve the practicality of the output route proposal through learning from actual routes taken by experienced navigators. The algorithm consists of two two-layer neural networks, the actual neural network and the target neural network. The purpose of the arrangement is to avoid data dependence. The experience of the agent is stored in the experience replay buffer and referenced randomly to prevent local convergence. The algorithm works whether the chart in use is same as the one for network training or not.
    • loading

    Catalog

      Turn off MathJax
      Article Contents

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return