通过有效的深度强化学习，基于单眼相机的复杂障碍物避免

论文标题

通过有效的深度强化学习，基于单眼相机的复杂障碍物避免

Monocular Camera-based Complex Obstacle Avoidance via Efficient Deep Reinforcement Learning

论文作者

Ding, Jianchuan, Gao, Lingping, Liu, Wenxi, Piao, Haiyin, Pan, Jia, Du, Zhenjun, Yang, Xin, Yin, Baocai

论文摘要

深度强化学习在基于激光的碰撞避免效果方面取得了巨大的成功，因为激光器可以感知准确的深度信息，而无需太多冗余数据，当算法从模拟环境迁移到现实世界时，这可以保持算法的稳健性。但是，高成本激光设备不仅很难为大规模的机器人部署，而且还表现出对复杂障碍的鲁棒性，包括不规则的障碍，例如桌子，桌子，椅子和架子，以及复杂的地面和特殊材料。在本文中，我们提出了一个新型的基于单眼摄像头的复杂障碍避免框架。特别是，我们创新地将捕获的RGB图像转换为伪激光测量，以进行有效的深度强化学习。与在一定高度捕获的传统激光测量相比，仅包含远离附近障碍的一维距离信息，我们提出的伪激光测量融合了被捕获的RGB图像的深度和语义信息，这使我们的方法有效地有助于复杂的障碍物。我们还设计了一个功能提取指导模块，以加重输入伪激光测量，并且代理对当前状态具有更合理的关注，这有利于提高障碍避免政策的准确性和效率。

Deep reinforcement learning has achieved great success in laser-based collision avoidance works because the laser can sense accurate depth information without too much redundant data, which can maintain the robustness of the algorithm when it is migrated from the simulation environment to the real world. However, high-cost laser devices are not only difficult to deploy for a large scale of robots but also demonstrate unsatisfactory robustness towards the complex obstacles, including irregular obstacles, e.g., tables, chairs, and shelves, as well as complex ground and special materials. In this paper, we propose a novel monocular camera-based complex obstacle avoidance framework. Particularly, we innovatively transform the captured RGB images to pseudo-laser measurements for efficient deep reinforcement learning. Compared to the traditional laser measurement captured at a certain height that only contains one-dimensional distance information away from the neighboring obstacles, our proposed pseudo-laser measurement fuses the depth and semantic information of the captured RGB image, which makes our method effective for complex obstacles. We also design a feature extraction guidance module to weight the input pseudo-laser measurement, and the agent has more reasonable attention for the current state, which is conducive to improving the accuracy and efficiency of the obstacle avoidance policy.

下载PDF全文

下载文献需遵守相关版权规定

论文标题