负责人：在不确定性下学习针对计划的驾驶行为的关注

论文标题

负责人：在不确定性下学习针对计划的驾驶行为的关注

LEADER: Learning Attention over Driving Behaviors for Planning under Uncertainty

论文作者

Danesh, Mohamad H., Cai, Panpan, Hsu, David

论文摘要

人类行为的不确定性对拥挤的城市环境中的自动驾驶构成了重大挑战。部分可观察到的马尔可夫决策过程（POMDPS）为不确定性下的计划提供了一个原则性的框架，通常利用蒙特卡洛抽样来实现在线绩效进行复杂的任务。但是，抽样还通过潜在缺失关键事件引起了安全问题。为了解决这个问题，我们提出了一种新的算法，学习对驾驶行为（领导者）的关注，该算法学会在计划期间接受关键的人类行为。领导者学习了一个神经网络生成器，以实时情况向人类行为提供关注。它将注意力集成到信仰空间的计划者中，使用重要性抽样将推理偏向关键事件。为了训练该算法，我们让注意力生成器和计划者组成了最小游戏。通过解决Min-Max游戏，领导者学会了无需人类标签即可执行风险知识的计划。

Uncertainty on human behaviors poses a significant challenge to autonomous driving in crowded urban environments. The partially observable Markov decision processes (POMDPs) offer a principled framework for planning under uncertainty, often leveraging Monte Carlo sampling to achieve online performance for complex tasks. However, sampling also raises safety concerns by potentially missing critical events. To address this, we propose a new algorithm, LEarning Attention over Driving bEhavioRs (LEADER), that learns to attend to critical human behaviors during planning. LEADER learns a neural network generator to provide attention over human behaviors in real-time situations. It integrates the attention into a belief-space planner, using importance sampling to bias reasoning towards critical events. To train the algorithm, we let the attention generator and the planner form a min-max game. By solving the min-max game, LEADER learns to perform risk-aware planning without human labeling.

下载PDF全文

下载文献需遵守相关版权规定

论文标题